We are looking for Research Scientists passionate about large generative models!
NVIDIA is searching for researchers in deep learning and natural language processing (NLP) to join our Conversational AI research team. Our team is pushing the boundaries of generative AI by building state-of-the-art large language models (LLM). We work on new neural architectures and new algorithms to accelerate LLM inference. If you are passionate about the latest research and technologies revolutionizing generative AI and want to explore new paradigms for applied foundation models , our team will be a great fit for you. After building prototypes that demonstrate the promise of your research, you will collaborate with product teams to apply your ideas into industry-leading real-world applications.
Want more jobs like this?
Get jobs in Tel Aviv, Israel delivered to your inbox every week.
What you will be doing:
- Work on new deep learning algorithms and techniques for efficient LLM inference
- Develop new architectures for advanced large language models
- Adapt foundation AI models to downstream tasks such as math and code reasoning
- Contribute these new models to Nemo framework
- Work closely with product and hardware architecture teams to integrate your research and developments into products
What we need to see:
- M.Sc. or PhD in Computer Science/ Electrical Engineering
- 5+ years of machine learning / deep learning research experience
- Solid knowledge of application areas such as natural language processing or speech processing
- Excellent programming skills and rapid prototyping in Python
- Expertise with PyTorch.deep learning framework
- A track record of research excellence demonstrated in publications at leading conferences and journals.
Ways to stand out from the crowd:
- Strong C++ programming skills
- Systems software engineering knowledge and expertise in optimizing software for computational performance
- Contribution to open source software projects