Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Senior Deep Learning Software Engineer, Algorithmic Model Optimization

AT NVIDIA
NVIDIA

Senior Deep Learning Software Engineer, Algorithmic Model Optimization

Santa Clara, CA / Remote

We are now looking for a Senior Deep Learning Software Engineer, for Algorithmic Model Optimization!

Join our team of algorithmic model optimization experts and take part in unlocking the biggest potential for AI with generative models such as large language models (LLM) and diffusion models. As a Senior Deep Learning Software Engineer, you will be at the forefront of pushing the boundaries of these models and enabling their deployment at a larger scale with unmatched efficiency. We are developing an innovative software platform that will not only be utilized internally but also have a significant impact externally by enabling the creation of groundbreaking AI products. This is an exceptional opportunity for passionate software engineers like you, who have a strong background in Deep Learning, to join us in solving the most significant challenges in the field.

Want more jobs like this?

Get jobs delivered to your inbox every week.

Select a location
By signing up, you agree to our Terms of Service & Privacy Policy.


Your role will be pivotal in our mission to maximize the potential of our rapidly expanding data center deployments. Additionally, you will play a crucial part in adopting a data-driven approach to hardware design and system software development. Collaboration is at the heart of what we do, and you will have the chance to work closely with a diverse range of teams at NVIDIA, including the Applied Deep Learning Research teams, CUDA Kernel and DL Framework development teams, and the Silicon Architecture Team. In this position, you will actively engage with internal stakeholders, users, and members of the open-source community. Your input will be vital in defining and implementing cutting-edge model optimization algorithms. The scope of your work will encompass researching and developing highly efficient search algorithms, defining public APIs, implementation, and various other software engineering tasks. We are seeking individuals who are as enthusiastic as we are about pushing the boundaries of AI and contributing to groundbreaking advancements in the field. If you are passionate about innovation, tackling complex DL problems, and working in a collaborative environment, this is the perfect opportunity for you. Join us, and together, we will shape the future of AI model optimization and its impact on the world.

What you'll be doing:

  • Prototype and develop model optimization methods, and build a most impactful model optimization platform
  • Collaborate with internal and external partners to accelerate the adoption of deep learning model optimization
  • Stay up to date with the latest research and innovations in generative AI and model optimization techniques
  • Analyze and optimize the theoretical and practical performance of DL models generated
  • Publish findings in top AI conferences, and create Intellectual Property

What we need to see:

  • Masters, PhD, or equivalent experience in Computer Science, AI, Applied Math, or related field.
  • 10+ years of relevant work or research experience in Deep Learning.
  • Excellent software design skills, including debugging, performance analysis, and test design
  • Strong algorithms and programming fundamentals
  • Ability to work independently, define project goals and scope, and run your own development effort
  • Good communication, documentation habits, and interpersonal skills
  • Experience with one or more: Python, C++, performance tuning

Ways to stand out from the crowd:

  • Contributions to PyTorch, JAX, or other Machine Learning Frameworks
  • Knowledge of GPU architecture and compilation stack, and capability of understanding and debugging end-to-end performance
  • Familiarity with Nvidia's deep learning SDK such as TensorRT
  • Strong understanding of deep learning algorithms and solutions
  • Strong understanding of ML model optimization techniques such as quantization, pruning, distillation.

Increasingly known as "the AI computing company" and widely considered to be one of the technology world's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Are you creative, motivated, and love a challenge? If so, we want to hear from you! Come, join our model optimization group, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field.

The base salary range is 220,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Client-provided location(s): Santa Clara, CA, USA
Job ID: NVIDIA-JR1969684
Employment Type: Full Time