Director, Software Engineering - NIM Factory

AT NVIDIA

Santa Clara, CA

Are you ready to usher in the new world of Artificial Intelligence? Do you want to build the rockets launching the AI revolution? We are seeking a Director of Software Engineering to build a GPU-accelerated software platform for inference applications. The right candidate for this role brings a mix of humanity and technical talent to provide the drive and creative direction for the way NVIDIA optimizes, serves, and measures the performance of AI models. Your team will build optimized, scalable AI services and software products engineered for fast software delivery with the most recent research breakthroughs. Our work is a collaboration with internal and external partners who are developing innovative AI models and accelerating them to provide the highest-performing inference on the market. NVIDIA NIMs are easy to use and designed for all deployment scenarios: in the cloud, on self-hosted infrastructure, and locally on all NVIDIA GPUs.

You are your team's heart, its strategist, and its north star. You must have a passion for engineering fundamentals. Your team is building software products and cloud services with sound engineering principles and innovative software designs. You believe in teamwork and collaboration, and you love growing a strong, diverse team with a culture of learning.

NVIDIA is building a new category of products by combining deep learning innovation and accelerated computing with industry-leading generative AI techniques at scale. You will harness groundbreaking GPU technologies and build highly efficient automation and software processes for NVIDIA's NIM Factory. Together with our partners, your team will innovate across model ingestion, optimization, and predictable testing methods, all the way through deployment. You will collaborate on sophisticated techniques with other NVIDIA technologists. Your team's work strives to accelerate the delivery of every AI model on GPUs anywhere. You will continuously learn about the latest acceleration techniques in AI inference from top talent in the industry.

What you'll be doing:

  • Direct the design and implementation of an industry-leading software platform built on top of heterogeneous hardware and cloud infrastructure.
  • Optimize the internal software operations across all of NVIDIA's products using the latest generative AI techniques.
  • Manage the deployment of large-scale GPU systems to support a reliable and efficient enterprise-grade operational platform.
  • Collaborate with diverse teams within and outside of NVIDIA to drive innovation.
  • Improve development practices and strategies across the organization.
  • Remain involved and gain a deep understanding of the technology, demonstrating your knowledge in cloud and software platforms.
  • Drive improvements in performance by implementing engineering practices that prioritize measuring and addressing bottlenecks.
  • Foster continuous feedback loops to ensure flawless optimization of AI models and NIMs.

What we need to see:

  • Proven authority in engineering leadership with a Master's degree or equivalent experience in Computer Science, Computer Engineering, or a related field.
  • Extensive experience developing software-as-a-service platforms, infrastructure, automation, and microservices.
  • Demonstrated success in designing high-performance, cloud-scale APIs and distributed software platforms.
  • Exceptional engineering fundamentals with the ability to navigate complicated and ambiguous problems.
  • History of teamwork across interpersonal matrix operations, setting up strategy, and collaborating with different levels of the organization.
  • 15+ years of hands-on experience delivering software platforms and services.
  • 5+ years of proven skills in growing individuals in their careers and developing teams.

Ways to stand out from the crowd:

  • Curiosity and drive to tackle problems with a diverse range of abilities in various programming languages.
  • Strong track record of crafting well-designed solutions and delivering high-quality software on time.
  • Previous work on accelerating software across the inference stack.
  • Hands-on experience with NVIDIA accelerated libraries, tools, and GPU technologies.

We are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and talented people in the world working for us. If you're creative and autonomous with a real passion for technology, we want to hear from you.

#LI-Hybrid

The base salary range is 308,000 USD - 471,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Client-provided location(s): Santa Clara, CA, USA
Job ID: NVIDIA-JR1988731
Employment Type: Full Time