Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Deep Learning Performance Architect

AT NVIDIA
NVIDIA

Deep Learning Performance Architect

Beijing, China

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning system performance architect to join our AI performance modelling and analysis efforts. In this position, you will have a chance to work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads. You will make your contributions to our dynamic technology focused company.

What you'll be doing:

  • Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities to influence SW and Architecture team for NVIDIA's current and next gen inference products
  • Develop analytical models for the state of the art deep learning networks and algorithm to innovate processor and system architectures design for performance and efficiency.
  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations.
  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

Want more jobs like this?

Get jobs delivered to your inbox every week.

Select a location
By signing up, you agree to our Terms of Service & Privacy Policy.

What we need to see:

  • BS, MS or PhD in relevant discipline (CS, EE, Math, etc.) or equivalent experience.
  • 5+ years work experience.
  • Experience with popular AI models (e.g., LLM and AIGC models)
  • Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow/TensorRT)
  • Knowledge and experience on hardware architectures for deep learning applications

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you! NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning

Client-provided location(s): Beijing, China; Shanghai, China
Job ID: NVIDIA-JR1992422
Employment Type: Full Time