Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
NVIDIA

Deep Learning Performance Architect

Beijing, China

Are you passionate about exploring computer architectures for deep learning? Do you like to work at the intersection of hardware and software? NVIDIA is seeking world-class programmers and performance architects who love to squeeze out every cycle of performance from deep learning codes. In this role, you will write code that ships in our deep learning libraries, as well as guide the direction of our future GPU architectures. This position offers the opportunity to have real impact in a fast-moving, technology-focused company.

What you'll be doing:

  • Developing production-quality software that ships as part of NVIDIA's AI software stack.
  • Analyzing the performance of important workloads, tuning our current software, and proposing improvements for future software.
  • Working with cross-collaborative teams of deep learning software engineers and GPU architects to develop innovative solutions
  • Adapting to the constantly evolving AI industry by being agile and excited to contribute across the codebase, including API design, software architecture, testing, and GPU kernel development.

Want more jobs like this?

Get Software Engineering jobs delivered to your inbox every week.

Select a location
By signing up, you agree to our Terms of Service & Privacy Policy.

What we need to see:

  • MS or PhD in relevant discipline (CS, EE, Math) or equivalent experience
  • Strong background in computer architecture
  • Strong C/C++ programming and software design skill
  • GPU programming experience (CUDA) desired
  • LLVM/MLIR development experience a plus
  • Good communication and organizational skills

#deeplearning

Client-provided location(s): Beijing, China; Shanghai, China
Job ID: NVIDIA-JR1974459
Employment Type: Full Time