Skip to main contentA logo with &quat;the muse&quat; in dark blue text.
NVIDIA

Deep Learning Performance Architect Intern

Shanghai, China

NVIDIA is developing processor and system architectures that accelerate various deep learning applications. We are looking for an expert deep learning system performance architect to join our AI performance projection and analysis efforts. In this position, you will have a chance to work on performance projection, analysis, and optimization on state-of-the-art hardware architectures for various AI workloads. You will make your contributions to our dynamic technology focused company.

What you'll be doing:

  • Analyze state-of-the-art AI models on various GPU hardware platforms (e.g., Client (Desktop/Laptop) platforms and SoCs)
  • Identify performance bottlenecks and propose optimizations
  • Performance analysis of DL workloads (e.g., LLM)

Want more jobs like this?

Get jobs in Shanghai, China delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.

What we need to see:

  • BS, MS or PhD students in relevant discipline (CS/EE/Math etc.,)
  • Experience with popular AI models (e.g., LLM and AIGC models)
  • Be familiar with typical deep learning SW framework (e.g., Torch/JAX/TensorFlow)
  • Knowledge and experience on hardware architectures for deep learning applications

#deeplearning

Client-provided location(s): Shanghai, China
Job ID: NVIDIA-JR1982254
Employment Type: Intern