Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Senior Deep Learning Engineer - Performance Benchmarking

AT NVIDIA
NVIDIA

Senior Deep Learning Engineer - Performance Benchmarking

Warsaw, Poland / Remote

We are seeking engineers with a passion for performance analysis and optimization to join our team in advancing novel technologies such as deep learning compilers and low precision training. You will work alongside world-class engineers on optimizing end-to-end performance of NVIDIA's deep learning software ecosystem that is powering the AI revolution. You will have the chance to work on powerful, enterprise-grade GPU clusters delivering hundreds of PetaFLOPs, and gain access to unreleased hardware that will be shaping the future of AI.

What you'll be doing:

  • Profile, analyze, and optimize the performance of deep learning workloads on ground breaking hardware and software platforms.
  • Develop tooling for profiling and microbenchmarking of DL workloads running compiled models uncovering optimization opportunities.
  • Collaborate with teams across NVIDIA to provide performance insights and recommendations that improve the design and efficiency of DL frameworks and workloads.
  • Own the development and implementation of standard methodologies for compiling, testing, and deploying high-performance deep learning models.
  • Conduct performance benchmarking on enterprise-grade GPU clusters and pre-release hardware, driving improvements to NVIDIA's DL software stack and hardware roadmap.

Want more jobs like this?

Get jobs delivered to your inbox every week.

Select a location
By signing up, you agree to our Terms of Service & Privacy Policy.

What we need to see:

  • 5+ years of experience in deep learning model implementation, software development, and performance optimization.
  • BSc, MS, or PhD in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, Physics, or a related technical field, or equivalent practical experience.
  • Proficiency in Python, with extensive hands-on experience using at least one major deep learning framework (e.g., PyTorch, TensorFlow, JAX).
  • Strong problem-solving and analytical skills, with a proven record in debugging, performance tuning, and workload optimization.

Ways to stand out from the crowd:

  • Experience with compilers (e.g., PyTorch's torch.compile, XLA, or other similar technologies).
  • Experience with dashboarding and metric tracking systems.
  • Experience with running large-scale workloads in HPC clusters.
  • Knowledge and passion for DevOps/MLOps practices for Deep Learning-based product's development.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most hard-working and forward-thinking people in the world working for us. If you're creative and autonomous, we want to hear from you! We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

#deeplearning

Client-provided location(s): Warsaw, Poland
Job ID: NVIDIA-JR1990202
Employment Type: Full Time