NVIDIA is seeking an outstanding Solutions Architect to lead development of LLM for Agentic AI to join our fast-growing Generative AI team, who are enabling a global network of Professional Services partners on NVIDIA's full-stack accelerated computing platforms. Our team is dedicated to applying next-generation technologies to solve customer problems. We are looking for an ambitious and forward-thinking engineer to contribute to the develop of AI applications and solving real world problems for enterprise customers using the latest Generative AI models and research, including NLP, RAG, distributed computing and large-scale system design. In this role, you will be a lead AI developer and trusted technical expert on the latest Generative AI frameworks and LLM family of products and work closely with partners and customers to build scalable industry-specific enterprise AI solutions including project scoping to POC to production.
Want more jobs like this?
Get jobs in Santa Clara, CA delivered to your inbox every week.
As part of Generative AI enablement team, you'll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world by applying accelerated computing AI and solve category defining systems and AI/ML solutions.
What you will be doing:
- Building agentic LLM applications and exploring the latest advancements in model training, fine-tuning and customization.
- Enabling NVIDIA strategic service delivery partners to build enterprise AI solutions using accelerated computing stack including NIMs and NeMo microserviecs.
- Collaborate with developers and onboard them to NVIDIA AI platforms and services by providing deep technical guidance.
- Anticipate customer and partners needs and find enablement opportunities to expand adoption and utilization of NVIDIA Generative AI products and platforms.
- Establishing and building repeatable reference architecture, communicate standard processes and understand solution trade-offs. Share findings and feedback to improve products and services.
What we need to see:
- MSc, PhD in Computer Science, Electrical Engineering, Software Engineer, ML Engineer, or related fields (or equivalent experience).
- 5+ years of relevant work experience in developing and deploying AI models at scale as a Software Engineer or deep learning engineer.
- Proven track record of building enterprise-grade RAG based systems using open-source models and orchestration frameworks with strong foundation in deep learning, with a particular emphasis on generative models.
- Proficiency in Python, C++ programming and Deep Learning frameworks,
- Excellent communication and presentation skills to effectively collaborate with both internal and external customers.
Ways to stand out from the crowd:
- Demonstrate expertise and hands-on experience with NVIDIA AI platforms. Some products of interest include natural language processing and Large Language Models (NVIDIA NeMo) and inference at scale (NIMs).
- Excellent practical knowledge of Generative AI and LLM development. Ability to train GPT and Megatron Models.
- Understanding of MLOps life cycle management and experience with LLMOps workflows.
- Experience with CUDA programming and benchmarking and analyzing performance AI Agentic systems.
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.