What you'd actually do

Lead the design and development of large-scale RL training frameworks to accelerate the development of multi-modal AV foundation models.

Design, build, and optimize simulation and data processing pipelines to enable scalable training of driving policies.

Focus on measuring and enhancing simulation quality and refining the reward function for RL training.

Ensure the reliability and performance of training workflows on large GPU clusters through the development of robust monitoring and debugging tools.

Partner with researchers to integrate state-of-the-art model architectures into efficient and scalable training pipelines.

Skills

Required

C++
Python
Reinforcement Learning (RL) algorithms
hyperparameter tuning
reward function design
large-scale GPU clusters
High-Performance Computing (HPC)
job scheduling/orchestration tools (e.g., Kubernetes, SLURM)

Nice to have

RL infrastructure
general LLM training/fine-tuning infrastructure
simulation & closed-loop evaluation of autonomous driving end-to-end models
large-scale data pipeline development
algorithm optimization

What the JD emphasized

Deep proficiency in RL algorithms, such as PPO and GRPO, including practical experience with hyperparameter tuning and reward function design.

Extensive experience with large-scale GPU clusters, High-Performance Computing (HPC) environments, and job scheduling/orchestration tools (e.g., Kubernetes, SLURM).

We are seeking exceptional Senior Machine Learning and Simulation Engineers to join NVIDIA's Autonomous Vehicles (AV) Simulation team! This role requires strong technical leadership and outstanding software engineering skills, coupled with deep expertise in both simulation and artificial intelligence, including deep learning, reinforcement learning, end-to-end driving and Physics AI models. The successful candidate will have a solid track record of productizing ML solutions for autonomous driving and simulation at scale.

This position centers on developing a Closed-Loop Simulation-based Reinforcement Learning (RL) framework in order to train advanced end-to-end AV models, such asAlpamayo R1. This position will design and improve the accuracy and performance of the RL framework and simulation, leveraging SOTA techs includingNuRec,Traffic Models, and Cosmos World Model. Success in this role requires close collaboration with the AV Platform, AV Product, and Research teams.

What you will be doing:

Lead the design and development of large-scale RL training frameworks to accelerate the development of multi-modal AV foundation models.
Design, build, and optimize simulation and data processing pipelines to enable scalable training of driving policies.
Focus on measuring and enhancing simulation quality and refining the reward function for RL training.
Ensure the reliability and performance of training workflows on large GPU clusters through the development of robust monitoring and debugging tools.
Partner with researchers to integrate state-of-the-art model architectures into efficient and scalable training pipelines.

What we need to see:

Bachelor's degree in Computer Science, Robotics, Engineering, or a related field (or equivalent experience).
12+ years of relevant professional experience encompassing large-scale ML training, AV systems, simulation, and AI infrastructure development.
Deep proficiency in RL algorithms, such as PPO and GRPO, including practical experience with hyperparameter tuning and reward function design.
Exceptional programming skills in C++ and Python, vital for developing efficient systems and data pipelines.
Extensive experience with large-scale GPU clusters, High-Performance Computing (HPC) environments, and job scheduling/orchestration tools (e.g., Kubernetes, SLURM).

Ways to stand out from the crowd:

Experience in RL infrastructure or general LLM training/fine-tuning infrastructure in industry.
Experience in simulation & closed-loop evaluation of autonomous driving end-to-end models.
Proven record on large-scale data pipeline development and algorithm optimization.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 224,000 USD - 356,500 USD for Level 5, and 272,000 USD - 431,250 USD for Level 6.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until April 19, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.