Senior Research Scientist, Nemotron Post-training

NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

Research Scientist/Engineer at NVIDIA focused on building Nemotron models, specifically working on post-training pipelines, synthetic data, agentic RL, data/training infrastructure, and large-scale model post-training. The role involves advancing open-source foundation models, developing training data, benchmarks, LLMs, and software, and solving end-to-end foundation model post-training challenges. Requires a Master's/PhD and 5+ years of experience in model post-training, RL, and agentic systems, with experience in data curation, model training, and inference/deployment environments.

What you'd actually do

You will be a core contributor to Nemotron model post-training, working at the intersection of four areas: 1) synthetic data and algorithmic research for agentic RL; 2) data and training infrastructure implementation; 3) collaboration on vendor data acquisition and experimentation; 4) large-scale research and production model post-training.
Advance open-source foundation models by developing training data, benchmarks, LLMs, and software (including [NeMo-RL](https://github.com/NVIDIA-NeMo/RL), Nemo-Gym, and yet-to-be-announced software).
Solve large-scale, end-to-end foundation model post-training challenges spanning the full model lifecycle, from initial orchestration and data pre-processing through model training and tuning to model deployment.
Publish and present your results at academic and industry conferences.

Skills

Required

Master's or PhD degree in computer science, machine learning, or another quantitative field (or equivalent experience)
5+ years of work or research experience in model mid-training / post-training, reinforcement learning, and agentic systems
Hands-on experience in data curation and model training for agentic and reasoning capabilities
In-depth experience using or developing inference and deployment environments such as vLLM, SGLang, or TRT-LLM

Nice to have

Industry experience in reinforcement learning for leading foundation models
Experience optimizing model quality using feedback from real-world traffic

Other signals

post-training pipelines
foundation models
open-source generative AI
agentic RL
large-scale research & production model post-training

Join NVIDIA and help build the Nemotron models that will define the foundation of open-source generative AI. We are looking for a research scientist / engineer who is passionate about open-source and excited to create our next-generation post-training pipelines. You will work at the intersection of research and engineering to invent, implement, and scale the core post-training technologies behind our Nemotron models.

What you’ll be doing:

You will be a core contributor to Nemotron model post-training, working at the intersection of four areas: 1) synthetic data and algorithmic research for agentic RL; 2) data and training infrastructure implementation; 3) collaboration on vendor data acquisition and experimentation; 4) large-scale research and production model post-training.
Advance open-source foundation models by developing training data, benchmarks, LLMs, and software (including NeMo-RL, Nemo-Gym, and yet-to-be-announced software).
Solve large-scale, end-to-end foundation model post-training challenges spanning the full model lifecycle, from initial orchestration and data pre-processing through model training and tuning to model deployment.
Publish and present your results at academic and industry conferences.

What we need to see:

Master's or PhD degree in computer science, machine learning, or another quantitative field (or equivalent experience).
5+ years of work or research experience in model mid-training / post-training, reinforcement learning, and agentic systems.
Hands-on experience in data curation and model training for agentic and reasoning capabilities.
In-depth experience using or developing inference and deployment environments such as vLLM, SGLang, or TRT-LLM.

Ways to stand out from the crowd:

Industry experience in reinforcement learning for leading foundation models.
Experience optimizing model quality using feedback from real-world traffic.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you’re passionate about leading breakthrough AI research and building exceptional teams that shape the future of computing, we want to hear from you.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 192,000 USD - 304,750 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until June 20, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.