LLM Reinforcement Learning Framework Engineer

NVIDIA NVIDIA · Semiconductors · Shanghai, China

NVIDIA is seeking an LLM Reinforcement Learning Framework Engineer to develop and deploy RL algorithms for LLM post-training, focusing on improving reasoning and alignment. The role involves integrating RL components into NVIDIA's LLM stack, crafting experiments, and ensuring production readiness. Requires strong Python, PyTorch, and practical RL experience with LLMs, along with familiarity in async/distributed orchestration.

What you'd actually do

  1. Developing and deploying reinforcement learning algorithms for LLM post‑training to improve reasoning and alignment.
  2. Integrating RL components into NVIDIA’s LLM training and serving stack with a cross‑functional team of engineers and researchers.
  3. Crafting and running experiments, evaluations, and debugging workflows to ensure robustness, scalability, and reliability in production.

Skills

Required

  • Python
  • PyTorch
  • Reinforcement Learning
  • LLMs
  • distributed training
  • asyncio
  • torch.distributed
  • Ray

Nice to have

  • NeMo RL
  • Megatron-LM
  • DeepSpeed
  • vLLM
  • TensorRT-LLM
  • GPU architecture
  • performance optimization

What the JD emphasized

  • production-quality PyTorch experience
  • modern LLM frameworks
  • reinforcement learning applied to LLMs
  • async and distributed orchestration

Other signals

  • reinforcement learning
  • LLM post-training
  • reasoning abilities
  • agentic AI