Senior Applied Scientist - Sovereign AI

NVIDIA NVIDIA · Semiconductors · Bangalore, India +2

Senior Applied Scientist/AI Engineer at NVIDIA focusing on Sovereign AI efforts. The role involves end-to-end model training (pre-training, CPT, SFT, alignment), rigorous evaluation and benchmarking, and inference optimization using tools like TensorRT-LLM and NIM. Requires strong Python, PyTorch, and experience with large-scale ML frameworks.

What you'd actually do

  1. Lead complex training experiments (Pre-training, CPT, SFT, and Alignment). You will design architecture ablations and upstream your optimized recipes into the Sovereign AI Playbook and NeMo core libraries.
  2. Deep dive into model evaluation strategies. You will design custom benchmarks, conduct eval-driven ablation experiments, and ensure models meet strict Sovereign AI quality and safety bars.
  3. Drive end-to-end modeling efficiency. You will operationalize advanced compression (Quantization, Distillation) and leverage inference engines (TensorRT-LLM, NIM) to ensure models are fast, cheap, and deployable.
  4. Act as a highly autonomous technical leader with a bias for action. You will take ambiguous problems across the entire LLM lifecycle and methodically drive them to shipped, reproducible solutions at the speed of light.

Skills

Required

  • Python
  • PyTorch
  • Transformers
  • Scaling laws
  • Training dynamics
  • Megatron-LM
  • NeMo
  • TensorRT-LLM
  • vLLM
  • Triton

Nice to have

  • Masters or PhD in Computer Science, Machine Learning, or related field
  • Full-Stack ML Impact
  • NVIDIA Stack Power User
  • Open Source Contributions

What the JD emphasized

  • End-to-End Model Training
  • Rigorous Evaluation & Benchmarking
  • Efficiency & Inference Optimization
  • Full-Stack ML Impact
  • Masters or PhD in Computer Science, Machine Learning, or related field (or equivalent experience)
  • 8+ years of technical experience, with 4+ years deeply focused on the LLM/Deep Learning lifecycle
  • Expert-level Python and PyTorch
  • Hands-on experience modifying large-scale frameworks like Megatron-LM, NeMo, TensorRT-LLM, vLLM, or Triton

Other signals

  • End-to-End Model Training
  • Rigorous Evaluation & Benchmarking
  • Efficiency & Inference Optimization
  • Full-Stack ML Impact