Senior Deep Learning Engineer

NVIDIA · Semiconductors · United Kingdom +4 · Remote

NVIDIA is hiring a Senior Deep Learning Engineer to optimize and deploy foundation models for physical AI applications (autonomous vehicles, robots, video analytics) on GPU platforms, with a focus on high-performance inference.

What you'd actually do

  1. Improve inference speed for Cosmos world foundation models (WFMs) on GPU platforms.
  2. Deploy Cosmos WFMs to production.
  3. Profile and analyze deep learning workloads to identify and remove bottlenecks.

Skills

Required

  • Python
  • PyTorch
  • inference optimization techniques
  • quantization
  • TensorRT
  • TensorRT-LLM
  • vLLM
  • SGLang

Nice to have

  • Docker
  • Triton Inference Server
  • CUDA programming
  • diffusion models
  • GPU workload performance analysis
  • training performance tuning

What the JD emphasized

  • production-grade systems
  • inference optimization techniques
  • production settings

Other signals

  • optimize and deploy models for high-performance inference
  • production-grade systems
  • physical AI
  • autonomous vehicles
  • robots
  • video analytics AI agents