Senior Deep Learning Algorithm Engineer

NVIDIA · Semiconductors · Ho Chi Minh City, Vietnam +1 · Remote

Senior Deep Learning Algorithm Engineer at NVIDIA focused on optimizing deep learning training and inference workloads on state-of-the-art hardware and software platforms. The role involves performance analysis, profiling, and implementation of production-quality software, with a focus on squeezing performance from hardware and software stacks.

What you'd actually do

  1. Understand, analyze, profile, and optimize deep learning training and inference workloads on state-of-the-art hardware and software platforms.
  2. Collaborate with researchers and engineers across NVIDIA, providing guidance on improving the performance of workloads.
  3. Implement production-quality software across NVIDIA's deep learning platform stack.
  4. Build tools to automate workload analysis, workload optimization, and other critical workflows.

Skills

Required

  • 5+ years of experience
  • MSc or PhD in CS, EE or CSEE or equivalent experience
  • Strong background in deep learning and neural networks, both training & inference
  • Deep understanding of computer architecture
  • fundamentals of GPU architecture
  • Proven experience analyzing, modeling and tuning application performance
  • Programming skills in C++ and Python

Nice to have

  • Experience with modern LLM inference frameworks (TRT-LLM, vLLM, Ollama, etc.)
  • Strong fundamentals in algorithms
  • Experience with production deployment of Deep Learning models
  • Proven experience with processor and system-level performance modelling
  • GPU programming experience (CUDA or OpenCL)

What the JD emphasized

  • performance analysis and optimization
  • Deep Learning training
  • inference workloads
  • hardware/software stack
  • GPU architecture
  • peak performance
  • training speeds

Other signals

  • performance optimization
  • deep learning training
  • inference workloads
  • hardware/software stack
  • GPU architecture