Senior Software Engineer - Autonomous Driving

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Senior Software Engineer role focused on optimizing and deploying deep neural networks for autonomous driving on NVIDIA automotive compute platforms. Responsibilities include leading architecture and technical strategy for inference workloads, driving performance analysis, and developing model optimization techniques. Requires extensive experience in systems software, AI/ML infrastructure, deep learning inference, and compiler/runtime technology.

What you'd actually do

  1. Lead architecture and technical strategy for optimizing inference workloads in autonomous driving applications.
  2. Drive end-to-end performance analysis across DNN models, TensorRT/compiler flows, CUDA kernels, memory behavior, scheduling, runtime services, and automotive platform constraints.
  3. Develop and guide model optimization techniques such as quantization, pruning, distillation, graph optimization, operator fusion, kernel selection, and layout/memory optimization.
  4. Collaborate with TensorRT, CUDA, compiler, silicon architecture, perception, planning, DriveOS and safety platform teams.
  5. Build tools, methodologies, and metrics for profiling, benchmarking, debugging, and validating model and platform performance.

Skills

Required

  • BS, MS, or PhD in Computer Science, Computer Engineering, Electrical Engineering, or related field (or equivalent experience).
  • 12+ years of software engineering experience in systems software, AI/ML infrastructure, deep learning inference, compiler/runtime technology, or platform performance.
  • Strong C/C++ and practical Python experience.
  • Deep familiarity with TensorRT, TensorRT-LLM, ONNX, PyTorch, CUDA, Triton, or related frameworks.
  • Experience optimizing DNN models for latency, throughput, memory footprint, and power.

Nice to have

  • Hands-on experience with TensorRT internals, CUDA kernels, Triton kernels, or other compiler/runtime technologies.
  • Experience deploying optimized DNNs, LLMs, VLMs, or perception models on embedded, edge, robotics, or automotive platforms.
  • Background in autonomous driving, ADAS, robotics, real-time systems, safety-aware software, or deterministic low-latency systems.
  • Experience with ISO 26262, QNX, Safe RTOS, DriveOS, Linux, hypervisors, or virtualization.

What the JD emphasized

  • optimizing and deployment of deep neural networks
  • optimizing inference workloads
  • end-to-end performance analysis
  • model optimization techniques
  • deploying optimized DNNs

Other signals

  • optimizing inference workloads
  • end-to-end performance analysis
  • model optimization techniques
  • deploying optimized DNNs