Senior/Staff Software Engineer, ML Infrastructure, Optimization

Nuro · Robotics · CA · Software

Nuro is seeking a Senior/Staff Software Engineer to build and improve core ML infrastructure for autonomy teams. The role focuses on optimizing ML models using techniques like quantization and distillation, developing a model compiler framework (FTL), and deploying optimized models on self-driving robots. This involves working across the full stack of ML solutions, from pipelines to optimized inference.

What you'd actually do

  1. Optimize Nuro’s autonomy stack with cutting-edge techniques such as quantization, distillation, and model compression.
  2. Work with autonomy engineers to optimize, validate, and deploy large language models.
  3. Develop and maintain a world-class model compiler framework, [FTL](https://medium.com/nuro/ftl-model-compiler-framework-d6b85c670f67).
  4. Write robust, high-quality software to increase our confidence in our vehicle’s ability to navigate safely on-road.
  5. Collaborate closely with machine learning domain experts and engineers across behavior, perception and mapping to design and implement end-to-end learned ML solutions.

Skills

Required

  • 3+ years of relevant experience in ML optimization infrastructure
  • Experience with ML optimization techniques such as quantization and pruning, and ML compilers
  • Experience maintaining, profiling, and optimizing GPU ML compilers & runtimes
  • Proficient in Python
  • Working experience with C++ and CUDA
  • Working experience with deep learning frameworks (such as PyTorch, JAX, TensorFlow, Keras)

What the JD emphasized

  • ML optimization infrastructure
  • ML optimization techniques such as quantization and pruning
  • ML compilers
  • GPU ML compilers & runtimes

Other signals

  • ML infrastructure
  • model optimization
  • ML compilers
  • deploying optimized models