Senior Engineer - Deep Learning Compiler Verification and Infrastructure

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +4 · Remote

Senior Engineer focused on Deep Learning Compiler Verification and Infrastructure at NVIDIA. The role involves implementing compiler verification software and related infrastructure to accelerate deep learning workloads, working closely with compiler developers on functional and performance testing, and applying deep learning techniques to verification solutions.

What you'd actually do

  1. In this role you will work closely with compiler developers to verify new and state of the art deep learning related features and components including implementing and executing functional and performance testing and benchmarking software solutions.
  2. This would include authoring and reviewing verification plans, and implementing verification programs, scripts, and libraries.
  3. You will apply deep learning and other sophisticated techniques to implement compiler verification solutions.
  4. You will help identify potential or observed weaknesses in the current process, offer ideas for actions that can improve code coverage, and participate in quality initiatives and drive continuous improvement.

Skills

Required

  • BS or MS in Computer Science, Computer/Electrical Engineering, Mathematics or related field (or equivalent experience)
  • 3+ years programming experience in Machine Learning domain, preferably using Python
  • Experience working with Deep Learning frameworks such as Pytorch, Scikit Learn, JAX/XLA or TensorRT
  • Focused, learn quickly, and have strong analytical skills with attention to detail. Strong troubleshooting and debugging skills.
  • Proven uses of creative thinking for solutions to exciting problems that matter.

Nice to have

  • Experience with Large Language Models and application of deep learning to solve software engineering problems
  • Hands-on compiler development or verification experience
  • Knowledge of related programming languages and domains such as CUDA, Docker and GPU-Accelerated Cloud

What the JD emphasized

  • 3+ years programming experience in Machine Learning domain
  • Deep Learning frameworks
  • compiler development or verification experience

Other signals

  • compiler verification
  • deep learning workloads
  • infrastructure