Senior Dl Compiler Engineer -cuda Tile

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +5 · Remote

NVIDIA is hiring a Senior DL Compiler Engineer for the CUDA Tile team. This role involves designing and implementing compiler transformations, developing MLIR-based dialects and lowering passes, and optimizing performance for tile-based kernels on NVIDIA GPUs. The CUDA Tile programming model is a new addition to CUDA, shipped with CUDA 13.1.

What you'd actually do

  1. design and implement compiler transformations
  2. develop MLIR-based dialects and lowering passes
  3. optimize the performance of tile-based kernels to ensure they execute efficiently across multiple generations of NVIDIA GPU architectures
  4. defining public APIs
  5. crafting and implementing compiler and optimization techniques, performance optimization, and other general software engineering work

Skills

Required

  • compiler optimization
  • performance analysis
  • IR design
  • C/C++ programming
  • software design
  • debugging
  • performance analysis
  • test design

Nice to have

  • CPU and/or GPU architecture knowledge
  • CUDA programming
  • OpenCL programming
  • MLIR
  • LLVM
  • XLA
  • TVM
  • deep learning models
  • deep learning algorithms

What the JD emphasized

  • compiler optimization
  • performance analysis
  • IR design
  • C/C++ programming
  • software design skills
  • debugging
  • performance analysis
  • test design

Other signals

  • CUDA Tile shipped with CUDA 13.1
  • new tile-based programming model for our GPUs
  • design and implement compiler transformations
  • develop MLIR-based dialects and lowering passes
  • optimize the performance of tile-based kernels