Software Engineer, Cudnn - Deep Learning

NVIDIA · Semiconductors · Shanghai, China

Software Engineer role focused on developing and optimizing cuDNN, a GPU-accelerated library for deep neural networks, including LLM support. The role involves performance analysis, tuning, and collaboration with cross-functional teams to innovate across various AI applications.

What you'd actually do

  1. Develop production-quality software that ships as part of NVIDIA's AI software stack, including optimized large language model (LLM) support.
  2. Analyze the performance of important workloads, tuning our current software, and proposing improvements for future software.
  3. Work with cross-collaborative teams of deep learning software engineers and GPU architects to innovate across applications like generative AI, autonomous driving, computer vision, and recommender systems.
  4. Adapt to the constantly evolving AI industry by being agile and excited to contribute across the codebase, including API design, software architecture, performance modeling, testing, and GPU kernel development.

Skills

Required

  • C/C++ development
  • CUDA development
  • Python
  • linear algebra
  • machine learning trends
  • software architecture design
  • algorithms and data structures
  • performance analysis
  • profiling
  • code optimization

Nice to have

  • GPU programming and optimization expertise (e.g. CUDA or OpenCL)
  • practical experience with machine learning, especially deep learning
  • computer architecture and building performance models for CPUs, GPUs, or other accelerators
  • MLIR development and compiler optimization

What the JD emphasized

  • production-quality software
  • performance analysis
  • code optimization
  • GPU programming and optimization expertise

Other signals

  • shipping production software
  • performance tuning
  • GPU optimization