Principal AI Developer Technology Engineer

NVIDIA · Semiconductors · Munich, Germany +4

Seeking a Principal Developer Technology Engineer to research and develop techniques for GPU acceleration of AI workloads, focusing on performance optimization of deep learning and HPC algorithms on modern CPU and GPU architectures. This role involves collaborating with internal teams and the developer community, influencing hardware/software design, and publishing findings.

What you'd actually do

  1. Research and develop techniques to GPU-accelerate workloads in deep learning, machine learning, or other AI domains.
  2. Work directly with technical experts in industry and academia on in-depth analysis and optimization of complex AI and HPC algorithms, so that AI solutions perform optimally on modern CPU and GPU architectures.
  3. Publish and/or present discovered optimization techniques in developer blogs or relevant conferences to engage and educate the developer community.
  4. Influence the design of next-generation hardware architectures, software, and programming models in collaboration with research, hardware, system software, libraries, and tools teams at NVIDIA.

Skills

Required

  • C/C++
  • algorithms
  • software development
  • parallel programming (CUDA, OpenACC, OpenMP, MPI, pthreads, etc.)
  • low-level performance optimizations
  • CPU and GPU architecture fundamentals
  • communication skills
  • organization skills
  • logical approach to problem solving
  • time management
  • prioritization skills

Nice to have

  • parallelization and performance optimization of Deep Learning models (NLP, Computer Vision, Recommender Systems)
  • linear algebra

What the JD emphasized

  • 15+ years of relevant experience
  • low-level performance optimizations
  • CPU and GPU architecture fundamentals

Other signals

  • GPU acceleration
  • performance optimization
  • parallel algorithms
  • deep learning
  • HPC