AI Developer Technology Engineer

NVIDIA NVIDIA · Semiconductors · France · Remote

NVIDIA is seeking a Senior Developer Technology Engineer, Artificial Intelligence to research and develop techniques for GPU acceleration of AI workloads, optimize complex AI and HPC algorithms on modern CPU and GPU architectures, and engage with the developer community. The role involves influencing next-generation hardware and software design.

What you'd actually do

  1. In this position, you will research and develop techniques to GPU accelerate workloads in deep learning, machine learning or other AI domains.
  2. Work directly with other technical experts in their fields (industry and academia) to perform in-depth analysis and optimization of complex AI and HPC algorithms to ensure optimal AI solutions on modern CPU and GPU architectures.
  3. Publish and/or present discovered optimization techniques in developer blogs or relevant conferences to engage and educate the developer community.
  4. Influence the design of next-generation hardware architectures, software, and programming models in collaboration with research, hardware, system software, libraries, and tools teams at NVIDIA.

Skills

Required

  • C/C++
  • algorithms
  • software development
  • parallel programming
  • CUDA
  • OpenACC
  • OpenMP
  • MPI
  • pthreads
  • low-level performance optimizations
  • CPU and GPU architecture fundamentals
  • communication skills
  • organization skills
  • problem solving
  • time management
  • prioritization skills

Nice to have

  • parallelization
  • performance optimization of Deep Learning models
  • Natural Language Processing
  • Computer Vision
  • Recommender Systems
  • linear algebra

What the JD emphasized

  • 5+ years of relevant experience in software development or research work
  • Hands on experience doing low-level performance optimizations
  • Expertise in parallelization and performance optimization of Deep Learning models arising from Natural Language Processing, Computer Vision, Recommender Systems, etc.

Other signals

  • GPU acceleration
  • performance optimization
  • parallel algorithms
  • deep learning
  • HPC