Senior Developer Technology Engineer, Public Sector

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +2 · Remote

Senior Developer Technology Engineer for the Public Sector team at NVIDIA. This role involves researching and developing techniques to GPU-accelerate leading applications in the federal ecosystem, including AI in HPC. The engineer will perform in-depth analysis and optimization for current and next-generation GPU architectures, collaborating with application developers and NVIDIA teams.

What you'd actually do

  1. Working directly with key application developers to understand the current and future problems they are solving, crafting and optimizing core parallel algorithms and data structures to provide the best solutions using GPUs, through reference code development, direct contribution to the full software stack including libraries, applications.
  2. Collaborating closely with diverse groups at NVIDIA such as the architecture, research, libraries, tools, and system software teams to influence the design of architectures, software, and programming models, by investigating the impact on application performance and developer productivity.
  3. Occasional travel from time to time for conferences and on-site visits with developers.

Skills

Required

  • MS or PhD degree or equivalent experience in Computer Science, Engineering or STEM field
  • Programming fluency in C/C++
  • Deep understanding of software design, programming techniques, and algorithms
  • 5+ years of relevant work experience with parallel programming, ideally CUDA C/C++, OpenMP or MPI, or SHMEM (OpenSHMEM or NVSHMEM)
  • Strong computer science fundamentals, ideally including parallel data structures and algorithms, combinatorics, and sparse representations.
  • Passion for optimizing codes to run exceptionally fast through parallel programming.

Nice to have

  • Experience optimizing complex codes, especially for GPUs, including kernel optimization and strong understanding of how software runs on hardware.
  • Domain expertise in any of the following: electronic design automation, high-performance computing, computational fluid dynamics, data and graph analytics, data-science, network analysis, machine learning, or deep learning.
  • Experience in profiling and optimizing applications and frameworks including experience with Nsight Systems and Nsight Compute.
  • Experience in developing or optimizing workflows involving HPC and AI models.
  • algorithm and architecture codesign

What the JD emphasized

  • GPU-accelerate leading applications
  • AI in HPC
  • optimize applications and frameworks
  • Experience optimizing complex codes, especially for GPUs, including kernel optimization and strong understanding of how software runs on hardware.
  • Experience in profiling and optimizing applications and frameworks including experience with Nsight Systems and Nsight Compute.
  • Experience in developing or optimizing workflows involving HPC and AI models.

Other signals

  • GPU-accelerate leading applications
  • AI in HPC
  • optimize applications and frameworks