Senior Life Cycle Engineer, Pytorch - Deep Learning

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

NVIDIA is seeking Senior Software Engineers to build and maintain systems for continuous integration, testing, and delivery of advanced software stacks, focusing on improving developer productivity for NVIDIA's AI software ecosystem and ensuring deep learning frameworks run optimally on NVIDIA hardware. This is a software engineering role focused on tools and infrastructure.

What you'd actually do

  1. Develop and maintain continuous development systems through various software verticals spanning from end-users, through layers of open-source software, down to NVIDIA's math libraries, compilers, and drivers.
  2. The team creates tools and infrastructure, debugs issues as they arise, and fixes problems across the entire stack.
  3. Together, we stabilize and deliver the entire NVIDIA software stack to the world at global scale.
  4. This is a development-focused role that drives software quality and reliability without on-call duties.

Skills

Required

  • C++
  • Python
  • scripting languages
  • continuous build and release management
  • Linux systems
  • cloud-hosted developer environments
  • CI/CD systems

Nice to have

  • software development experience with many contributing teams
  • shipping multi-cloud large-scale systems
  • supporting those systems in production
  • C++ build systems
  • dependency issues
  • symbol resolution
  • linking problems
  • communication
  • empathy
  • dependability
  • patience
  • persistence

What the JD emphasized

  • MS or PhD in Computer Science or Engineering (or equivalent experience)
  • 5+ years of experience building or supporting large software projects or equivalent
  • Proficiency in C++ and/or Python
  • Proven expertise with scripting languages
  • Experience in continuous build and release management for Linux systems
  • Development of cloud-hosted developer environments and CI/CD systems