Cloud Platform Software Engineer – Platform Apis

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +2 · Remote

Software Engineer role focused on building and extending Kubernetes-like APIs and Go-based controllers for NVIDIA's DGX Cloud, powering GPU-accelerated infrastructure for AI teams. The role involves end-to-end ownership of multi-cloud GPU clusters, collaborating with various engineering teams, and automating software validation.

What you'd actually do

  1. Develop software systems to support large scale deployments of cloud infrastructure
  2. Design and develop APIs to support Infrastructure as Code (IaC) automation and deployment workflows.
  3. Responsible for contributing to multiple source code projects to fulfill NVIDIA requirements with software services
  4. Work and collaborate with engineering managers, architects, designers, and frontend engineers to deliver high quality software
  5. Automate the validation of software solutions with unit and integration tests

Skills

Required

  • BS in Computer Science, Information Systems, Computer Engineering or equivalent experience
  • 8+ years of proven experience in large scale software development
  • Experience building and shipping services on Kubernetes
  • Background with using and chipping in to open-source projects
  • Collaborated with teams to write software to support cloud services at scale
  • Programming experience in a relevant language, e.g. Golang, Python
  • Communicate design and quality strategy in written, visual, and oral formats
  • Experience with a wide range of modern infrastructure tools and technologies

Nice to have

  • Experience with Kubernetes Cluster API, Terraform, Tinkerbell, and other infrastructure tooling
  • Practical experience with Azure, GCP, or AWS
  • Capable of refactoring software to run in systems such as Kubernetes
  • Ability to discuss and work with CSI, CNI, and CRI and/or familiarity with the CNCF and the tooling across the ecosystem
  • Upstream contribution in open source projects

What the JD emphasized

  • 8+ years of proven experience in large scale software development
  • Experience building and shipping services on Kubernetes