Principal Engineer, Data Center Power S… at NVIDIA

What you'd actually do

Gathering use cases + requirements, translating those into software roadmaps, and executing those roadmaps across internal NVIDIA teams and external partners.

Reporting project status, risks, help needed, and roadmap pivots to internal and external executives via status reports and in-person meetings.

Brokering technical discussions between highly technical subject matter experts

Leveraging AI tools and workflows to quickly iterate on designs, prototypes, documentation, tests, and code.

Architecting distributed, robust, and scalable GoLang and Rust system software, deployed to monitor and manage large datacenters

Skills

Required

BS or higher in Computer Science or equivalent experience
15+ years of meaningful industry experience with a strong scalable system software development background
Experience with APIs and interface design
Experience with AI tools and development workflows
Outstanding written and verbal interpersonal skills
Business level English
Strong motivation and commitment to learn new skills
Ability to manage time in a fast, heavily multitasked environment
Development experience with Rust, Python, and/or GoLang
Development experience with distributed systems and concurrent applications, especially in a Kubernetes environment
Ability to quickly understand unfamiliar technical domains, identify core problems, and translate ambiguous requirements into actional engineering plans.
Skilled at producing clear technical documentation, design docs, and status updates that keep cross-functional partners aligned.

Nice to have

Development experience in relevant coding languages like GoLang and Rust
Experience with SCADA or Data Center power related software
Background with containers (e.g. Docker, OCI), orchestration frameworks, and logging/telemetry backends with Kubernetes monitoring stacks with tools such as Prometheus, Loki and Grafana
Experience with modern UI development in React and Node.js or similar frameworks
Experience developing Kubernetes operators or Helm charts
Experience with HPC job schedulers like Slurm or Run.AI
Familiarity with Kubernetes internals
Exposure to GPU programming with CUDA

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for phenomenal people like you to help us accelerate the next wave of artificial intelligence.

What you'll be doing:

Gathering use cases + requirements, translating those into software roadmaps, and executing those roadmaps across internal NVIDIA teams and external partners.
Reporting project status, risks, help needed, and roadmap pivots to internal and external executives via status reports and in-person meetings.
Brokering technical discussions between highly technical subject matter experts
Leveraging AI tools and workflows to quickly iterate on designs, prototypes, documentation, tests, and code.
Architecting distributed, robust, and scalable GoLang and Rust system software, deployed to monitor and manage large datacenters

What we need to see:

BS or higher in Computer Science or equivalent experience. 15+ years of meaningful industry experience with a strong scalable system software development background
Experience with APIs and interface design. Experience with AI tools and development workflows
Outstanding written and verbal interpersonal skills. Business level English. Strong motivation and commitment to learn new skills
Ability to manage time in a fast, heavily multitasked environment. Development experience with Rust, Python, and/or GoLang.
Development experience with distributed systems and concurrent applications, especially in a Kubernetes environment
Ability to quickly understand unfamiliar technical domains, identify core problems, and translate ambiguous requirements into actional engineering plans.
Skilled at producing clear technical documentation, design docs, and status updates that keep cross-functional partners aligned.
Track record of identifying process inefficiencies and introducing automation, tooling, or AI-power workflows that measurably improve team out.

Ways to stand out from the crowd:

Development experience in relevant coding languages like GoLang and Rust. Experience with SCADA or Data Center power related software
Background with containers (e.g. Docker, OCI), orchestration frameworks, and logging/telemetry backends with Kubernetes monitoring stacks with tools such as Prometheus, Loki and Grafana
Experience with modern UI development in React and Node.js or similar frameworks. Experience developing Kubernetes operators or Helm charts. Experience with HPC job schedulers like Slurm or Run.AI Familiarity with Kubernetes internals. Exposure to GPU programming with CUDA.

NVIDIA is often recognized as one of the technology industry's most esteemed employers. We have some of the brightest and most driven individuals in the world working with us. If you are a self-motivated and imaginative individual, we encourage you to apply!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 8, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

What we need to see:

BS or higher in Computer Science or equivalent experience. 15+ years of meaningful industry experience with a strong scalable system software development background

Experience with APIs and interface design. Experience with AI tools and development workflows

Outstanding written and verbal interpersonal skills. Business level English. Strong motivation and commitment to learn new skills

Ability to manage time in a fast, heavily multitasked environment. Development experience with Rust, Python, and/or GoLang.

Development experience with distributed systems and concurrent applications, especially in a Kubernetes environment

Ability to quickly understand unfamiliar technical domains, identify core problems, and translate ambiguous requirements into actional engineering plans.

Skilled at producing clear technical documentation, design docs, and status updates that keep cross-functional partners aligned.

Track record of identifying process inefficiencies and introducing automation, tooling, or AI-power workflows that measurably improve team out.

Ways to stand out from the crowd:

Development experience in relevant coding languages like GoLang and Rust. Experience with SCADA or Data Center power related software

Background with containers (e.g. Docker, OCI), orchestration frameworks, and logging/telemetry backends with Kubernetes monitoring stacks with tools such as Prometheus, Loki and Grafana

Experience with modern UI development in React and Node.js or similar frameworks. Experience developing Kubernetes operators or Helm charts. Experience with HPC job schedulers like Slurm or Run.AI Familiarity with Kubernetes internals. Exposure to GPU programming with CUDA.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 272,000 USD - 431,250 USD.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until May 8, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

Principal Engineer, Data Center Power Software

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

What we need to see:

Ways to stand out from the crowd:

What we need to see:

Ways to stand out from the crowd: