Senior Manager, Datacenter Software - Firmware Release

NVIDIA · Semiconductors · Santa Clara, CA +2 · Remote

NVIDIA is seeking a Senior Manager for Datacenter Software Firmware Release to lead the release of software and firmware for GPU-based datacenter servers. This role involves defining release scope, influencing architecture, partnering with cross-functional teams, owning software/firmware ingestion and packaging, and driving process improvements. The ideal candidate will have extensive experience in system software/firmware development, release management, CI/CD, and Python/Linux, with a proven track record of technical leadership and delivering high-quality releases.

What you'd actually do

  1. Provide technical leadership on how releases are delivered to end customers of rack-scale computing systems built on tightly coupled compute and switch trays, and build end-to-end infrastructure and workflows that ensure the highest-quality releases of data center firmware and software.
  2. Define release scope for rack-scale products, working cross-functionally with product management, technical architects, and program management. Deliver these releases through the validation matrix for customer end-use cases, ensuring that the delivered firmware and software is of the highest quality. Solutions must scale, be resilient, and support secure upgrades or rollbacks across diverse customer scenarios.
  3. Influence architecture, design, and implementation decisions for compute and switch tray software and firmware, ensuring quality across nightly, dev, and production drops for all customer use cases, with the right release-validation strategy at each phase of the development life cycle.
  4. Partner with all matrixed organizations (developers, SWQA, product engineering) to shift release quality left across the dev-to-QA pipeline in a very fast-moving environment, using end-to-end CI/CD so that no bug is found at a customer site. Enforce this with well-placed quality metrics at every product milestone, and track KPIs published at a regular cadence. Monitor and report release progress to all stakeholders.
  5. Own ingestion and packaging of software and firmware binaries, readying them for deployment at scale across multiple platforms and different CSP environments.

Skills

Required

  • 12+ years overall in the software industry, specializing in system software and/or firmware development.
  • 5+ years of proven hands-on technical leadership of multi-team organizations across data center firmware (BMC, FPGA, CPLD, network switches), building infrastructure for continuous improvement of release quality.
  • BS/MS/PhD in CS, CE, EE, or a related technical field, or equivalent experience.
  • Prior experience in systems software or firmware development, with a proven history of guiding complex software features or products through the entire product life cycle, ideally on rack-scale datacenter products.
  • Strong understanding of computer system architecture, operating systems principles, HW-SW interactions, and performance analysis/optimizations.
  • Working fluency in Python and Linux sufficient to review designs, prototype tooling, and debug production issues alongside the team.
  • Hands-on experience with web application frameworks and CI/CD platforms (Jenkins, GitLab, Artifactory).
  • Track record of balancing multiple projects with competing priorities and delivering against measurable benchmarks (MTTR, specification compliance, release cadence, automation coverage).
  • Excellent communication and collaboration skills across teams and time zones.

Nice to have

  • Familiarity with the architecture of datacenter server software and experience with the in-band and out-of-band management of firmware and hardware components.
  • Understanding of the REST architectural style, especially JSON over HTTPS with OAuth, and of DMTF / PLDM / SPDM firmware management protocols.
  • Proven experience in developing a self-service release infrastructure, resulting in clear reductions in onboarding SLA times.
  • Experience integrating AI/LLM tooling into engineering workflows – for triage, test generation, code review, or release validation.

What the JD emphasized

  • hard-working, experienced senior manager with a background in Datacenter Software and Firmware release management and infrastructure
  • proven hands-on technical leadership of multi-team organizations across data center firmware (BMC, FPGA, CPLD, network switches), building infrastructure for continuous improvement of release quality
  • proven history of guiding complex software features or products throughout the entire product life cycle
  • Track record of balancing multiple projects with competing priorities and delivering against measurable benchmarks (MTTR, specification compliance, release cadence, automation coverage)