Senior Systems Software Engineer - Rust, Go, C++

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

Senior Systems Software Engineer role at NVIDIA, focusing on designing, developing, and maintaining high-performance, rack-scale management solutions for datacenter environments. The role involves systems software development, hardware/firmware integration, distributed systems, and telemetry/health monitoring, primarily using Rust, Go, and C++.

What you'd actually do

  1. Architect, implement, and maintain core components of an internally developed IaaS (Infrastructure-as-a-Service) product and related microservices primarily in Rust, C++, or Go.
  2. Develop and automate workflows for device discovery, firmware updates, and health monitoring using protocols such as Redfish and other BMC interfaces.
  3. Build and extend distributed microservices and gRPC APIs for rack management, supporting multi-rack, multi-tenant, and multi-site deployments.
  4. Implement telemetry collection, aggregation, and analysis pipelines using Prometheus, OpenTelemetry, and Grafana; contribute to Health-as-a-Service initiatives.

Skills

Required

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field (or equivalent experience)
  • 5+ years of experience in systems software engineering with a focus on distributed systems, software/firmware development, or infrastructure automation
  • Strong hands-on experience with Rust, Go, and C++ for systems-level development
  • Datacenter or computer architecture experience
  • Experience with hardware management protocols (Redfish, IPMI, BMC) and firmware update automation

Nice to have

  • Experience with rack-scale or data center management platforms
  • Familiarity with test automation, simulation/mocking frameworks, and CI/CD pipelines
  • Knowledge of hardware validation, health monitoring, and diagnostics (DCGM, nvbandwidth, Field Diag)
  • Contributions to open-source infrastructure or systems software projects

What the JD emphasized

  • Datacenter or computer architecture experience is required
  • you should understand server, rack, and network topologies, as well as hardware/firmware/software interactions
  • Experience with hardware management protocols (Redfish, IPMI, BMC) and firmware update automation