Senior Firmware Engineer – Csp Engagements

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Senior Firmware Engineer role focused on system software for Datacenter products, involving embedded firmware development, customer engagement, hardware bring-up, protocol stacks, and debugging for NVIDIA's next-generation computing platforms.

What you'd actually do

  1. Design and develop firmware solutions for manageability and observability of data center servers.
  2. Actively participate in hardware bring-up activities, OOB firmware development, protocol stacks (Redfish, PLDM, MCTP, NSM) and hardware-software co-design for Cloud Service Provider deployments.
  3. Debug and troubleshoot NVIDIA GPU firmware issues, power management, performance, and thermal control problems for data center deployments, providing active support to CSPs.
  4. Partner directly with CSPs to deliver technical solutions, co-develop & co-debug features and optimizations, and provide support during new product introductions.
  5. Perform advanced system debugging, root cause analysis, and performance optimization for large-scale data center environments.

Skills

Required

  • data center server architectures
  • HPC systems
  • hardware-software co-design
  • embedded firmware
  • server management controllers
  • hardware bring-up
  • DMTF protocols (Redfish, IPMI, PLDM, MCTP, SPDM)
  • telemetry frameworks
  • out-of-band management architectures
  • C/C++ in resource-constrained embedded environments
  • RTOS
  • device drivers
  • low-level protocols (I2C, SPI, UART, PCIe, MCTP)
  • RAS including error handling, error injection, fault isolation, and system health monitoring

Nice to have

  • cloud and cluster level deployment and management systems
  • GPU computing (CUDA)
  • deep learning workloads
  • Memory fabric and CXL architectures

What the JD emphasized

  • shipping production BMC solutions