Senior System Software Engineer Platform - OpenBMC

NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

Senior System Software Engineer to design and implement OpenBMC Firmware for NVIDIA's GPU Server platforms, focusing on Arm architecture. Responsibilities include BMC firmware bring-up, performance analysis, developing manageability features, designing solutions for errors and configurations, and ensuring enterprise readiness. The role involves performance-optimized monitoring solutions using DMTF standards, instrumenting code for coverage, automating unit tests, and working with security and hardware teams. Requires a BS/MS in EE/CS or equivalent, 8+ years of experience in BMC firmware development, board bring-up expertise, and strong programming skills in C/C++, Bash, Python, Go.

What you'd actually do

  1. Design and implement OpenBMC firmware for GPU server platforms, focusing on but not limited to the Arm architecture.
  2. Hands-on work bringing up BMC firmware, analyzing performance, and coding manageability features for NVIDIA’s server platforms.
  3. Developing and reviewing code, writing and reviewing design documents, reviewing QA test plans, and working closely with all team members to reach consensus on design and testability in line with product requirements.
  4. Designing solutions for errors, statistics, and configuration appropriate to CPUs, GPUs, DIMMs, SSDs, NICs, IB, PSUs, BMC, FPGAs, CPLDs, etc., for enterprise readiness of NVIDIA server platforms.
  5. Designing and developing performance-optimized active monitoring BMC solutions using DMTF standards, including the MCTP, Redfish, SPDM, and PLDM specifications.

Skills

Required

  • Bachelor of Science Degree (or higher) in Electrical Engineering or Computer Science or equivalent experience
  • 8+ years of experience
  • Domain expertise in BMC firmware development on x86 or ARM platforms, including BMC-BIOS communication, thermal management, power management, firmware update, device monitoring, firmware security, etc.
  • Board bring-up expertise with hands-on experience in device drivers for I2C/I3C, SPI, PCIe, SMBus, mailbox, etc., as well as device trees for U-Boot and the Linux kernel.
  • OOB or in-band system management experience with exposure to standards such as IPMI, KCS, DMTF standards (PLDM, MCTP, Redfish, etc.), PMBus, and NVMe.
  • Understanding of the REST architectural style, especially JSON over HTTPS with OAuth.
  • Strong programming and scripting skills in C/C++, Bash, Python, Go, etc., for both Linux user-space programs and system programs, with thorough code review skills.
  • Strong in Linux fundamentals, various Linux distributions and packages, Linux upgrade mechanisms, building and deploying Linux images.
  • Excellent written and oral communication skills
  • Good work ethic
  • Strong sense of teamwork
  • Love of producing quality work
  • Commitment to finishing your tasks every single day
  • Self-starter who loves to find creative solutions to challenging problems

Nice to have

  • Contributor to industry standards like Open Compute, OpenBMC, IPMI, DMTF Standards, and open source.
  • Expertise in system software and platform security for x86/ARM based Rack/Blade server systems.

What the JD emphasized

  • 8+ years of experience
  • Domain expertise in BMC firmware development on x86 or ARM platforms, including BMC-BIOS communication, thermal management, power management, firmware update, device monitoring, firmware security, etc.
  • Board bring-up expertise with hands-on experience in device drivers for I2C/I3C, SPI, PCIe, SMBus, mailbox, etc., as well as device trees for U-Boot and the Linux kernel.
  • OOB or in-band system management experience with exposure to standards such as IPMI, KCS, DMTF standards (PLDM, MCTP, Redfish, etc.), PMBus, and NVMe.
  • Strong programming and scripting skills in C/C++, Bash, Python, Go, etc., for both Linux user-space programs and system programs, with thorough code review skills.
  • Contributor to industry standards like Open Compute, OpenBMC, IPMI, DMTF Standards, and open source.