Senior System Software Engineer, Data Center Diagnostics

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Senior System Software Engineer focused on high-speed interconnect technologies for GPUs and HPC servers, involving close collaboration with hardware and silicon teams. Responsibilities include developing software tools for chip development, validation, and productization, assessing new hardware features, and architecting diagnostic software. Requires strong C++, systems software, OS fundamentals, and computer architecture knowledge.

What you'd actually do

  1. Working closely with architecture, hardware and driver teams through the product development lifecycle of computing and graphics processors, as well as compute products.
  2. Responsible for crafting software tools and infrastructure required for new chip development, validation, and productization.
  3. You will assess new hardware features and architect diagnostic software using pre-beta CUDA extensions.
  4. This job will require an understanding of our hardware and software architectures.

Skills

Required

  • C++ programming
  • networking concepts and algorithms
  • systems software
  • OS fundamentals
  • computer architecture
  • operating systems
  • algorithms
  • problem solving
  • debugging

Nice to have

  • PCIE
  • NVLink
  • server IO technologies
  • Infiniband
  • Ethernet
  • Python
  • leadership
  • interpersonal skills

What the JD emphasized

  • strong C++ programming skills
  • detailed knowledge of networking concepts and algorithms
  • solid foundation of systems software with emphasis on OS fundamentals
  • deep understanding of current generation hardware architecture at systems level
  • Excellent communication skills
  • dedication to meticulous engineering practices are a requirement
  • extensive use of your knowledge of operating systems, algorithms, and computer architecture
  • 5+ years experience in a related hardware/software position
  • Strong C/C++ programming and Python skills
  • Familiarity with computer architecture, and ability to work close to the hardware
  • Very strong problem solving and debugging skills
  • Ability to self-manage, show leadership, and have good interpersonal skills