Director, Rack Scale Software Architecture

NVIDIA · Semiconductors · Santa Clara, CA +4 · Remote

Lead and manage a team responsible for rack-scale system software architecture, including firmware, kernel drivers, operating systems, networking, fabrics, and manageability software, for NVIDIA's data center platform and node designs. This role involves driving end-to-end software architecture, translating roadmaps into requirements, ensuring software quality, and collaborating with major customers and internal teams.

What you'd actually do

  1. Drive the end-to-end software architecture for NVIDIA's rack-scale products.
  2. Maintain deep understanding of the product portfolio and roadmap; translate forward-looking plans into clear, formal software requirements that anchor execution across the organization.
  3. Ensure high-quality, reliable software; serve as a trusted architectural partner to teams requiring guidance or oversight.
  4. Work directly with major customers to understand their requirements and align their roadmaps with NVIDIA’s roadmap.
  5. Using strong communication skills, present the team vision to senior NVIDIA and external leaders.

Skills

Required

  • System architecture and design
  • Management
  • Scalable and performant server systems design
  • SW/HW interface design
  • Leadership
  • Complex system software for accelerators (GPUs, DPUs, FPGAs)
  • Managerial skills
  • Problem solving
  • Critical thinking
  • Communication skills

Nice to have

  • Knowledge of large-scale cloud and cluster level deployment and management systems
  • Experience with designing robust, resilient and performant scale-up fabrics
  • Leading data center products across the entire lifecycle
  • Strong understanding of networking technology & protocols (e.g. Ethernet, InfiniBand)
  • Familiarity with CXL, UCIe and other C2C technology architectures
  • Knowledge of storage and networking technologies

What the JD emphasized

  • BS or MS degree in Computer Engineering, Computer Science, or related degree or equivalent experience.
  • 15+ years of overall experience in system architecture and design, with 8+ years of proven experience in management.
  • Deep experience in designing architecture for scalable and performant server systems, particularly at the SW/HW interface.
  • Proven leadership skills and strong ownership on past projects involving a large-scale, sophisticated code base.
  • Previous experience working with complex system software for accelerators such as GPUs, DPUs, or FPGAs.
  • Strong managerial, problem-solving, and critical-thinking skills.
  • Comfortable operating in highly matrixed organizations while holding a leadership position.
  • Known for strong interpersonal, verbal, and written communication skills.