Manager, Systems Software Engineering - NVLink and AI

NVIDIA · Semiconductors · Santa Clara, CA

Manager for a Systems Software Engineering team focused on NVLink and AI, developing solutions for data center platforms and debugging complex issues using LLMs and agentic AI.

What you'd actually do

  1. Manage a team of software engineers passionate about developing tools and software/firmware solutions for next-generation NVLink systems in the data center.
  2. Provide technical direction and strategic leadership in the design, development, and deployment of debug tools using LLMs, agentic AI, and related technologies, and in their integration into production systems.
  3. Serve as the primary technical leader for building debug features into the software stack.
  4. Collaborate with software, firmware and platform teams to direct the investigation and resolution of complex issues in the software/firmware stack.
  5. Align priorities across collaborators and define metrics for measuring the success of the product/team.

Skills

Required

  • BS or higher degree in CS or a related field, or equivalent experience
  • 10+ years of overall industry experience in large distributed system software development
  • 4+ years of experience managing AI/SW development teams
  • Experience debugging functional and performance issues in complex software/firmware systems
  • Strong technical foundation in software engineering, particularly Python, data structures, and system design
  • Excellent communication, collaboration and problem-solving skills
  • Demonstrated success in leading cross-functional teams across architecture and firmware/hardware interfaces

Nice to have

  • Python
  • data structures
  • system design

What the JD emphasized

  • managing AI/SW development teams
  • debug tools using LLMs, agentic AI

Other signals

  • developing solutions for data center platforms
  • debug tools using LLMs
  • agentic AI