Manager, Software Engineering - Cumulus Linux

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Manager, Software Engineering for NVIDIA's Cumulus Linux Team, focusing on leading development of the Network Operating System software for AI-accelerated data centers. Responsibilities include leading a team, defining and implementing core infrastructure services, reliability, availability, serviceability, and telemetry features. Requires strong leadership in Linux systems, data center networking, and people management, with experience driving projects from concept to production.

What you'd actually do

  1. Lead a team involved with development and delivery of Cumulus Linux operating system telemetry and infrastructure features
  2. Partnering with other engineering teams to scope and develop solutions to improve systems security, performance and reliability features.
  3. Develop and debug C and Python code for system monitoring, reliability and serviceability features as needed.
  4. Collaborate with product, architecture, and engineering teams for E2E integration of systems infrastructure features into Linux and Cumulus Linux distribution
  5. Work with project management team for effort estimation and planning of the different features.

Skills

Required

  • Master of Science in Electrical Engineering, Computer Science, Computer Engineering or Bachelors (or equivalent experience)
  • 10+ years of proven leadership in Linux systems, data center networking technologies
  • 2 or more years of people management experience in an enterprise environment.
  • Familiar with cloud native concepts
  • Strong background with Linux OS feature development
  • Good knowledge of data modeling concepts, OpenConfig, and streaming telemetry protocols like gNMI.
  • Experience driving projects from concept to production
  • Excellent written and verbal communication and interpersonal skills. Comfortable articulating value propositions to customers and influencing internal teams
  • Experience with embedded software on network switches
  • Experience with bring up and troubleshooting of Ethernet interfaces and modules
  • Familiarity with datacenter protocols
  • Able to work independently with minimal direction.

Nice to have

  • Strong background in Linux systems and Linux kernel networking
  • Strong hands-on lab work experience involving system bring-up and debug
  • Loves to work in a collaborative environment

What the JD emphasized

  • take full ownership of tasks from A-Z
  • deliver independently with minimum supervision
  • 2 or more years of people management experience in an enterprise environment
  • Experience driving projects from concept to production