Senior Manager, Software Engineering - IT Infrastructure

NVIDIA · Semiconductors · Santa Clara, CA

The Senior Manager, Software Engineering - IT Infrastructure at NVIDIA leads a team that builds and operates scalable, reliable enterprise software platforms supporting engineering and IT teams. The role focuses on automation, self-service, documentation, and continuous improvement of applications, tools, and infrastructure.

What you'd actually do

  1. Lead, mentor, and grow a team of software engineers responsible for designing, building, and operating an enterprise software engineering platform that supports infrastructure and services at scale.
  2. Set technical direction and architectural standards for platforms that manage workflows across enterprise software, appliances, networks, and open-source technologies.
  3. Oversee the design, evolution, and reliability of REST APIs used by thousands of engineers for on-demand infrastructure and workflow management.
  4. Guide the development and integration of provisioning, metrics, monitoring, and management software to support highly available infrastructure services.
  5. Drive automation initiatives to improve deployment, scalability, reliability, and operational efficiency across large-scale engineering environments.

Skills

Required

  • Proven experience managing and leading software engineering teams
  • Experience in infrastructure or platform-focused environments
  • Strong technical background in designing and operating REST APIs at scale
  • Experience overseeing systems involving provisioning, metrics, monitoring, and infrastructure management workflows
  • Familiarity with CI/CD pipelines, Kubernetes, cloud deployments, and AWS
  • Working knowledge of Python and Go
  • Exposure to JavaScript frameworks and UI design
  • Ability to balance hands-on technical depth with people leadership and strategic planning

Nice to have

  • Experience leading teams that ship microservice-based and containerized software to production
  • Background in security standards, identity, and credential management
  • Experience with large-scale data center automation and provisioning
  • A track record of building inclusive, high-trust, and high-impact engineering teams

What the JD emphasized

  • enterprise software platforms
  • automation
  • self-service
  • documentation
  • scalability
  • reliability
  • REST APIs at scale
  • provisioning, metrics, monitoring, and infrastructure management workflows