Manager, Agentic Software Development and Test Infrastructure

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Manager for software development and test infrastructure teams focused on AI agents, aiming to reinvent software development at scale for multi-cloud and heterogeneous environments.

What you'd actually do

  1. Grow and develop a world-class, geographically distributed team.
  2. Plan, schedule, mentor, and lead the execution of projects and activities of the team.
  3. Collaborate with engineering teams, program and product management, and partners to define the product roadmap.
  4. Continuously review and identify improvement opportunities in established processes, infrastructure, and practices to ensure the teams are performing in the most efficient and open manner.
  5. Own and drive efficiency, reliability, security, performance, and compliance across the infrastructure.

Skills

Required

  • 8+ overall years of industry experience
  • 3+ years of relevant software engineering management experience leading large complex system software projects
  • Strong understanding of distributed system architecture, operating systems principles, and how to model standards for reliability, performance, monitoring, security, and compliance.
  • Experience leading multiple software engineering projects.
  • Demonstrated ability to work across organizations at all levels and experience balancing multiple projects with competing priorities.

Nice to have

  • Strong track-record of technical management experience.
  • Experience with Agentic AI frameworks and patterns.
  • Background with container technologies such as Docker and Kubernetes.
  • Experience with DevOps tools such as GitHub Actions, GitLab, Buildkite, Ansible, and Terraform.

What the JD emphasized

  • agentic
  • AI agents
  • software development at scale
  • multi-cloud and heterogeneous environment