Senior Staff Software Engineer - AI Agent Platform

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Senior Staff Software Engineer to build and scale the infrastructure for NVIDIA's AI agent ecosystem, focusing on platform services for the full agent lifecycle, Kubernetes execution environments, CI/CD pipelines, and AI data platform components.

What you'd actually do

  1. Build and develop platform services that own the full agent lifecycle from registration through deployment, execution, and teardown
  2. Architect Kubernetes-based execution environments with pod lifecycle management, namespace isolation, persistent storage, and identity propagation
  3. Develop and maintain automated CI/CD pipelines using GitLab CI and ArgoCD, including reusable pipeline templates and deployment blueprints that standardize how agents are built across teams
  4. Build framework-agnostic infrastructure supporting multiple agent SDKs (Claude Code, OpenAI Codex, LangGraph), with hands-on experience using harnesses, lifecycle hooks, skills configurability, observability (OTEL), and memory services
  5. Develop data ingestion pipelines, access interfaces, and storage layers that power AI agent knowledge and context

Skills

Required

  • Python
  • FastAPI
  • Flask
  • Kubernetes
  • GitLab CI
  • ArgoCD
  • Kafka
  • Redis
  • MongoDB
  • PostgreSQL
  • OAuth 2.0
  • JWT
  • Vault

Nice to have

  • Claude Code
  • OpenAI Codex
  • LangGraph
  • RAG architectures
  • Milvus
  • Pinecone
  • Weaviate
  • React
  • Vue

What the JD emphasized

  • Experience building and scaling AI agents in production
  • Deep Kubernetes expertise
  • Proven track record designing distributed systems
  • Expertise building and managing robust CI/CD pipelines
  • Experience designing AI data platform components
  • History of leading sophisticated technical projects

Other signals

  • AI agent ecosystem
  • production scale
  • agent lifecycle
  • Kubernetes-based execution environments
  • AI data platform components