Senior Staff Software Engineer — AI Applications and Platform Foundations

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

Senior Staff Software Engineer to architect and build the foundation for NVIDIA's next generation of AI applications, focusing on production-grade AI applications and shared AI platforms for enterprise use. This role requires deep full-stack engineering expertise, platform technical leadership, and a passion for delivering reliable, scalable, enterprise-grade AI products.

What you'd actually do

  1. Architect and deliver production AI applications across desktop, mobile, web, and cloud platforms, driving technical decisions spanning user experiences, backend services, APIs, data pipelines, and platform infrastructure.
  2. Establish engineering excellence for AI products through robust testing strategies, CI/CD, observability, release management, operational readiness, incident response, and performance optimization.
  3. Build shared AI foundations including agent orchestration, memory and context services, evaluation frameworks, telemetry, workflow execution, tool integration, internal frameworks, and reusable platform services.
  4. Operationalize NVIDIA AI technologies such as Nemotron, NIM, NeMo, AI Blueprints, and emerging AI platform capabilities in enterprise production environments.
  5. Partner closely with product managers, designers, infrastructure teams, IT, and AI researchers to deliver measurable business impact while mentoring engineers and driving architectural alignment across organizations.

Skills

Required

  • BS, MS, or equivalent experience in Computer Science, Software Engineering, or a related technical field.
  • 10+ years of experience designing, building, and operating large-scale production software systems, including architecture leadership for complex products.
  • Strong full-stack development experience spanning modern frontend technologies (TypeScript/JavaScript, React, Electron, React Native, Flutter, or similar), backend services, APIs, and distributed systems.
  • Demonstrated expertise in cloud-native architectures, scalable platform infrastructure, data systems, and distributed application architectures.
  • Deep knowledge of software quality, testing, observability, release engineering, performance optimization, security, and operational excellence.
  • Experience building or operating AI-powered applications using technologies such as LLMs, RAG, agent frameworks, tool integrations, memory systems, or workflow orchestration.
  • Proven ability to influence technical direction across teams through strong architectural judgment, communication, and hands-on technical leadership.

Nice to have

  • Familiarity with enterprise application deployment, security, authentication, device management, and application lifecycle management.
  • Experience building and operating desktop and mobile applications at scale, including installers, enterprise deployment, app distribution, auto-update systems, crash recovery, offline experiences, and client observability.
  • Experience designing cross-platform application architectures that provide seamless user experiences across desktop, mobile, and web environments.
  • Familiarity with multi-agent systems, large-scale collaboration platforms, identity and trust frameworks, or enterprise-scale collaboration systems.
  • Hands-on experience with NVIDIA AI technologies, including NeMo, NIM, Nemotron, TensorRT-LLM, or AI Blueprints as well as experience contributing to or working with modern open-source AI ecosystems and agent frameworks.

What the JD emphasized

  • production AI applications
  • enterprise-grade AI products
  • AI prototypes into trusted products used at scale
  • architect and build the foundation for NVIDIA's next generation of AI applications
  • deep full-stack engineering expertise
  • strong platform technical leadership
  • passion for delivering reliable, scalable, enterprise-grade AI products
  • obsesses over software quality, operational excellence, user experience, and system reliability

Other signals

  • production-grade AI applications
  • shared AI platforms
  • enterprise-grade AI products
  • AI prototypes into trusted products used at scale