Solutions Architect, Agentic AI

NVIDIA · Semiconductors · Santa Clara, CA +1 · Remote

NVIDIA is seeking Solutions Architects to build and deploy agentic AI applications at scale for enterprises, with a focus on integrating enterprise data and developing multi-modal dialogue systems and task-specific agents. The role involves working with agentic frameworks, providing feedback to improve software products, and educating vertical teams.

What you'd actually do

  1. The Agentic AI team's mission is to deliver innovative and optimized AI agents using the latest techniques, including test-time compute, reinforcement learning, inference optimization, and model fine-tuning.
  2. You’ll work with agentic frameworks to develop applications that retrieve and generate insights from enterprise data, including text, code, and images.
  3. Provide direct feedback from these first-time implementations to improve our software products, and scale knowledge by educating vertical teams and building communities around NVIDIA AI software products.

Skills

Required

  • Deep Learning and Machine Learning
  • Python
  • C/C++
  • Linux
  • TensorFlow or PyTorch
  • data structures
  • algorithms
  • software engineering principles
  • building advanced multi-agent systems
  • LangGraph
  • LlamaIndex
  • CrewAI
  • communication skills

Nice to have

  • building evaluation harnesses
  • success metrics
  • automated testing pipelines
  • guardrail frameworks
  • fine-tuning and optimizing reasoning-focused LLMs and SLMs
  • prompt engineering
  • quantization
  • benchmarking
  • Kubernetes/OpenShift
  • CI/CD automation
  • secure cloud-native infrastructure

What the JD emphasized

  • building advanced multi-agent systems
  • building evaluation harnesses
  • automated testing pipelines
  • guardrail frameworks
  • fine-tuning and optimizing reasoning-focused LLMs and SLMs
  • production-grade deployment patterns

Other signals

  • building agentic AI applications at scale
  • integrating enterprise data sources into meaningful agentic applications
  • developing production-grade deployment patterns