Solutions Architect, Model Builder - Latam

NVIDIA NVIDIA · Semiconductors · Sao Paulo, Brazil +1 · Remote

Solutions Architect focused on building and deploying agentic AI applications and enterprise agents, with a strong emphasis on localization, performance optimization, and leveraging NVIDIA's AI infrastructure.

What you'd actually do

  1. Localize the future: Fine-tune LLMs to speak the authentic language of specific regions and industries.
  2. Develop and optimize training and inference workflows with partners and collaborate with internal NVIDIA development teams to improve our software stack
  3. Build sophisticated agentic systems featuring multi-agent coordination, long-horizon reasoning, and sophisticated planning frameworks.
  4. Develop full-scale solutions, including domain-specific enterprise agents and high-performance retrieval pipelines (RAG) spanning various data sources.
  5. Optimize inference performance by bringing to bear GPU-accelerated frameworks and the full NVIDIA AI infrastructure stack.

Skills

Required

  • BS/MS/PhD in Computer Science, Electrical Engineering, AI/ML, or equivalent experience.
  • 5+ years of experience in deep learning, machine learning, or distributed AI systems.
  • Strong programming and debugging experience in Python, C/C++, and Linux environments.
  • Background in using deep learning libraries like PyTorch or TensorFlow.
  • Hands-on experience building LLM and generative AI applications.
  • Experience working with agentic or multi-agent AI systems employing frameworks such as: LangGraph, LlamaIndex, CrewAI, LangChain, or OpenAI Agents SDK or similar orchestration frameworks
  • Experience building tool-using AI agents that interact with APIs, databases, and enterprise systems.
  • Ability to rapidly prototype AI applications and build scalable GPU-accelerated architectures.
  • Excellent interpersonal skills and the ability to collaborate with engineering teams, partners, and executive collaborators.

Nice to have

  • Experience working with NVIDIA GPUs and AI software, such as NVIDIA NIM, NeMo Framework, NeMo Retriever, and NeMo Agent Toolkit.
  • Experience with LLM evaluation frameworks, benchmarking systems, and safety guardrails for agentic workflows.
  • Experience optimizing reasoning-focused LLMs through timely engineering, quantization, or benchmarking.
  • Familiarity with Kubernetes/OpenShift, CI/CD automation, and cloud-native deployment patterns for AI systems.
  • Experience with parallel or distributed computing environments and AI workloads optimized for GPUs.

What the JD emphasized

  • 5+ years of experience in deep learning, machine learning, or distributed AI systems.
  • Experience working with agentic or multi-agent AI systems employing frameworks such as: LangGraph, LlamaIndex, CrewAI, LangChain, or OpenAI Agents SDK or similar orchestration frameworks
  • Experience building tool-using AI agents that interact with APIs, databases, and enterprise systems.
  • Ability to rapidly prototype AI applications and build scalable GPU-accelerated architectures.

Other signals

  • Develop sophisticated agentic systems
  • Build hands-on PoCs and reference architectures
  • Partner with high-growth startups and Enterprise ISVs to embed NVIDIA’s software stack