Solutions Architect, Generative AI - CSP

NVIDIA · Semiconductors · Shenzhen, China +1

NVIDIA is seeking an AI-focused Solutions Architect with expertise in LLMs, generative AI, agentic AI, or recommender systems. The role involves providing technical expertise to customers, assisting with GPU infrastructure for AI, optimizing training and inference pipelines, and gathering customer feedback for product development. This position requires 3+ years of experience in AI for large models and proficiency with AI tools.

What you'd actually do

  1. Assist field business development in guiding customers to build or extend their GPU infrastructure for AI.
  2. Help customers build their large-scale projects, especially Large Language Model (LLM) projects.
  3. Engage with customers to perform in-depth analysis and optimization to ensure the best performance on GPU architecture systems. This includes supporting the optimization of both training and inference pipelines.
  4. Partner with Engineering, Product, and Sales teams to develop and plan the most suitable solutions for customers. Enable development and growth of product features through customer feedback and proof-of-concept evaluations.
  5. Build industry expertise and become a contributor in integrating NVIDIA technology into Enterprise Computing architectures.

Skills

Required

  • Expertise in Large Language Models, generative AI, agentic AI, or recommender systems
  • 3+ years of experience in AI for large language or multi-modal models
  • Model architecture or training strategy design
  • Training optimization
  • Inference acceleration
  • Proficiency in using AI tools or agents
  • Strong written and oral communication skills in English

Nice to have

  • Specialization in post-training or reinforcement learning of LLMs
  • Algorithm research or efficiency optimization for LLMs
  • Demonstrated experience optimizing workloads with GPU technology
  • Experience with NVIDIA AI software and platforms

What the JD emphasized

  • expertise in Large Language Models, generative AI, agentic AI, or recommender systems
  • 3+ years of work-related experience in AI for large language (or multi-modal) models, including model architecture or training strategy design, training optimization, inference acceleration, etc.

Other signals

  • customer-facing technical expertise
  • GPU infrastructure for AI
  • LLM project support
  • training and inference optimization
  • customer feedback for product development