Solutions Architect, GenAI at NVIDIA

What you'd actually do

Serve as an expert who helps customers customize GenAI (LLM) models and optimize training performance at scale.

Lead workshops and training sessions on NVIDIA's technologies.

Partner closely with other Solutions Architects and with engineering, product, and business teams at NVIDIA to build full-stack GenAI solutions for industry-vertical and enterprise use cases.

Work with business managers to craft the group's vision and actionable, effective strategies.

Encourage industry leaders by articulating the business value of the state of the art in generative AI.

Skills

Required

BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, another engineering discipline, or a related field (or equivalent experience).
8+ years of overall work experience in deep learning, data science, or software development, with knowledge of parallel computing on GPUs.
A specific focus on generative AI, with emphasis on training large language models (LLMs) at scale.
Clear written and oral communication skills with the ability to collaborate with management and engineering.
Willingness to share knowledge with clients, partners, and co-workers.
Experience leading workshops, training sessions, and presenting technical solutions to diverse audiences.
Professional or native language proficiency in English and Mandarin.
Ability to travel up to 30% of the time to support customers in South East Asia and beyond.

Nice to have

Hands-on experience with NVIDIA's NeMo SDKs or Megatron.
Demonstrable ability to extend LLMs with new capabilities and to optimize them for training speed, memory efficiency, and resource utilization.
Familiarity with containerization technologies (e.g., Docker or enroot/pyxis) and orchestration tools (e.g., Slurm or Kubernetes) for scalable and efficient model building.

We are looking for a Solutions Architect with experience in GenAI (LLM) model building. This is a highly technical role that requires deep expertise in generative AI, large language models (LLMs), and scalable software engineering practices. We need a passionate, hard-working, expert and creative individual to help us pursue the many opportunities in this region.

A Solutions Architect is the first line of technical expertise between NVIDIA and our customers and partners. Your duties will range from solution design, training and workshops, troubleshooting, and project coordination to industry and marketing speaking engagements, customer relationship management, and more. You will primarily support Singapore, but will need to support the wider South East Asia region as required.

Widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. As you plan your future, see what we can offer you and your family at www.nvidiabenefits.com/

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.