Solution Architect, Generative AI at NVIDIA

What you'd actually do

Develop and demonstrate solutions based on NVIDIA’s pioneering GenAI software and hardware technologies with a focus on inference. This may include advising our customer on agent-based system design and optimal models and infrastructure.

Work directly with key customers to understand their challenges and provide the best solutions based on NVIDIA products

You will drive sustainability by performing in-depth analysis and optimization to ensure the best performance and cost-effectiveness using the NVIDIA software platform. This includes transitioning pipelines to lower precision compute.

Drive pre-sales conversations, build architectures and demos to accelerate the customer AI journey based on NVIDIA products, and work closely with Sales Account Managers to secure design wins.

Create or run Proofs of Concept and demos that require presentation skills, the explanation of complex topics, and Python coding to execute data pipelines, train ML/DL models, and deploy them on container-based orchestrators.

Skills

Required

Excellent verbal, written communication, and technical presentation skills in Japanese
Business level English communication
BS or MS in Computer Science, Engineering, Mathematics, or Physics (or equivalent experience)
5+ years of industry or academic experience related to Generative AI or Deep Learning
Strong coding development and debugging skills
Python
C/C++
Bash
Linux
Demonstrated experience with cluster orchestration tools including Docker, Kubernetes, or SLURM across cloud service providers and on premises
Ability to multitask effectively in a dynamic environment
Strong analytical and problem-solving skills
Clear written and oral communication skills with the ability to effectively collaborate with management and engineering
Have a strong desire to share knowledge with clients, partners and co-workers

Nice to have

Expertise in deploying large-scale training and inferencing pipeline
Experience with pre-training, post-training of transformer-based architectures for language or vision
A deep understanding of the latest generative AI or deep learning methods and algorithms
Experience using or operating Kubernetes, as well as experience writing or customizing Kubernetes configurations
Experience in designing agent-based systems: Experience working as an architect on the overall design, including optimal infrastructure, agent architecture, and authentication systems

NVIDIA is a world leader in computer graphics, artificial intelligence, and accelerated computing. For over 25 years, NVIDIA has been at the forefront of research and engineering around the greatest advances in technology. Our history of innovation drives us to solve the world's hardest problems.

We are now looking for a Solution Architect to work with enterprise companies in Japan and partners, promoting the adoption and providing technical support to enable them to use our portfolio of GPU-accelerated computing solutions—including machine learning, deep learning, and especially generative AI.

In Japan, the development of sovereign AI and agent-based systems using it is rapidly progressing. We are looking for people who are interested in pre-sales and technical support activities involving support and proposals for those model training and deployment. We need a passionate, hard-working, and creative individual who has the skills and aims to work in a fast-evolving technological environment that is always on the cutting edge of AI and Accelerated Computing. This individual is self-driven, excellent in communication and partnership, stays on top of tasks, mission focused and outcome oriented.

What you'll be doing:

Develop and demonstrate solutions based on NVIDIA’s pioneering GenAI software and hardware technologies with a focus on inference. This may include advising our customer on agent-based system design and optimal models and infrastructure.
Work directly with key customers to understand their challenges and provide the best solutions based on NVIDIA products
You will drive sustainability by performing in-depth analysis and optimization to ensure the best performance and cost-effectiveness using the NVIDIA software platform. This includes transitioning pipelines to lower precision compute.
Drive pre-sales conversations, build architectures and demos to accelerate the customer AI journey based on NVIDIA products, and work closely with Sales Account Managers to secure design wins.
Create or run Proofs of Concept and demos that require presentation skills, the explanation of complex topics, and Python coding to execute data pipelines, train ML/DL models, and deploy them on container-based orchestrators.

What we need to see:

Excellent verbal, written communication, and technical presentation skills in Japanese. Business level English communication is also a requirement.
BS or MS in Computer Science, Engineering, Mathematics, or Physics (or equivalent experience)
5+ years of industry or academic experience related to Generative AI or Deep Learning
Strong coding development and debugging skills. Including experience with Python, C/C++, Bash, and Linux
Demonstrated experience with cluster orchestration tools including Docker, Kubernetes, or SLURM across cloud service providers and on premises
Ability to multitask effectively in a dynamic environment
Strong analytical and problem-solving skills
Clear written and oral communication skills with the ability to effectively collaborate with management and engineering
Have a strong desire to share knowledge with clients, partners and co-workers

Ways to stand out from the crowd:

Expertise in deploying large-scale training and inferencing pipeline
Experience with pre-training, post-training of transformer-based architectures for language or vision
A deep understanding of the latest generative AI or deep learning methods and algorithms
Experience using or operating Kubernetes, as well as experience writing or customizing Kubernetes configurations
Experience in designing agent-based systems: Experience working as an architect on the overall design, including optimal infrastructure, agent architecture, and authentication systems

NVIDIA is widely considered to be one of the technological world’s most desirable employers. We have some of the most brilliant and talented people in the world working for us. If you're creative and autonomous, we want to hear from you!

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.