Deep Learning Product Research Engineer - Product Innovation

NVIDIA · Semiconductors · Santa Clara, CA

NVIDIA is seeking a Deep Learning Product Research Engineer to bridge cutting-edge AI research and real-world product adoption. This role involves building prototypes, evaluating emerging technologies, creating technical assets (demos, white papers, sample code), and collaborating across research, engineering, and product teams to advance NVIDIA's generative AI platform. The engineer will translate research concepts into practical, developer-focused product examples and stay current with generative AI trends.

What you'd actually do

Build prototypes, proof-of-concept applications, benchmarks and technical demos to explore and showcase the art of possible with NVIDIA’s generative AI platform. You will translate this work directly into high-quality into scalable demo artifacts, white papers, sample code, and other developer-facing materials.
Evaluate emerging trends in generative AI, including large language models, multimodal systems, agentic applications, model evaluation, inference optimization, and AI-assisted software development.
Collaborate closely with product managers, engineering teams, researchers, field teams, customers, and marketing partners to translate product capabilities into practical, developer-focused examples. Serve as the technical bridge, translating advanced AI capabilities and research concepts into practical, developer-focused product examples.
Evaluate the technical feasibility, scalability, and product relevance of emerging technologies. Synthesize deep technical insights, authoring decision memos and feature requests to inform internal roadmaps, drive integrations, and improve NVIDIA’s software stack.
Present technical material through developer blogs, webinars, conferences, workshops, customer engagements, and community events.

Skills

Required

Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, Machine Learning, Artificial Intelligence, or a related technical field, or equivalent experience.
5+ years of meaningful experience in software engineering, machine learning engineering, AI engineering, solutions architecture, applied research, or a similar technical role.
Hands-on experience with machine learning, deep learning, or agentic AI, including building, training, fine-tuning, evaluating, deploying, or optimizing models and AI applications.
Practical experience with generative AI systems, including large language models, retrieval-augmented generation, agentic workflows, model evaluation, or AI application development.
Strong programming skills in Python, and experience with modern deep learning frameworks and libraries such as PyTorch, Hugging Face Transformers, LangChain, LlamaIndex, TensorFlow, or similar tools.
Familiarity with modern AI-assisted development tools and coding agents such as Codex, Claude Code, Cursor, or similar systems.
Ability to create clear, accurate, technically thorough, and compelling content for developers, including tutorials, blogs, sample code, white papers, benchmarks, or demos.
Strong communication and presentation skills, with the ability to explain complex technical topics to both expert and non-expert audiences.
Ability to collaborate optimally across research, engineering, product, marketing, field, and customer-facing teams, and passion for applied AI research, technical storytelling, and improving the user experience for AI practitioners.

Nice to have

PhD in Computer Science, Engineering, Machine Learning, Artificial Intelligence, or a related field.
3+ years of hands-on experience with machine learning, deep learning, generative AI, large language models, multimodal models, reinforcement learning, model optimization, or agentic applications.
Experience building production-quality AI applications, developer tools or research prototypes.
Experience designing or evaluating agentic AI systems, AI coding assistants, model evaluation harnesses, RAG pipelines, synthetic data workflows, or AI safety workflows.
Experience with NVIDIA AI software, models, or frameworks such as NeMo, NeMo Retriever, NeMo Guardrails, NeMo RL, NIM, TensorRT, Dynamo, CUDA, cuDNN, or Nemotron models.

What the JD emphasized

Hands-on experience with machine learning, deep learning, or agentic AI, including building, training, fine-tuning, evaluating, deploying, or optimizing models and AI applications.
Practical experience with generative AI systems, including large language models, retrieval-augmented generation, agentic workflows, model evaluation, or AI application development.
Strong programming skills in Python, and experience with modern deep learning frameworks and libraries such as PyTorch, Hugging Face Transformers, LangChain, LlamaIndex, TensorFlow, or similar tools.
Ability to create clear, accurate, technically thorough, and compelling content for developers, including tutorials, blogs, sample code, white papers, benchmarks, or demos.
Ability to collaborate optimally across research, engineering, product, marketing, field, and customer-facing teams, and passion for applied AI research, technical storytelling, and improving the user experience for AI practitioners.

Other signals

building prototypes
evaluating emerging technologies
translating research into product capabilities
developer-facing materials

Read full job description

NVIDIA is at the center of the AI revolution. Our deep learning platforms, models, frameworks, and accelerated computing technologies help developers, researchers, and enterprises build the next generation of intelligent applications The Deep Learning Product Research team sits at the intersection of engineering, product, research, developer relations, and go-to-market. We help accelerate the path from cutting-edge AI research to real-world product adoption by building high-quality technical assets, proof-of-concept applications, benchmarks, white papers, and developer-facing materials that advance NVIDIA’s generative AI platform We are looking for a hands-on engineer and generative AI practitioner who can build prototypes, write high-quality code, evaluate emerging technologies, explain sophisticated systems clearly, and turn research ideas into practical product capabilities. In this role, you will create prototypes, demos, white papers, benchmarks, blogs, sample applications, conference material, and other technical content. You will work closely with research, engineering, product, marketing, field teams, customers, and the developer community to identify opportunities, surface feedback, and improve products across NVIDIA’s AI ecosystem!

What you’ll be doing:

Build prototypes, proof-of-concept applications, benchmarks and technical demos to explore and showcase the art of possible with NVIDIA’s generative AI platform. You will translate this work directly into high-quality into scalable demo artifacts, white papers, sample code, and other developer-facing materials.
Evaluate emerging trends in generative AI, including large language models, multimodal systems, agentic applications, model evaluation, inference optimization, and AI-assisted software development.
Collaborate closely with product managers, engineering teams, researchers, field teams, customers, and marketing partners to translate product capabilities into practical, developer-focused examples. Serve as the technical bridge, translating advanced AI capabilities and research concepts into practical, developer-focused product examples.
Evaluate the technical feasibility, scalability, and product relevance of emerging technologies. Synthesize deep technical insights, authoring decision memos and feature requests to inform internal roadmaps, drive integrations, and improve NVIDIA’s software stack.
Present technical material through developer blogs, webinars, conferences, workshops, customer engagements, and community events.
Serve as a technical advocate for NVIDIA’s deep learning platform, helping developers understand how to build, optimize, and deploy AI applications using NVIDIA technologies.
Stay current with advances in deep learning, generative AI, model training, fine-tuning, inference, optimization, deployment, agentic workflows, and the broader AI developer ecosystem.

What we need to see:

Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, Machine Learning, Artificial Intelligence, or a related technical field, or equivalent experience.
5+ years of meaningful experience in software engineering, machine learning engineering, AI engineering, solutions architecture, applied research, or a similar technical role.
Hands-on experience with machine learning, deep learning, or agentic AI, including building, training, fine-tuning, evaluating, deploying, or optimizing models and AI applications.
Practical experience with generative AI systems, including large language models, retrieval-augmented generation, agentic workflows, model evaluation, or AI application development.
Strong programming skills in Python, and experience with modern deep learning frameworks and libraries such as PyTorch, Hugging Face Transformers, LangChain, LlamaIndex, TensorFlow, or similar tools.
Familiarity with modern AI-assisted development tools and coding agents such as Codex, Claude Code, Cursor, or similar systems.
Ability to create clear, accurate, technically thorough, and compelling content for developers, including tutorials, blogs, sample code, white papers, benchmarks, or demos.
Strong communication and presentation skills, with the ability to explain complex technical topics to both expert and non-expert audiences.
Ability to collaborate optimally across research, engineering, product, marketing, field, and customer-facing teams, and passion for applied AI research, technical storytelling, and improving the user experience for AI practitioners.

Ways to stand out from the crowd

PhD in Computer Science, Engineering, Machine Learning, Artificial Intelligence, or a related field.
3+ years of hands-on experience with machine learning, deep learning, generative AI, large language models, multimodal models, reinforcement learning, model optimization, or agentic applications.
Experience building production-quality AI applications, developer tools or research prototypes.
Experience designing or evaluating agentic AI systems, AI coding assistants, model evaluation harnesses, RAG pipelines, synthetic data workflows, or AI safety workflows.
Experience with NVIDIA AI software, models, or frameworks such as NeMo, NeMo Retriever, NeMo Guardrails, NeMo RL, NIM, TensorRT, Dynamo, CUDA, cuDNN, or Nemotron models.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant, forward-thinking and hardworking people in the world working for us. There has never been a more exciting time to join!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 136,000 USD - 212,750 USD for Level 3, and 160,000 USD - 253,000 USD for Level 4.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until July 4, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering an inclusive work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.