Deep Learning Product Research Engineer

NVIDIA NVIDIA · Semiconductors · Santa Clara, CA

This role focuses on product research for generative AI, evaluating emerging models and agent technology to shape NVIDIA's products. The engineer will build proof-of-concept applications, benchmarks, and reference code, convert feedback into product intelligence, and develop enterprise-ready enablement assets. They will also advance internal LLM expertise and tooling, and create technical content. The role requires hands-on experience with generative AI systems, LLMs, RAG, agentic workflows, and model evaluation, along with strong Python and deep learning framework skills.

What you'd actually do

  1. Lead product research for generative AI by evaluating emerging models, agent technology, reinforcement learning, and evaluation methods, then assessing what they mean for NVIDIA products.
  2. Build proof-of-concept applications, benchmarks, and reference sample code that validate new capabilities and demonstrate product value.
  3. Convert customer, developer, benchmark, usage, and field signals into structured product intelligence, including adoption trends, friction points, issue reproductions, and roadmap recommendations.
  4. Develop enterprise-ready enablement assets such as reference architectures, integration playbooks, performance tuning recipes, and demo-to-production workflows for Nemotron, NeMo, NIM, and related NVIDIA AI software.
  5. Partner with research, engineering, product management, technical marketing, field teams, and customers to turn insights into feature requests, launch inputs, positioning, and usability improvements.

Skills

Required

  • Master’s degree in Computer Science, Computer Engineering, Electrical Engineering, Machine Learning, Artificial Intelligence, or a related technical field, or equivalent experience.
  • 5+ years of proven experience in software engineering, machine learning engineering, AI engineering, solutions architecture, applied research, or a similar technical role.
  • Hands-on experience with machine learning, deep learning, or agentic AI, including building, training, fine-tuning, evaluating, deploying, or optimizing models and AI applications.
  • Practical experience with generative AI systems, including large language models, retrieval-augmented generation, agentic workflows, model evaluation, or AI application development.
  • Experience with Python and modern deep learning frameworks and libraries such as PyTorch, Hugging Face Transformers, LangChain, LlamaIndex, TensorFlow, or similar tools.
  • Familiarity with modern AI-assisted development tools and coding agents such as Codex, Claude Code, Cursor, or similar systems.
  • Ability to create clear, accurate, technically rigorous, and compelling content for developers, including tutorials, blogs, sample code, white papers, benchmarks, or demos.
  • Strong communication and presentation skills, with the ability to explain complex technical topics to both expert and non-expert audiences.

Nice to have

  • PhD in Computer Science, Engineering, Machine Learning, Artificial Intelligence, or a related field.
  • 3+ years of hands-on experience with machine learning, deep learning, generative AI, large language models, multimodal models, reinforcement learning, model optimization, or agentic applications.
  • Experience designing or evaluating agentic AI systems, AI coding assistants, model evaluation harnesses, RAG pipelines, synthetic data workflows, or AI safety workflows.
  • Experience with NVIDIA AI software, models, or frameworks such as NeMo, NeMo Retriever, NeMo Guardrails, NeMo RL, NIM, TensorRT, Dynamo, CUDA, cuDNN, or Nemotron models.
  • Familiarity with the broader generative AI ecosystem, including open models, agent frameworks, vector databases, evaluation tools, deployment platforms, and emerging AI developer workflows.

What the JD emphasized

  • Hands-on experience with machine learning, deep learning, or agentic AI, including building, training, fine-tuning, evaluating, deploying, or optimizing models and AI applications.
  • Practical experience with generative AI systems, including large language models, retrieval-augmented generation, agentic workflows, model evaluation, or AI application development.
  • Experience designing or evaluating agentic AI systems, AI coding assistants, model evaluation harnesses, RAG pipelines, synthetic data workflows, or AI safety workflows.

Other signals

  • build prototypes
  • evaluate emerging technologies
  • turn research ideas into practical product capabilities
  • convert customer, developer, benchmark, usage, and field signals into structured product intelligence
  • advance internal LLM expertise and tooling