Research Scientist, Efficient Deep Learning - New College Grad 2026

NVIDIA · Semiconductors · Santa Clara, CA

Research Scientist role focused on efficient deep learning methods, including post-training optimization, efficient architecture design, and resource-efficient training/finetuning. Requires a Ph.D. or equivalent research experience, strong Python/PyTorch skills, and experience with large-scale model training and with large language and vision-language models. The role involves research, implementation, publication, and technology transfer.

What you'd actually do

  1. Research, design, and implement novel methods for efficient deep learning.
  2. Publish original research.
  3. Collaborate with other team members and teams.
  4. Mentor interns.
  5. Speak at conferences and events.

Skills

Required

  • Ph.D. in Computer Science/Engineering, Electrical Engineering, or equivalent research experience
  • Excellent knowledge of the theory and practice of computer vision methods and deep learning
  • Experience with large language models and large vision-language models
  • Excellent programming skills in Python and PyTorch
  • Hands-on experience with large-scale model training including data preparation and model parallelization (tensor and pipeline)
  • Outstanding research track record
  • Excellent communication skills

Nice to have

  • Background in pruning, quantization, NAS, efficient backbones
  • C++ and parallel programming (e.g., CUDA)

What the JD emphasized

  • Experience with large language models and large vision-language models is required.
  • Hands-on experience with large-scale model training including data preparation and model parallelization (tensor and pipeline) is required.
  • Outstanding research track record.

Other signals

  • novel methods for efficient deep learning
  • post-training model optimization
  • efficient architecture design
  • adaptive/dynamic inference
  • resource-efficient training and finetuning