Skills

Required

Ph.D. in Computer Science/Engineering, Electrical Engineering, or equivalent experience
3+ years of relevant post-graduate research experience
Excellent knowledge of theory and practice of computer vision methods, as well as deep learning
pruning
quantization
NAS
efficient backbones
large-scale model training
data preparation
model parallelization (tensor and pipeline)
Python
PyTorch
C++
parallel programming (e.g., CUDA)

Nice to have

Experience with large language models
Experience with large vision-language models

NVIDIA is searching for an outstanding Senior Researcher working on efficient deep learning to join our learning and perception research team. We are passionate about research that pushes boundaries but also has impact in the real world. We are particularly excited about methods for post-training model optimization (pruning, quantization, NAS), efficient architecture design, adaptive/dynamic inference, resource-efficient training and fine-tuning, and so forth. You will work within an amazing and collaborative research team that consistently publishes at the top venues in computer vision and machine learning. Our existing expertise includes computer vision, deep learning, generative models, and so forth. Your contributions have the chance to create real impact on our products.

What you'll be doing:

Research, design and implement novel methods for efficient deep learning.
Publish original research. Speak at conferences and events.
Collaborate on research with team members, other internal teams, and external researchers. Mentor interns.
Work with product groups to transfer technology.

What we need to see:

A Ph.D. in Computer Science/Engineering, Electrical Engineering, etc., or equivalent experience in industrial research labs.
3+ years of relevant post-graduate research experience.
Excellent knowledge of theory and practice of computer vision methods, as well as deep learning.
A background in pruning, quantization, NAS, and efficient backbones is required.
Experience with large language models and large vision-language models is a plus.
Excellent programming skills in Python and PyTorch, as well as C++ and parallel programming (e.g., CUDA).
Hands-on experience with large-scale model training including data preparation and model parallelization (tensor and pipeline) is required.
An outstanding research track record and strong communication skills.

NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and productive people in the world working for us. If you're a creative and collaborative researcher, we want to hear from you!

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is 192,000 USD - 304,750 USD for Level 4, and 224,000 USD - 356,500 USD for Level 5.

You will also be eligible for equity and benefits.

Applications for this job will be accepted at least until January 13, 2026.

This posting is for an existing vacancy.

NVIDIA uses AI tools in its recruiting processes.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Senior Research Scientist, Efficient Deep Learning
