Quantization

Compressing model weights to lower numerical precision (INT8, INT4, FP4) so that inference uses less memory and runs faster, with minimal quality loss.

Primary AI lifecycle stage: serving infrastructure.
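The definition above can be illustrated with a minimal sketch, assuming symmetric per-tensor INT8 quantization written in plain Python. The function names `quantize_int8` and `dequantize` are hypothetical and not taken from any library; production toolchains typically add per-channel scales, calibration data, and hardware-specific kernels.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization of float weights to INT8 codes.

    Illustrative helper (hypothetical name), not a library API.
    """
    max_abs = max(abs(w) for w in weights)
    # The scale maps the largest magnitude to 127; guard against all-zero input.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer code and clamp to the symmetric INT8 range.
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float weights from INT8 codes."""
    return [c * scale for c in codes]


weights = [0.5, -1.27, 0.03, 1.0]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# The rounding error per weight is bounded by half of one quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes)      # integer codes stored in place of the floats
print(max_error)  # worst-case reconstruction error for this tensor
```

Each weight is stored as a one-byte integer plus one shared scale, instead of a 4-byte FP32 value, which is where the memory saving comes from. The quality loss is the rounding error, which stays within half a quantization step per weight in this scheme.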

As of today, 69 active AI roles across 25 companies in our index reference Quantization. Hiring concentrates at the serving infrastructure (84%) and post-training (12%) stages. Most common sectors: Semiconductors, Big Tech, Robotics.

Top hiring: NVIDIA (16 roles), Amazon (6 roles), ByteDance (5 roles), Intel (5 roles), Rivian (5 roles).

Roles by sector: Semiconductors (44), Big Tech (32), Enterprise (7), Auto (7), Robotics (6), AI Frontier (5), Data AI (4), Consumer (2), Banking (1).

Roles by function: Engineering (83), Research (23), Product (2).

Filtered by sector: Big Tech

32 AI roles tagged quantization.

Company	Title	Sector	AI score	Other tags
Microsoft	Senior Researcher - Efficient AI	Big Tech	9	Inference infra · Model serving
Amazon	Principal Applied Scientist, ML Codesign	Big Tech	9	Model serving · Inference infra
Meta	Research Scientist - Reality Labs	Big Tech	9	Fine-tuning · Model serving · Inference infra
Google	Staff Software Engineer, AI/ML Performance	Big Tech	9	Inference infra · Model serving · Fine-tuning · Agent orchestration
Google	Research Scientist, Efficient AI	Big Tech	9	Inference infra · Model serving · Fine-tuning · Frontier research
Google	Staff Research Scientist, ML Efficiency, Google Research	Big Tech	9	Inference infra · Model serving · Fine-tuning
Google	Research Scientist, ML Efficiency, Google Research	Big Tech	9	Inference infra · Model serving · Fine-tuning · Frontier research · Multimodal
Google	Senior Research Scientist, ML Efficiency, Google Research	Big Tech	9	Inference infra · Model serving · Fine-tuning · Frontier research · Distillation
Google	Staff Software Engineer, AI/ML Performance	Big Tech	9	Inference infra · Model serving · Fine-tuning · Agent orchestration
Apple	Machine Learning Systems Engineer, Siri Agent Modeling	Big Tech	9	Inference infra · Model serving · Fine-tuning · LLM observability
Meta	AI Research Scientist, CoreML - Monetization AI	Big Tech	9	Pretraining · Fine-tuning · RL post-training · Recommender systems · Search & ranking
ByteDance	Sr. Research Engineer/Scientist (all levels), Efficient Models	Big Tech	9	Fine-tuning · Model serving · Inference infra · Multimodal · Vision
ByteDance	Sr. Research Engineer/Scientist (all levels), Efficient Models	Big Tech	9	Fine-tuning · Model serving · Inference infra · Multimodal · Vision · Distillation
Microsoft	Research Intern - Training Methods for LLM Efficiency	Big Tech	9	Fine-tuning · Frontier research
Amazon	Applied Scientist, Edge AI and Science	Big Tech	8	Fine-tuning · Inference infra · Model serving · Multimodal · Vision · Audio & speech · LLM observability
Google	Senior Staff Software Engineer, TPU Performance	Big Tech	8	Inference infra · Model serving · Fine-tuning · Audio & speech
Google	Software Engineer, On Device Machine Learning	Big Tech	8	Model serving · Inference infra · Fine-tuning
Amazon	Applied Scientist, SSG Science	Big Tech	8	Fine-tuning · Model serving · Inference infra · Distillation
Microsoft	Senior Researcher - Efficient AI	Big Tech	8	Inference infra · Model serving
Microsoft	Senior AI Software Architect	Big Tech	8	Inference infra · Model serving · Fine-tuning
Microsoft	Research Intern - AI/ML Numerics & Efficiency	Big Tech	8	Inference infra · Model serving
Amazon	Sr. Applied Scientist, SSG Science	Big Tech	8	Fine-tuning · Model serving · Inference infra · Distillation
Amazon	Research Scientist, SSG Science	Big Tech	8	Inference infra · Distillation
Amazon	Software Dev Engineer, Machine Learning Compilers	Big Tech	7	Inference infra · Model serving
Amazon	Senior Software Engineer - AI/ML, AWS Neuron Inference	Big Tech	7	Inference infra · Model serving
Google	Senior Machine Learning Engineer, Performance	Big Tech	7	Inference infra · Model serving
ByteDance	Edge ML Software Engineer (Model Optimization-PICO) - San Jose	Big Tech	7	Inference infra · Model serving
ByteDance	Edge ML Software Engineer (Compiler-PICO) - San Jose	Big Tech	7	Inference infra · Model serving
Google	Staff Software Engineer, TPU Performance	Big Tech	7	Inference infra · Model serving · Fine-tuning
Apple	On-Device ML Infrastructure Engineer, ML User Experience, APIs & Integration, Graphics, Games & ML	Big Tech	7	Inference infra · Model serving

Frequently asked questions

What is Quantization in AI?
Quantization is the compression of model weights to lower numerical precision (INT8, INT4, FP4) so that inference uses less memory and runs faster, with minimal quality loss. Primary AI lifecycle stage: serving infrastructure.
How many AI roles reference Quantization right now?
69 active AI roles across 25 companies in our index reference Quantization as of today.
Which companies are hiring for Quantization roles?
The companies with the most active Quantization listings are: NVIDIA (16 roles), Amazon (6 roles), ByteDance (5 roles), Intel (5 roles), Rivian (5 roles).
What AI lifecycle stage does Quantization belong to?
Quantization primarily belongs to the serving infrastructure stage of the AI lifecycle. In current hiring, Quantization roles concentrate in two stages: serving infrastructure (84%) and post-training (12%).
What sectors invest most in Quantization?
The sectors with the most active Quantization hiring are: Semiconductors, Big Tech, Robotics.