Compressing model weights to lower numerical precision (INT8, FP4, INT4) so inference costs less memory and runs faster, with minimal quality loss.
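As a rough illustration of that definition, here is a minimal sketch of symmetric per-tensor INT8 quantization in plain Python. The function names and sample weights are hypothetical and chosen only for illustration. Production toolchains typically use per-channel or per-group scales and calibration data, which this sketch leaves out.

```python
def quantize_int8(weights):
    """Map floats to integer codes in [-127, 127] using one shared scale."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero tensor, where any scale would do.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    codes = [max(-127, min(127, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float values from the integer codes."""
    return [c * scale for c in codes]


# Hypothetical weights, for illustration only.
weights = [0.42, -1.27, 0.05, 0.88]
codes, scale = quantize_int8(weights)
restored = dequantize(codes, scale)

# Rounding error per weight is bounded by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, scale, max_error)
```

Each code fits in one byte, compared with four bytes for an FP32 weight, which is where the memory saving comes from. The quality loss shows up as the rounding error in `max_error`.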
Primary AI lifecycle stage: serving infrastructure.
As of today, 69 active AI roles across 25 companies in our index reference Quantization. Hiring concentrates at the serving infrastructure (84%) and post-training (12%) stages. Most common sectors: Semiconductors, Big Tech, Robotics.
The companies with the most active Quantization listings are: NVIDIA (16 roles), Amazon (6 roles), ByteDance (5 roles), Intel (5 roles), Rivian (5 roles).
32 AI roles tagged quantization.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Microsoft | Senior Researcher - Efficient AI | Big Tech | 9 | Inference infra · Model serving |
| Amazon | Principal Applied Scientist, ML Codesign | Big Tech | 9 | Model serving · Inference infra |
| Meta | Research Scientist - Reality Labs | Big Tech | 9 | Fine-tuning · Model serving · Inference infra |
|  | Staff Software Engineer, AI/ML Performance | Big Tech | 9 | Inference infra · Model serving · Fine-tuning · Agent orchestration |
|  | Research Scientist, Efficient AI | Big Tech | 9 | Inference infra · Model serving · Fine-tuning · Frontier research |
| Google | Staff Research Scientist, ML Efficiency, Google Research | Big Tech | 9 | Inference infra · Model serving · Fine-tuning |
| Google | Research Scientist, ML Efficiency, Google Research | Big Tech | 9 | Inference infra · Model serving · Fine-tuning · Frontier research · Multimodal |
| Google | Senior Research Scientist, ML Efficiency, Google Research | Big Tech | 9 | Inference infra · Model serving · Fine-tuning · Frontier research · Distillation |
|  | Staff Software Engineer, AI/ML Performance | Big Tech | 9 | Inference infra · Model serving · Fine-tuning · Agent orchestration |
| Apple | Machine Learning Systems Engineer, Siri Agent Modeling | Big Tech | 9 | Inference infra · Model serving · Fine-tuning · LLM observability |
| Meta | AI Research Scientist, CoreML - Monetization AI | Big Tech | 9 | Pretraining · Fine-tuning · RL post-training · Recommender systems · Search & ranking |
| ByteDance | Sr. Research Engineer/Scientist (all levels), Efficient Models | Big Tech | 9 | Fine-tuning · Model serving · Inference infra · Multimodal · Vision |
| ByteDance | Sr. Research Engineer/Scientist (all levels), Efficient Models | Big Tech | 9 | Fine-tuning · Model serving · Inference infra · Multimodal · Vision · Distillation |
| Microsoft | Research Intern - Training Methods for LLM Efficiency | Big Tech | 9 | Fine-tuning · Frontier research |
| Amazon | Applied Scientist, Edge AI and Science | Big Tech | 8 | Fine-tuning · Inference infra · Model serving · Multimodal · Vision · Audio & speech · LLM observability |
|  | Senior Staff Software Engineer, TPU Performance | Big Tech | 8 | Inference infra · Model serving · Fine-tuning · Audio & speech |
|  | Software Engineer, On Device Machine Learning | Big Tech | 8 | Model serving · Inference infra · Fine-tuning |
| Amazon | Applied Scientist, SSG Science | Big Tech | 8 | Fine-tuning · Model serving · Inference infra · Distillation |
| Microsoft | Senior Researcher - Efficient AI | Big Tech | 8 | Inference infra · Model serving |
| Microsoft | Senior AI Software Architect | Big Tech | 8 | Inference infra · Model serving · Fine-tuning |
| Microsoft | Research Intern - AI/ML Numerics & Efficiency | Big Tech | 8 | Inference infra · Model serving |
| Amazon | Sr. Applied Scientist, SSG Science | Big Tech | 8 | Fine-tuning · Model serving · Inference infra · Distillation |
| Amazon | Research Scientist, SSG Science | Big Tech | 8 | Inference infra · Distillation |
| Amazon | Software Dev Engineer, Machine Learning Compilers | Big Tech | 7 | Inference infra · Model serving |
| Amazon | Senior Software Engineer - AI/ML, AWS Neuron Inference | Big Tech | 7 | Inference infra · Model serving |
|  | Senior Machine Learning Engineer, Performance | Big Tech | 7 | Inference infra · Model serving |
| ByteDance | Edge ML Software Engineer (Model Optimization-PICO) - San Jose | Big Tech | 7 | Inference infra · Model serving |
| ByteDance | Edge ML Software Engineer (Compiler-PICO) - San Jose | Big Tech | 7 | Inference infra · Model serving |
|  | Staff Software Engineer, TPU Performance | Big Tech | 7 | Inference infra · Model serving · Fine-tuning |
| Apple | On-Device ML Infrastructure Engineer, ML User Experience, APIs & Integration, Graphics, Games & ML | Big Tech | 7 | Inference infra · Model serving |