50 AI roles tagged quantization.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Together AI | Forward Deployed Engineer (Inference & Post-Training) | Data AI | 9 | Inference infra · Model serving · Fine-tuning · RL post-training |
| Capital One | Distinguished Engineer | Banking | 9 | Inference infra · Model serving |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Inference infra · Model serving |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Multimodal · Embodied AI · Fine-tuning · RL post-training · Model serving · Inference infra |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Inference infra · Model serving |
| NVIDIA | Senior Deep Learning Algorithms Engineer - BioNeMo | Semiconductors | 9 | Inference infra · Model serving · Vision |
| NVIDIA | Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles | Semiconductors | 9 | Inference infra · Model serving · Distillation · Embodied AI · Multimodal |
| Perplexity | Engineering Manager (AI Inference) | AI Frontier | 9 | Inference infra · Model serving · LLM observability |
| NVIDIA | Inference Optimization Architect, Speech AI | Semiconductors | 9 | Inference infra · Model serving · Audio & speech · Distillation |
| NVIDIA | Research Scientist, AI Accelerator Design and VLSI - New College Grad 2026 | Semiconductors | 9 | |
| NVIDIA | Senior Applied Deep Learning Research Scientist, Efficiency | Semiconductors | 9 | Fine-tuning · Inference infra · Model serving · Pretraining |
| NVIDIA | Senior Research Scientist, AI Accelerator Design and VLSI | Semiconductors | 9 | Inference infra · Model serving |
| NVIDIA | Senior Deep Learning Engineer | Semiconductors | 9 | Model serving · Inference infra |
| NVIDIA | AI Inference Performance Engineer | Semiconductors | 9 | Inference infra · Model serving |
| NVIDIA | Senior Applied Scientist - Sovereign AI | Semiconductors | 9 | Fine-tuning · RL post-training · Evals · Inference infra · Model serving · Pretraining · Distillation |
| NVIDIA | Deep Learning Engineer - LLM and VLM Model Compression | Semiconductors | 9 | Fine-tuning · Inference infra · Model serving · Vision · Multimodal |
| NVIDIA | Senior Deep Learning Software Engineer, TensorRT Performance | Semiconductors | 9 | Inference infra · Model serving · Vision · Audio & speech |
| NVIDIA | Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026 | Semiconductors | 9 | Inference infra · Model serving · Vision · Audio & speech |
| Canva | Research Scientist - Efficient AI 高性能AI大模型研究科学家 | Enterprise | 9 | Frontier research · Pretraining · Fine-tuning · Model serving · Inference infra · Multimodal · Distillation |
| OpenAI | ML Research Engineer - Hardware Codesign | AI Frontier | 9 | Inference infra · Model serving · Evals |
| Databricks | Staff Software Engineer - GenAI Performance and Kernel | Data AI | 9 | Inference infra · Model serving |
| Anthropic | Performance Engineer, GPU | AI Frontier | 9 | Inference infra · Model serving · Pretraining |
| Baseten | Software Engineer - GPU Kernels | Data AI | 9 | Inference infra · Model serving |
| xAI | Member of Technical Staff - Inference | AI Frontier | 9 | Inference infra · Model serving |
| AMD | Technical Marketing Engineer – AI Training Workloads & Performance | Semiconductors | 8 | Fine-tuning · Pretraining · RL post-training |
| Intel | Senior GenAI Software Solutions Engineer | Semiconductors | 8 | Agent orchestration · Tool use · Model serving · Inference infra · Distillation · RAG · Vector DB · Fine-tuning |
| Roblox | Principal Model Optimization Engineer | Consumer | 8 | Inference infra · Model serving · Fine-tuning |
| NVIDIA | Senior Performance Engineer - LLM Inference Frameworks | Semiconductors | 8 | Inference infra · Model serving |
| Unity | Principal Machine Learning Engineer, Mobile AI Inference Optimization | Enterprise | 8 | Model serving · Inference infra · Multimodal |
| Intel | AI Frameworks Software Engineer – Model Compression Algorithm | Semiconductors | 8 | Inference infra · Model serving · Fine-tuning · Vision |