41 AI roles tagged quantization.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Together AI | Forward Deployed Engineer (Inference & Post-Training) | Data AI | 9 | Inference infra · Model serving · Fine-tuning · RL post-training |
| Capital One | Distinguished Engineer | Banking | 9 | Inference infra · Model serving |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Inference infra · Model serving |
| Intel | AI Software Engineer Intern | Semiconductors | 9 | Inference infra · Model serving |
| NVIDIA | Senior Deep Learning Algorithms Engineer - BioNeMo | Semiconductors | 9 | Inference infra · Model serving · Vision |
| NVIDIA | Senior DL Software Engineer, Model Optimization and Edge Deployment - Autonomous Vehicles | Semiconductors | 9 | Inference infra · Model serving · Distillation · Embodied AI · Multimodal |
| Perplexity | Engineering Manager (AI Inference) | AI Frontier | 9 | Inference infra · Model serving · LLM observability |
| NVIDIA | Inference Optimization Architect, Speech AI | Semiconductors | 9 | Inference infra · Model serving · Audio & speech · Distillation |
| NVIDIA | Deep Learning Software Engineer, TensorRT Performance - New College Grad 2026 | Semiconductors | 9 | Inference infra · Model serving · Vision · Audio & speech |
| NVIDIA | Senior Deep Learning Engineer | Semiconductors | 9 | Model serving · Inference infra |
| NVIDIA | Senior Applied Scientist - Sovereign AI | Semiconductors | 9 | Fine-tuning · RL post-training · Evals · Inference infra · Model serving · Pretraining · Distillation |
| NVIDIA | Senior Deep Learning Software Engineer, TensorRT Performance | Semiconductors | 9 | Inference infra · Model serving · Vision · Audio & speech |
| NVIDIA | Deep Learning Engineer - LLM and VLM Model Compression | Semiconductors | 9 | Fine-tuning · Inference infra · Model serving · Vision · Multimodal |
| NVIDIA | AI Inference Performance Engineer | Semiconductors | 9 | Inference infra · Model serving |
| Databricks | Staff Software Engineer - GenAI Performance and Kernel | Data AI | 9 | Inference infra · Model serving |
| Anthropic | Performance Engineer, GPU | AI Frontier | 9 | Inference infra · Model serving · Pretraining |
| Baseten | Software Engineer - GPU Kernels | Data AI | 9 | Inference infra · Model serving |
| xAI | Member of Technical Staff - Inference | AI Frontier | 9 | Inference infra · Model serving |
| AMD | Technical Marketing Engineer – AI Training Workloads & Performance | Semiconductors | 8 | Fine-tuning · Pretraining · RL post-training |
| Intel | Senior GenAI Software Solutions Engineer | Semiconductors | 8 | Agent orchestration · Tool use · Model serving · Inference infra · Distillation · RAG · Vector DB · Fine-tuning |
| Roblox | Principal Model Optimization Engineer | Consumer | 8 | Inference infra · Model serving · Fine-tuning |
| NVIDIA | Senior Performance Engineer - LLM Inference Frameworks | Semiconductors | 8 | Inference infra · Model serving |
| Unity | Principal Machine Learning Engineer, Mobile AI Inference Optimization | Enterprise | 8 | Model serving · Inference infra · Multimodal |
| Intel | AI Frameworks Software Engineer – Model Compression Algorithm | Semiconductors | 8 | Inference infra · Model serving · Fine-tuning · Vision |
| NVIDIA | Deep Learning Algorithms Engineer - ACOT | Semiconductors | 8 | Inference infra · Model serving · Fine-tuning · Multimodal |
| NVIDIA | Senior Software Engineer – TensorRT Edge-LLM | Semiconductors | 8 | Inference infra · Model serving · Multimodal |
| NVIDIA | Senior Software Engineer, Quantized Inference | Semiconductors | 8 | Inference infra · Model serving |
| NVIDIA | Senior Software Engineer, Deep Learning - MLIR TRT | Semiconductors | 8 | Inference infra · Model serving |
| Samsara | Senior Machine Learning Engineer - Edge AI | Enterprise | 8 | Multimodal · Inference infra · Model serving · Distillation |
| Baseten | Software Engineer - Model Performance | Data AI | 8 | Inference infra · Model serving · Fine-tuning |