Inference infra

Function

All Engineering · 1985 Research · 94 Product · 45

Status

Sort

2124 AI roles tagged inference_infra.

Company	Title	Sector	AI score	Other tags
NVIDIA	AI Inference Performance Engineer	Semiconductors	9	Model serving · Quantization
NVIDIA	Senior Deep Learning Architect, LLM Inference	Semiconductors	9	Model serving · LLM observability
NVIDIA	Lead Principal Engineer, Enterprise Agentic AI Platform	Semiconductors	9	Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Model serving
NVIDIA	Senior Systems Software Engineer - Deep Learning Solutions	Semiconductors	9	Model serving · Vision · Multimodal
NVIDIA	Senior Deep Learning Compiler Engineer - XLA	Semiconductors	9	Model serving
NVIDIA	Principal Software Engineer - AI Inference	Semiconductors	9	Model serving
NVIDIA	Senior DL Algorithms Engineer - Inference Performance	Semiconductors	9	Model serving · Multimodal
NVIDIA	High-Performance LLM Training Engineer - New College Grad 2026	Semiconductors	9	Model serving
NVIDIA	Senior Research Scientist, AI Accelerator Design and VLSI	Semiconductors	9	Quantization · Model serving
NVIDIA	Deep Learning Performance Software Engineer	Semiconductors	9	Model serving
NVIDIA	Senior Applied Deep Learning Research Scientist, Efficiency	Semiconductors	9	Fine-tuning · Model serving · Quantization · Pretraining
Adobe	Applied Scientist - Multimodal	Enterprise	9	Multimodal · Guardrails · Fine-tuning · Model serving · Vision · LLM observability · Evals
Adobe	Senior ML Engineer - Firefly	Enterprise	9	Multimodal · Fine-tuning · Model serving
Adobe	Senior Staff Applied Scientist - AI/ML	Enterprise	9	Multimodal · Fine-tuning · Model serving · Evals
Adobe	Principal Machine Learning Engineer, Firefly	Enterprise	9	Model serving · Fine-tuning
NVIDIA	Senior DGX Cloud AI Infrastructure Software Engineer	Semiconductors	9	Model serving · Pretraining · Fine-tuning · LLM observability
Walmart	Distinguished, Software Engineer -AI/ML Engineer- Walmart Connect	Retail	9	Agent orchestration · Tool use · Multimodal · RAG · Vector DB · Fine-tuning · Model serving · RL post-training · Agent research · LLM observability · Guardrails
Together AI	Senior Machine Learning Engineer, Voice AI	Data AI	9	Model serving · Audio & speech
NVIDIA	Solutions Architect, Pre-training and Post-training	Semiconductors	9	Pretraining · Fine-tuning · RL post-training · Model serving
NVIDIA	Senior GPU Networking Architect	Semiconductors	9	Model serving
Canva	Research Scientist - Efficient AI 高性能AI大模型研究科学家	Enterprise	9	Frontier research · Pretraining · Fine-tuning · Model serving · Multimodal · Quantization · Distillation
Snowflake	Staff Research Scientist, AI Agents & LLMs	Data AI	9	Agent orchestration · Agent research · Fine-tuning · Model serving · Evals
Decagon	Senior Software Engineer, ML Infrastructure	Vertical AI	9	Fine-tuning · RL post-training · Model serving · Multimodal
OpenAI	Software Engineer, Codex Core Agents	AI Frontier	9	Agent orchestration · Tool use · Model serving
DoorDash	Senior/Staff Deep Reinforcement Learning Engineer	Consumer	9	RL robotics · Embodied AI · Agent orchestration · Model serving
Adobe	Principal Architect, Express AI Foundations	Enterprise	9	Agent orchestration · Model serving · LLM observability · Evals · Multimodal
Weights & Biases	VP of Product, Research and Training Infrastructure	Data AI	9	Frontier research · Pretraining · RL post-training · RLHF · Model serving
Anthropic	Research Engineer, Performance RL	AI Frontier	9	RL post-training · Frontier research · Code gen · Model serving
Crusoe	Senior Software Engineer, AI Model LifeCycle	Data AI	9	Fine-tuning · RL post-training · Frontier research · Multimodal · Model serving
OpenAI	TL, Research Inference	AI Frontier	9	Model serving