Inference infra

Function: Engineering 2810 · Research 168 · Product 57

3035 AI roles tagged inference_infra.

Company	Title	Sector	AI score	Other tags
NVIDIA	NCX Engineer, AI Accelerator	Semiconductors	8	Model serving · Recommender systems
NVIDIA	Senior HPC and AI Networking Performance Research and Analysis Engineer	Semiconductors	8	Pretraining · Model serving
Capital One	Distinguished AI Engineer	Banking	8	Model serving · Guardrails · Vector DB · LLM observability
Autodesk	Senior Applied Scientist, Personalization & Agentic Systems	Enterprise	8	Agent orchestration · LLM observability · RAG · Model serving · Recommender systems · Tool use
Workday	Senior Machine Learning Engineer	Enterprise	8	Agent orchestration · LLM observability · Model serving · Recommender systems · RAG · Fine-tuning
Visa	Senior Director, Software Engineering (GenAI/Cloud)	Fintech	8	Agent orchestration · RAG · Vector DB · LLM observability · Model serving
NVIDIA	Machine Learning Applications and Compiler Engineer, LPX - New College Grad 2026	Semiconductors	8	Model serving
Anyscale	Distributed LLM Inference Engineer	Data AI	8	Model serving
Apple	Applied AI Engineer - iCloud Data	Big Tech	8	Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving
DoorDash	Director of Engineering, Logistics	Consumer	8	Model serving
Microsoft	Senior Software Engineer - CoreAI	Big Tech	8	Agent orchestration · Model serving · Multimodal
Reddit	Staff Machine Learning Engineer, AI Serving	Consumer	8	Model serving · LLM observability
JPMorgan Chase	AWM Quant Modelling- Senior Associate	Banking	8	Agent orchestration · Fine-tuning · Model serving · RAG · LLM observability
NVIDIA	Senior AI Solutions Architect	Semiconductors	8	Model serving
Eli Lilly	Associate Director - AI Engineering	Pharma	8	Agent orchestration · Tool use · RAG · LLM observability · Model serving · Guardrails
Disney	Sr Data Scientist	Media	8	Multimodal · Fine-tuning · RAG · Model serving · Evals · Vector DB
NVIDIA	Senior Deep Learning Framework Communications Engineer	Semiconductors	8	Model serving
NVIDIA	Senior Solutions Architect, Generative AI Data Processing	Semiconductors	8	Agent orchestration · Model serving · LLM observability
Google	Staff Software Engineer, AI/ML, Google Cloud	Big Tech	8	Model serving · Audio & speech
JPMorgan Chase	AI Engineering Director	Banking	8	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Vector DB · Model serving
World Labs	Research Platform Engineer	AI Frontier	8	Model serving
Elastic	Lead GenAI Cloud Developer	Enterprise	8	Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability · Evals
Amazon	Sr. Machine Learning Compiler Engineer, AWS Neuron, Annapurna Labs	Big Tech	8	Model serving
Intel	GPU Power Architect	Semiconductors	8	Model serving
NVIDIA	Director, System Software Engineering - Metropolis Accelerated and Inferencing Software	Semiconductors	8	Model serving · Multimodal · Vision · Agent orchestration · LLM observability
NVIDIA	Director, Isaac for Healthcare Engineering	Semiconductors	8	Synthetic data · Model serving · Embodied AI
NVIDIA	Senior Solutions Architect - Deep Learning	Semiconductors	8	Model serving · Agent orchestration
NVIDIA	Senior Software Architect - Deep Learning and HPC Communications	Semiconductors	8	Model serving
ClickUp	Staff AI Engineer - AI Platform	Enterprise	8	Agent orchestration · Model serving · LLM observability · RAG
ClickUp	Senior AI Engineer - AI Platform	Enterprise	8	Agent orchestration · Model serving · LLM observability · Guardrails