Inference infra

Function

All Engineering · 2810 Research · 168 Product · 57

Status

Sort

3035 AI roles tagged inference_infra.

Company	Title	Sector	AI score	Other tags
NVIDIA	Solution Architect, Energy	Semiconductors	8	Model serving
Capital One	Senior Distinguished AI Engineer	Banking	8	Model serving · Fine-tuning · Guardrails · LLM observability · Vector DB
Capital One	Lead AI Engineer (MLX)	Banking	8	Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability · Evals
Hex	AI Engineering Lead	Data AI	8	Agent orchestration · Agent research · Evals · LLM observability · Model serving · Search & ranking
Samsara	Staff ML Engineer - ML Infrastructure	Enterprise	8	Model serving
Cerebras	Engineering Manager, Inference ML Runtime	Semiconductors	8	Model serving · Multimodal · LLM observability
Amazon	Software Engineer II- AI/ML, AWS Neuron	Big Tech	8	Model serving · Fine-tuning
Amazon	Principal GenAI Specialist SA	Big Tech	8	Agent orchestration · Fine-tuning · Model serving · RAG · Vector DB · LLM observability
NVIDIA	Developer Relations Manager – AI Natives	Semiconductors	8	Model serving · Agent orchestration · Multimodal
Microsoft	Principal Software Engineer - CoreAI Model Inference & Serving	Big Tech	8	Model serving · LLM observability
Microsoft	Principal Software Engineer, CoreAI	Big Tech	8	Model serving · Multimodal
Microsoft	Member of Technical Staff, AI Systems Engineer - Microsoft Superintelligence	Big Tech	8	Model serving
JPMorgan Chase	Software Engineer III - Applied AI	Banking	8	Agent orchestration · Model serving · RAG · Fine-tuning
Amazon	Applied Scientist	Big Tech	8	Recommender systems · Model serving
Amazon	Software Development Engineer II, Items and Relationships Platform	Big Tech	8	Model serving · LLM observability · Vector DB · RAG · Agent orchestration · Multimodal · Vision
Capital One	Lead AI Engineer	Banking	8	Model serving · Guardrails · Vector DB · Fine-tuning · LLM observability
Capital One	Senior Lead AI Engineer	Banking	8	Model serving · Guardrails · Vector DB · RAG · LLM observability · Fine-tuning
NVIDIA	Senior AI Performance and Efficiency Engineer	Semiconductors	8	Model serving
NVIDIA	Senior AI Developer Technology Engineer	Semiconductors	8	Model serving
Capital One	Lead AI Engineer (Gen AI Platform, Agentic AI & LLM Infrastructure & Orchestration)	Banking	8	Agent orchestration · LLM observability · RAG · Vector DB · Guardrails · Model serving
Klaviyo	Sr. Lead AI Engineer	Enterprise	8	Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Fine-tuning · Model serving
Cerebras	ML Performance Benchmarking Engineer	Semiconductors	8	Model serving · LLM observability
Airbnb	Senior Software Engineer, BizTech(AI Products)	Consumer	8	Agent orchestration · RAG · LLM observability · Model serving
Netflix	Technical Director, GenAI - Games	Big Tech	8	Multimodal · Model serving · Fine-tuning
Capital One	Senior Lead AI Engineer	Banking	8	Model serving · Fine-tuning · Guardrails · LLM observability · Vector DB · RAG · Evals
Google	Senior Staff Software Engineer, AI/ML GenAI, Google Ads	Big Tech	8	Vision · Model serving
Google	Senior Software Engineering Manager, AI/ML, Google Cloud AI	Big Tech	8	Model serving · Fine-tuning · Evals · Audio & speech · RL robotics
NVIDIA	Engineering Manager, AI Developer Technology	Semiconductors	8	Model serving · Recommender systems · Multimodal
NVIDIA	Senior Developer Technology Engineer - AI	Semiconductors	8	Model serving · Recommender systems
Capital One	Lead AI Engineer (AI Foundations, LLM Customization and Finetuning)	Banking	8	Fine-tuning · Model serving · Guardrails · Vector DB · LLM observability · Evals