Inference infra

Function

All Engineering · 2810 Research · 168 Product · 57

Status

Sort

3035 AI roles tagged inference_infra.

Company	Title	Sector	AI score	Other tags
NVIDIA	Senior AI Infrastructure Software Engineer	Semiconductors	8	Agent orchestration · Model serving · RAG · Vector DB · Fine-tuning
JPMorgan Chase	AWM Risk Analytics Group – Data Scientist - Vice President	Banking	8	Fine-tuning · Model serving · LLM observability · Evals
Writer	AI engineer	AI Frontier	8	Agent orchestration · LLM observability · Model serving
JPMorgan Chase	Agentic Development - Vice President	Banking	8	Agent orchestration · Agent research · LLM observability · RAG · Model serving · Tool use
Cohere	Staff Software Engineer, GPU Infrastructure (HPC)	AI Frontier	8	Model serving
Cerebras	AI Models, Product Manager	Semiconductors	8	Model serving · Agent orchestration · Quantization · Fine-tuning
Whatnot	Senior Engineering Manager, ML Platform	Consumer	8	Model serving
Amazon	Sr Software Development Manager, Generative AI for AWS Neuron	Big Tech	8	Agent orchestration · Model serving · Code gen
Capital One	Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Model serving · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Evals
Capital One	Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving
Walmart	(USA) Staff, Software Engineer \| MLE	Retail	8	Multimodal · Vision · Fine-tuning · Model serving
Microsoft	Principal Applied Scientist	Big Tech	8	Agent orchestration · LLM observability · Model serving
Google	Senior Software Engineer, AI/ML GenAI, Google Workspace	Big Tech	8	Multimodal · Vision · Model serving
Capital One	Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)	Banking	8	Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving
Microsoft	Senior AI Software Architect	Big Tech	8	Model serving · Quantization · Fine-tuning
Datadog	Manager I, Engineering - AI Platform - Training & Serving	Enterprise	8	Model serving
Amazon	Sr. Machine Learning Engineer, AWS Applied AI Solution	Big Tech	8	Agent orchestration · Model serving · Fine-tuning
Capital One	Director, AI Engineering	Banking	8	Agent orchestration · Agent research · Model serving
Cohere	Site Reliability Engineer, Inference Infrastructure	AI Frontier	8	Model serving
Cohere	Staff Software Engineer, Inference Infrastructure	AI Frontier	8	Model serving
JPMorgan Chase	Lead Machine Learning Engineer-MLOps	Banking	8	Model serving · LLM observability · Vector DB · Recommender systems
Synthesia	Senior Research Engineer - Audio Post-Training	Multimodal	8	Audio & speech · Fine-tuning · RL post-training · Model serving · Multimodal
Ramp	Applied AI Engineer	Fintech	8	Agent orchestration · RAG · Fine-tuning · Model serving
NVIDIA	Distinguished Engineer, JAX	Semiconductors	8	Model serving
NVIDIA	Senior Software Architect - Deep Learning and HPC Communications	Semiconductors	8	Model serving
Capital One	Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Fine-tuning · Model serving · Guardrails · LLM observability · RAG · Vector DB · Evals
Capital One	Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Fine-tuning · Model serving · Guardrails · LLM observability · RAG · Vector DB · Evals
NVIDIA	Distinguished Engineer - Dynamo	Semiconductors	8	Model serving
NVIDIA	Principal Software Engineer - Dynamo	Semiconductors	8	Model serving · LLM observability · Agent orchestration
NVIDIA	Principal Software Engineer – Large-Scale LLM Memory and Storage Systems	Semiconductors	8	Model serving