Inference infra

Roles by function: Engineering 1985 · Research 94 · Product 45

2124 AI roles tagged inference_infra.

Company	Title	Sector	AI score	Other tags
NVIDIA	Senior Software Engineer - Agentic AI	Semiconductors	9	Agent orchestration · Multimodal · Model serving · Evals
Target	Principal AI Engineer - Advanced AI (Machine Learning, Python, Deep Learning)	Retail	9	Agent orchestration · LLM observability · Evals · Model serving
OpenAI	Performance & Systems Engineer, Codex	AI Frontier	9	Model serving · Agent orchestration · LLM observability
NVIDIA	Senior Software Engineer, AI Inference Systems	Semiconductors	9	Model serving
Adobe	Sr Staff Machine Learning Engineer, Adobe Firefly Services	Enterprise	9	Model serving · Fine-tuning
Elastic	Lead GenAI Cloud Developer	Enterprise	9	Agent orchestration · RAG · Vector DB · Fine-tuning · Model serving · LLM observability · Evals · Tool use · Guardrails
Databricks	Principal Research Scientist - AI Scaling & Optimization	Data AI	9	Frontier research · Fine-tuning · Model serving
Capital One	Distinguished Engineer	Banking	9	Model serving · Quantization
Snowflake	AI System Research and Development Engineer - Optimization	Data AI	9	Model serving · Agent orchestration
Capital One	Principal Associate, Data Science - AI Foundations	Banking	9	Fine-tuning · Model serving · Agent orchestration · RAG · Vector DB · LLM observability
Expedia	Machine Learning Engineer III (Gen AI & Multi-Agentic Systems)	Hospitality	9	Agent orchestration · Fine-tuning · RAG · Vector DB · Multimodal · Model serving · LLM observability · Evals · Guardrails · RL post-training · Code gen
Expedia	Senior Machine Learning Engineer (Gen AI & Multi-Agentic Systems)	Hospitality	9	Agent orchestration · RAG · Vector DB · Fine-tuning · RL post-training · Model serving · Multimodal · Vision · Audio & speech · Code gen · Evals · Guardrails · LLM observability
NVIDIA	Tech Engagement Lead - Model Builder	Semiconductors	9	Model serving
Intel	AI Software Engineer Intern	Semiconductors	9	Model serving · Quantization
Intel	AI Software Engineer Intern	Semiconductors	9	Multimodal · Embodied AI · Fine-tuning · RL post-training · Model serving · Quantization
Intel	AI Software Engineer Intern	Semiconductors	9	Model serving · Quantization
Intel	Senior AI Software Architect - Runtime	Semiconductors	9	Model serving
NVIDIA	Senior Software Engineer, AI Inference Systems	Semiconductors	9	Model serving
NVIDIA	Senior Solutions Architect - Generative AI	Semiconductors	9	Fine-tuning · RAG · Agent orchestration · Model serving
NVIDIA	Senior Software Engineer, Agentic AI	Semiconductors	9	Agent orchestration · Evals · Model serving · Code gen
Mistral AI	Applied AI, Forward Deployed Machine Learning Engineer - Montreal	AI Frontier	9	Fine-tuning · RAG · Agent orchestration · Vector DB · Model serving
Mistral AI	Applied AI, Senior/Staff Forward Deployed Machine Learning Engineer - EMEA	AI Frontier	9	Fine-tuning · RAG · Agent orchestration · Model serving
Mistral AI	Applied AI Engineer, Prototyping	AI Frontier	9	Agent orchestration · RAG · Model serving
Mistral AI	Applied AI, Forward Deployed Machine Learning Engineer, Critical and Sovereign Institutions, EMEA	AI Frontier	9	Agent orchestration · Fine-tuning · RAG · Model serving
OpenAI	Software Engineer, Inference - Performance Optimization	AI Frontier	9	Model serving
NVIDIA	Senior Deep Learning Software Engineer	Semiconductors	9	Model serving · Fine-tuning
NVIDIA	LLM Reinforcement Learning Framework Engineer	Semiconductors	9	RL post-training · Agent research · Agent orchestration · Fine-tuning · Model serving
NVIDIA	Senior Applied AI Researcher, Digital Biology	Semiconductors	9	Agent orchestration · Tool use · Multimodal · LLM observability · Fine-tuning · Model serving · Frontier research · Interpretability · Code gen
OpenAI	Manager, Forward Deployed Engineering - Munich	AI Frontier	9	Model serving
Anthropic	Research Engineer, RL Infrastructure (Knowledge Work)	AI Frontier	9	Evals · LLM observability · Model serving · RL post-training · Agent orchestration