Inference infra

Function

All Engineering · 2810 Research · 168 Product · 57

Status

Sort

3035 AI roles tagged inference_infra.

Company	Title	Sector	AI score	Other tags
Walmart	(USA) Principal, Software Engineer	Retail	8	Agent orchestration · RAG · LLM observability · Guardrails · Evals · Model serving
Deloitte	AI Engineer Consultant	Consulting	8	RAG · Vector DB · Fine-tuning · Model serving · Guardrails · LLM observability · Tool use
Skydio	Autonomy Engineer Intern - Deep Learning (Computational Photography)	Defense	8	Fine-tuning · Model serving · Vision · Synthetic data
JPMorgan Chase	SR Principal Software Engineer - LLM Engineering	Banking	8	Model serving
Google	Staff Software Engineer, On-Device Machine Learning Infrastructure	Big Tech	8	Model serving · Fine-tuning · Audio & speech · Evals
Google	Software Engineering Manager, Automotive AI Agent	Big Tech	8	Agent orchestration · LLM observability · Model serving · Multimodal
Canva	Engineering Manager (BE) - AI Media Platform	Enterprise	8	Model serving · Multimodal
Google	Senior Software Engineer, AI/ML, Search Growth	Big Tech	8	Recommender systems · Search & ranking · Model serving · Fine-tuning · LLM observability · Multimodal
LangChain	Solutions Architect (Remote)	Data AI	8	Agent orchestration · Model serving · RAG · Vector DB · Evals
Modal	Member of Technical Staff - ML Performance	Data AI	8	Model serving
Amazon	Sr. Applied Scientist, Special Projects	Big Tech	8	Model serving · Frontier research
NVIDIA	Senior AI-Native Systems Software Engineer, TensorRT	Semiconductors	8	Agent orchestration · Agent research · Multimodal · Model serving · Code gen · Vision · Audio & speech
Intel	Principal Engineer – Distributed AI Systems Architecture (Heterogeneous Compute)	Semiconductors	8	Model serving
NVIDIA	Senior Performance Engineer - LLM Inference Frameworks	Semiconductors	8	Model serving · Quantization
Uber	Sr Software Engineer	Consumer	8	Recommender systems · Search & ranking · Model serving
OpenAI	Performance Modeling Lead	AI Frontier	8	Model serving
Google	Software Engineer III, AI/ML, Google Cloud	Big Tech	8	Model serving · Multimodal · Vision
Google	Forward Deployed Architect, Generative AI, Google Cloud	Big Tech	8	Agent orchestration · RAG · Vector DB · Model serving · Evals · LLM observability · Fine-tuning · Multimodal
Google	Staff Software Engineer, Games, Inception, DeepMind	Big Tech	8	Agent orchestration · Model serving
Reddit	Senior Staff ML Engineer, Search & Recommendation	Consumer	8	Recommender systems · Search & ranking · Model serving · RAG · LLM observability · Fine-tuning
Google	Software Engineer III, AI/ML GenAI, YouTube	Big Tech	8	Multimodal · Vision · Audio & speech · Model serving
NVIDIA	OEM Solutions Architect - AI Full Stack Public Sector	Semiconductors	8	Model serving · Fine-tuning
NVIDIA	AI Computing Development Engineer, TensorRT-LLM	Semiconductors	8	Model serving · Fine-tuning
SoFi	Director, AI Platforms	Fintech	8	Model serving · Agent orchestration · RAG · Evals · LLM observability · Guardrails
Intercom	AI Infrastructure Engineer	Enterprise	8	Model serving · LLM observability
Intercom	AI Infrastructure Engineer	Enterprise	8	Model serving
NVIDIA	Senior Software Engineer, JAX	Semiconductors	8	Model serving
Intel	Research and Pathfinding Internship: AI Workload Compiler Optimization for CPU and GPU	Semiconductors	8	Model serving
Walmart	(USA) Distinguished, Software Engineer	Retail	8	Model serving · LLM observability · Guardrails · Agent orchestration · Tool use
Adobe	Machine Learning Architect 5 - GenAI Experiences	Enterprise	8	Agent orchestration · RAG · LLM observability · Recommender systems · Model serving