Evals

Function

All Engineering · 1466 Research · 384 Product · 247

Status

Sort

2097 AI roles tagged evals.

Company	Title	Sector	AI score	Other tags
LangChain	Product Manager, LangSmith	Data AI	8	LLM observability · Agent orchestration · Model serving
Datadog	Senior AI Engineer - APM Experiences	Enterprise	8	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Model serving
Microsoft	Principal Applied Scientist	Big Tech	8	Agent orchestration · RL post-training · Fine-tuning · Model serving · LLM observability
Microsoft	Research Intern - STAC, NYC (Sociotechnical Alignment Center)	Big Tech	8	LLM observability · Synthetic data · Interpretability
Intercom	Staff AI Product Manager	Enterprise	8	Agent orchestration · Model serving
Scale AI	Senior Machine Learning Engineer - Model Evaluations, Public Sector	Data AI	8	LLM observability · Agent orchestration · Guardrails · Multimodal
Snorkel AI	Applied AI Engineer - AI Solutions	Data AI	8	Agent orchestration · RAG · Fine-tuning · Vector DB · LLM observability
OpenAI	Forward Deployed Engineer - London	AI Frontier	8	Model serving · Inference infra · Agent orchestration · LLM observability
Anthropic	Forward Deployed Engineer, Applied AI	AI Frontier	8	Agent orchestration · Tool use · Fine-tuning · Model serving
Nuro	Technical Lead Manager, Autonomy Evaluation and Intelligence	Robotics	8	Agent research · Embodied AI · Agent orchestration · Model serving
Stripe	Machine Learning Engineer, Supportability	Fintech	8	Agent orchestration · LLM observability · Model serving
Capital One	Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Model serving · Inference infra · Guardrails · Vector DB · RAG · LLM observability · Fine-tuning · Agent orchestration
Capital One	Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Model serving · Inference infra · Fine-tuning · Guardrails · Vector DB · RAG · Agent orchestration · LLM observability
Sierra	Software Engineer, Agent	AI Frontier	8	Agent orchestration · RAG · Model serving · LLM observability
Zillow	AI Applied Scientist - PhD Intern, Evaluation Systems and Metrics	Consumer	8	Multimodal · Agent research · Guardrails
Amazon	Applied Scientist II, Strategic Account Services (SAS)	Big Tech	8	Model serving
Amazon	Principal Applied Scientist, Advertiser Growth, Amazon Sponsored Products & Brands	Big Tech	8	Agent orchestration · Fine-tuning · RL post-training · Recommender systems
Klaviyo	Senior AI Engineer	Enterprise	8	Agent orchestration · Fine-tuning · Model serving · Inference infra
Scale AI	STEM Fellow - Human Frontier Collective (UK)	Data AI	8	Frontier research
Descript	Senior Software Engineer, Agent	AI Frontier	8	Agent orchestration · Tool use · LLM observability · Multimodal
ZoomInfo	Senior Product Manager, Context Engineering	Enterprise	8	RAG · Vector DB · Agent orchestration · LLM observability · Model serving
Apple	AIML - Research Scientist, AI Interpretability & Visualization	Big Tech	8	Interpretability
Intercom	Senior Data Scientist - AI Tooling	Enterprise	8	Agent orchestration · Tool use · RAG · Vector DB
Abridge	Software Engineer, Gen AI Platform	Vertical AI	8	Agent orchestration · Tool use · LLM observability · RAG · Vector DB
Uber	Sr. Staff Engineer (Conversational/Voice AI)	Consumer	8	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Audio & speech · Model serving · Multimodal
Apple	AI Data Scientist	Big Tech	8	LLM observability · RAG · Fine-tuning · Multimodal
Scale AI	AI Product Manager	Data AI	8	Agent orchestration · RL robotics · Embodied AI · Synthetic data
Google	Software Engineer III, AI/ML GenAI, Google Cloud AI	Big Tech	8	Model serving · Inference infra · Fine-tuning · Multimodal · Vision · Audio & speech · Code gen
OpenAI	Forward Deployed Engineer - Tokyo	AI Frontier	8	Model serving · LLM observability
LangChain	Fullstack Software Engineer, Applied AI	Data AI	8	Agent orchestration · RAG · LLM observability · Model serving