Evals

Function: Engineering 1466 · Research 384 · Product 247

2097 AI roles tagged evals.

Company	Title	Sector	AI score	Other tags
OpenAI	Data Scientist, Preparedness	AI Frontier	8	Guardrails · LLM observability
Amazon	Applied Scientist, Geospatial & Safety Science	Big Tech	8	Multimodal · Model serving · Fine-tuning
Amazon	Applied Scientist II, Foundation Model, Industrial Robotics Group	Big Tech	8	Multimodal · Fine-tuning · RL robotics
Amazon	AI Principal Product Manager-Technical, Alexa Responsible AI	Big Tech	8	Guardrails · RLHF · Reward modeling · LLM observability
Microsoft	Principal Product Manager	Big Tech	8	Agent orchestration · Guardrails · LLM observability · RAG · Vector DB
Bank of America	VP - GenAI Quant Developer	Banking	8	Agent orchestration · Tool use · Guardrails · RAG · Vector DB · Fine-tuning · Model serving
Gusto	Head of AI-Native Talent Systems	Fintech	8	Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving · Recommender systems · Search & ranking · Interpretability · Synthetic data · Agent research
Capital One	Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Fine-tuning · Inference infra · Model serving · Guardrails · LLM observability · RAG · Vector DB
Intercom	Senior Data Scientist AI Tooling	Enterprise	8	Agent orchestration · Tool use · RAG · Vector DB
Disney	Lead Data Scientist, Ad Research	Media	8	Agent orchestration · Multimodal · Vision
Amazon	Data Scientist, SPX AI Lab, SPX Science	Big Tech	8	Agent orchestration
Capital One	Senior Lead AI Engineer (Gen AI Platform Services)	Banking	8	Model serving · Inference infra · Fine-tuning · Guardrails · Vector DB · LLM observability
Datadog	Manager I, Engineering - CodeGen	Enterprise	8	Code gen · Agent orchestration · Model serving · Inference infra · LLM observability
Handshake	Senior Engineering Manager, Reinforcement Learning Environments (RLE)	Enterprise	8	RL post-training · Agent orchestration · Model serving · LLM observability
Datadog	Staff AI Engineer - Notebooks	Enterprise	8	Agent orchestration · Tool use · RAG · Guardrails · Fine-tuning · Model serving · Inference infra
Stripe	Machine Learning Engineer, Stripe Assistant	Fintech	8	Agent orchestration · Tool use · Fine-tuning · RAG · LLM observability · Code gen
Datadog	Staff AI Engineer - Notebooks	Enterprise	8	Agent orchestration · Tool use · RAG · Guardrails · Fine-tuning · Model serving
Datadog	Staff AI Engineer - Notebooks	Enterprise	8	Agent orchestration · Tool use · Guardrails · RAG · Fine-tuning · Model serving
Capital One	Senior Manager AI Engineer (GenAI Platform Services)	Banking	8	Model serving · Inference infra · Guardrails · Vector DB · Fine-tuning · LLM observability
Microsoft	Senior Applied AI Engineer	Big Tech	8	Agent orchestration · Fine-tuning · RAG · LLM observability
Canva	Machine Learning Engineering Manager - Evaluations	Enterprise	8	Model serving · Inference infra · LLM observability · Vision · Multimodal · Fine-tuning
Canva	Machine Learning Engineering Manager - Evaluations	Enterprise	8	LLM observability · Model serving · Vision · Multimodal
Grafana Labs	Staff AI Engineer | US | Remote	Data AI	8	Agent orchestration · LLM observability · RAG · Tool use
Walmart	Senior, Data Scientist	Retail	8	Vision · Multimodal · Fine-tuning · RLHF · Reward modeling
Perplexity	Member of Technical Staff (Data Scientist, Evals)	AI Frontier	8	LLM observability · Vision · RAG · Tool use
Anthropic	Applied AI Engineer	AI Frontier	8	Agent orchestration · LLM observability · Model serving
Datadog	Manager I, Engineering - AI Platform - Annotation & Evaluation	Enterprise	8	Synthetic data · Model serving
Apple	AIML - Sr Machine Learning Engineer, Responsible AI	Big Tech	8	Guardrails · Fine-tuning · Synthetic data · LLM observability · Multimodal
Cresta	Applied Data Scientist	Vertical AI	8
Anthropic	Applied AI Engineer	AI Frontier	8	Agent orchestration · RAG · Fine-tuning · Model serving