Evals

Designing benchmarks and automated scoring systems to measure model quality, safety, or capability — typically blending classical metrics, LLM-as-judge, and human review.

Primary AI lifecycle stage: evaluation.

As of today, 2,040 active AI roles across 208 companies in our index reference Evals. Hiring concentrates at the agents (57%) and evaluation (12%) stages. Most common sectors: Big Tech, Enterprise, AI Frontier.

Top hiring:

194 AI roles tagged evals.

Company	Title	Sector	AI score	Other tags
JPMorgan Chase	Applied AI ML Researcher Lead	Banking	9	Agent orchestration · Multi-agent · Agent research · Model serving
JPMorgan Chase	Applied Machine Learning Scientist - Vice President	Banking	9	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Fine-tuning · Model serving · Recommender systems · Multimodal · Agent research · RL post-training
JPMorgan Chase	AI/ML Director	Banking	9	Agent orchestration · RAG · Vector DB · Tool use · Guardrails · LLM observability
JPMorgan Chase	Machine Learning Scientist - Vice President	Banking	9	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Fine-tuning · Model serving · Recommender systems · RL post-training · Multimodal
JPMorgan Chase	Lead Software Engineer -AI	Banking	8	Agent orchestration · Tool use · LLM observability · RAG
JPMorgan Chase	Lead Software Engineer -AI	Banking	8	Agent orchestration · Tool use · LLM observability · RAG
JPMorgan Chase	Applied AI ML Vice President	Banking	8	Agent orchestration · RAG · Model serving
JPMorgan Chase	Applied AI ML Lead [Multiple Positions Available]	Banking	8	Agent orchestration · RAG · LLM observability · Fine-tuning · Model serving
JPMorgan Chase	Senior AI Application Engineer - Vice President	Banking	8	LLM observability · Guardrails · Model serving · Inference infra
JPMorgan Chase	Security Engineer III - AIML	Banking	8	Agent orchestration · Guardrails · LLM observability · RAG
JPMorgan Chase	Lead Software Engineer - Python, AI	Banking	8	Agent orchestration · Agent research · Tool use · LLM observability
Capital One	Senior Lead AI Engineer (GenAI Platform Services)	Banking	8	Fine-tuning · Inference infra · Model serving · Guardrails · Vector DB · LLM observability
JPMorgan Chase	Applied AI/ML Lead	Banking	8	Agent orchestration · Model serving · RAG · LLM observability
JPMorgan Chase	Applied AI ML Lead - Payments	Banking	8	Agent orchestration · Tool use · Guardrails · Model serving
JPMorgan Chase	Software Engineer III - AI/ML, Prompt Engineer	Banking	8	Agent orchestration · RAG · Fine-tuning · Model serving · Vector DB · Guardrails · LLM observability
JPMorgan Chase	Applied AI ML Lead [Multiple Positions Available]	Banking	8	Fine-tuning · Model serving · Inference infra · RAG · LLM observability
Capital One	Lead Machine Learning Engineer	Banking	8	Agent orchestration · Model serving · Inference infra · Guardrails · LLM observability · Fine-tuning
Capital One	Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Model serving · Inference infra · Guardrails · Vector DB · Fine-tuning · LLM observability
Capital One	Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Model serving · Inference infra · Fine-tuning · Guardrails · Vector DB · LLM observability
JPMorgan Chase	Red Team Lead Security Engineer	Banking	8	Guardrails · LLM observability · RAG · Agent orchestration
JPMorgan Chase	Quantitative Trading & Research - Applied Researcher – Agentic AI Systems - Associate	Banking	8	Agent orchestration · Tool use · RAG · Fine-tuning · LLM observability
JPMorgan Chase	Senior Quant Analytics Associate - Fraud Risk	Banking	8	Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability
JPMorgan Chase	Predictive Science - AI Engineering & Prompt Architecture Lead - Vice President	Banking	8	Agent orchestration · RAG · Fine-tuning · LLM observability · Vector DB · Tool use · Guardrails
Capital One	Lead AI Engineer (GenAI Platform, AI Foundations, LLM Core and Agentic AI)	Banking	8	Agent orchestration · Model serving · Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability
Capital One	Sr Director, AI Engineering	Banking	8	Model serving · Inference infra · Vector DB · Guardrails · LLM observability
JPMorgan Chase	Applied AI ML Lead - LLM SUITE ENGINEERING	Banking	8	Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Vector DB · Model serving · Inference infra
JPMorgan Chase	Applied ML and Generative AI Leader - Executive Director	Banking	8	Model serving · Fine-tuning · RAG · Agent orchestration
JPMorgan Chase	Applied ML and Generative AI Lead - Vice President	Banking	8	Fine-tuning · Model serving · RAG
JPMorgan Chase	Computational Linguist, Generative AI - Sr. Associate	Banking	8	Guardrails · Agent orchestration · Fine-tuning · LLM observability · RAG
Capital One	Distinguished AI Engineer	Banking	8	Model serving · Inference infra · Fine-tuning · Guardrails · LLM observability · Vector DB

Frequently asked questions

What is Evals in AI?
Designing benchmarks and automated scoring systems to measure model quality, safety, or capability — typically blending classical metrics, LLM-as-judge, and human review. Primary AI lifecycle stage: evaluation.
How many AI roles reference Evals right now?
2,040 active AI roles across 208 companies in our index reference Evals as of today.
Which companies are hiring for Evals roles?
The companies with the most active Evals listings are Amazon (188 roles), Google (153 roles), OpenAI (95 roles), Microsoft (73 roles), and JPMorgan Chase (70 roles).
What AI lifecycle stage does Evals belong to?
Evals primarily belongs to the evaluation stage of the AI lifecycle. In current hiring, Evals roles concentrate at the agents (57%) and evaluation (12%) stages.
What sectors invest most in Evals?
The sectors with the most active Evals hiring are Big Tech, Enterprise, and AI Frontier.