Designing benchmarks and automated scoring systems to measure model quality, safety, or capability — typically blending classical metrics, LLM-as-judge, and human review. Primary AI lifecycle stage: evaluation.
2,040 active AI roles across 208 companies in our index reference Evals as of today.
The companies with the most active Evals listings are: Amazon (188 roles), Google (153 roles), OpenAI (95 roles), Microsoft (73 roles), JPMorgan Chase (70 roles).
Evals belongs primarily to the evaluation stage of the AI lifecycle, but current hiring for Evals roles concentrates at the agents (57%) and evaluation (12%) stages.
The sectors with the most active Evals hiring are: Big Tech, Enterprise, AI Frontier.
194 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| JPMorgan Chase | Applied AI ML Researcher Lead | Banking | 9 | Agent orchestration · Multi-agent · Agent research · Model serving |
| JPMorgan Chase | Applied Machine Learning Scientist - Vice President | Banking | 9 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Fine-tuning · Model serving · Recommender systems · Multimodal · Agent research · RL post-training |
| JPMorgan Chase | AI/ML Director | Banking | 9 | Agent orchestration · RAG · Vector DB · Tool use · Guardrails · LLM observability |
| JPMorgan Chase | Machine Learning Scientist - Vice President | Banking | 9 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Fine-tuning · Model serving · Recommender systems · RL post-training · Multimodal |
| JPMorgan Chase | Lead Software Engineer -AI | Banking | 8 | Agent orchestration · Tool use · LLM observability · RAG |
| JPMorgan Chase | Lead Software Engineer -AI | Banking | 8 | Agent orchestration · Tool use · LLM observability · RAG |
| JPMorgan Chase | Applied AI ML Vice President | Banking | 8 | Agent orchestration · RAG · Model serving |
| JPMorgan Chase | Applied AI ML Lead [Multiple Positions Available] | Banking | 8 | Agent orchestration · RAG · LLM observability · Fine-tuning · Model serving |
| JPMorgan Chase | Senior AI Application Engineer - Vice President | Banking | 8 | LLM observability · Guardrails · Model serving · Inference infra |
| JPMorgan Chase | Security Engineer III - AIML | Banking | 8 | Agent orchestration · Guardrails · LLM observability · RAG |
| JPMorgan Chase | Lead Software Engineer - Python, AI | Banking | 8 | Agent orchestration · Agent research · Tool use · LLM observability |
| Capital One | Senior Lead AI Engineer (GenAI Platform Services) | Banking | 8 | Fine-tuning · Inference infra · Model serving · Guardrails · Vector DB · LLM observability |
| JPMorgan Chase | Applied AI/ML Lead | Banking | 8 | Agent orchestration · Model serving · RAG · LLM observability |
| JPMorgan Chase | Applied AI ML Lead - Payments | Banking | 8 | Agent orchestration · Tool use · Guardrails · Model serving |
| JPMorgan Chase | Software Engineer III - AI/ML, Prompt Engineer | Banking | 8 | Agent orchestration · RAG · Fine-tuning · Model serving · Vector DB · Guardrails · LLM observability |
| JPMorgan Chase | Applied AI ML Lead [Multiple Positions Available] | Banking | 8 | Fine-tuning · Model serving · Inference infra · RAG · LLM observability |
| Capital One | Lead Machine Learning Engineer | Banking | 8 | Agent orchestration · Model serving · Inference infra · Guardrails · LLM observability · Fine-tuning |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Model serving · Inference infra · Guardrails · Vector DB · Fine-tuning · LLM observability |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Model serving · Inference infra · Fine-tuning · Guardrails · Vector DB · LLM observability |
| JPMorgan Chase | Red Team Lead Security Engineer | Banking | 8 | Guardrails · LLM observability · RAG · Agent orchestration |
| JPMorgan Chase | Quantitative Trading & Research - Applied Researcher – Agentic AI Systems - Associate | Banking | 8 | Agent orchestration · Tool use · RAG · Fine-tuning · LLM observability |
| JPMorgan Chase | Senior Quant Analytics Associate - Fraud Risk | Banking | 8 | Agent orchestration · RAG · Vector DB · Fine-tuning · LLM observability |
| JPMorgan Chase | Predictive Science - AI Engineering & Prompt Architecture Lead - Vice President | Banking | 8 | Agent orchestration · RAG · Fine-tuning · LLM observability · Vector DB · Tool use · Guardrails |
| Capital One | Lead AI Engineer (GenAI Platform, AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Model serving · Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability |
| Capital One | Sr Director, AI Engineering | Banking | 8 | Model serving · Inference infra · Vector DB · Guardrails · LLM observability |
| JPMorgan Chase | Applied AI ML Lead - LLM SUITE ENGINEERING | Banking | 8 | Agent orchestration · Tool use · Guardrails · LLM observability · RAG · Vector DB · Model serving · Inference infra |
| JPMorgan Chase | Applied ML and Generative AI Leader - Executive Director | Banking | 8 | Model serving · Fine-tuning · RAG · Agent orchestration |
| JPMorgan Chase | Applied ML and Generative AI Lead - Vice President | Banking | 8 | Fine-tuning · Model serving · RAG |
| JPMorgan Chase | Computational Linguist, Generative AI - Sr. Associate | Banking | 8 | Guardrails · Agent orchestration · Fine-tuning · LLM observability · RAG |
| Capital One | Distinguished AI Engineer | Banking | 8 | Model serving · Inference infra · Fine-tuning · Guardrails · LLM observability · Vector DB |