New AI postings mentioning LLM Evaluation per week — 70 total over 12 weeks.
89 active AI roles across 33 companies mention LLM Evaluation. Category: ML Ops & Evaluation.
LLM Evaluation is a skill in the "ML Ops & Evaluation" category. It currently appears in 89 active AI roles across 33 companies in our index.
The top employers with active AI roles mentioning LLM Evaluation are: Google (15), Apple (9), NVIDIA (8), JPMorgan Chase (7), Salesforce (5).
Over the last 12 weeks, 70 new AI postings mentioned LLM Evaluation. Demand is rising — up 22% in the last four weeks compared to the earliest four weeks in the window.
Roles requiring LLM Evaluation are concentrated in: agents (69%), evaluation (15%), serving infrastructure (6%). These stages follow a seven-stage AI lifecycle from data preparation through to shipped product.
7 AI roles requiring this skill.
| Company | Title | Sector | AI score | Stage |
|---|---|---|---|---|
| Writer | Staff AI research scientist | AI Frontier | 9 | L2 |
| OpenAI | Researcher, Alignment Training | AI Frontier | 9 | L2 |
| Anthropic | Research Engineer, RL Infrastructure (Knowledge Work) | AI Frontier | 9 | L5 |
| OpenAI | Researcher, Automated Red Teaming | AI Frontier | 9 | L5 |
| Mistral AI | Product Manager, Mistral Vibe | AI Frontier | 8 | L6 |
| xAI | Member of Technical Staff - RL Infrastructure | AI Frontier | 8 | L4 |
| Perplexity | Member of Technical Staff (Product Data Scientist, Search Quality) | AI Frontier | 7 | L6 |