57 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Target | Principal AI Engineer - Advanced AI (Machine Learning, Python, Deep Learning) | Retail | 9 | Agent orchestration · LLM observability · Model serving · Inference infra |
| Walmart | Senior Data Scientist: Associate AI Experience | Retail | 9 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Model serving · LLM observability |
| Walmart | Principal Data Scientist: Associate AI experience | Retail | 9 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Model serving · Multi-agent |
| Walmart | Distinguished Data Scientist: Associate AI Experience | Retail | 9 | Agent orchestration · Tool use · RAG · Vector DB · Fine-tuning · Model serving · Multi-agent |
| Walmart | Principal, Data Scientist | Retail | 9 | Agent orchestration · Inference infra · Model serving · RAG · Vector DB · LLM observability · Guardrails |
| Walmart | (USA) Distinguished, Data Scientist | Retail | 9 | Agent orchestration · Agent research · Tool use · RAG · Vector DB · Fine-tuning · Model serving · Multimodal |
| Walmart | Staff, Data Scientist – Conversational AI | Retail | 9 | Agent orchestration · Tool use · Guardrails · RAG · Fine-tuning · Model serving · LLM observability |
| Walmart | Principal, Data Scientist | Retail | 9 | Agent orchestration · Tool use · RAG · Vector DB · Model serving · Inference infra |
| Walmart | (USA) Staff, Data Scientist | Retail | 9 | Agent orchestration · Tool use · RAG · Agent research |
| Target | Lead Engineer- Advanced AI | Retail | 8 | Agent orchestration · Tool use · RAG · LLM observability · Model serving · Inference infra |
| Walmart | (USA) Senior, Data Scientist - Applied AI | Retail | 8 | Agent orchestration · Model serving · Inference infra |
| Walmart | (USA) Principal, Data Scientist - Applied AI | Retail | 8 | Agent orchestration · Model serving |
| Walmart | Staff, Data Scientist | Retail | 8 | Agent orchestration · Agent research · Synthetic data · Guardrails · Model serving |
| Walmart | (USA) Staff, Data Scientist | Retail | 8 | Agent orchestration · Agent research · Guardrails · LLM observability · Tool use |
| Walmart | Senior, Software Engineer - AI Systems | Retail | 8 | Agent orchestration · Tool use · Guardrails · RAG · Vector DB · Inference infra · Model serving |
| Walmart | Software Engineer III– AI Systems | Retail | 8 | Agent orchestration · Tool use · Guardrails · RAG · Vector DB · Inference infra · Model serving |
| Target | Sr Director Data Sciences | Retail | 8 | Agent orchestration · Search & ranking · Recommender systems · RAG · LLM observability · Model serving |
| Walmart | (USA) Senior Manager, Data Science (AI Technical Lead) – Next-Gen Customer Engagement & Returns | Retail | 8 | Agent orchestration · LLM observability · Recommender systems |
| Walmart | (USA) Principal, Software Engineer | Retail | 8 | Agent orchestration · RAG · LLM observability · Guardrails · Inference infra · Model serving |
| Walmart | Senior, Data Scientist | Retail | 8 | Vision · Multimodal · Fine-tuning · RLHF · Reward modeling |
| Walmart | Senior Data Scientists, Conversational AI | Retail | 8 | Fine-tuning · Multimodal · Model serving |
| Walmart | Expert Data Scientists, Conversational AI | Retail | 8 | Agent orchestration · LLM observability · Model serving |
| Nordstrom | Sr. Principal Technical Program Manager (Hybrid - Seattle, WA) | Retail | 7 | Agent orchestration · Tool use · RAG · Code gen · LLM observability |
| Target | Sr Engineer -Advanced AI | Retail | 7 | Agent orchestration · RAG · LLM observability · Fine-tuning · Model serving |
| Nordstrom | Senior Engineer 2: AI Agentic Solutions (Hybrid - Seattle, WA) | Retail | 7 | Agent orchestration · Tool use · RAG · Vector DB · LLM observability · Guardrails · Agent research |
| Nordstrom | Senior Engineer 2, Inventory Visibility | Retail | 7 | Agent orchestration · Tool use · LLM observability · RAG · Vector DB |
| Walmart | (USA) Staff, Software Engineer | Retail | 7 | LLM observability · Guardrails |
| Walmart | Data Scientist III | Retail | 7 | Fine-tuning |
| Walmart | (USA) Senior, Data Scientist | Retail | 7 | Agent orchestration · Agent research · LLM observability · Tool use |
| Walmart | (USA) Principal, Software Engineer | Retail | 7 | Model serving · Inference infra · RAG · LLM observability |
Designing benchmarks and automated scoring systems to measure model quality, safety, or capability — typically blending classical metrics, LLM-as-judge, and human review. Primary AI lifecycle stage: evaluation.
2,040 active AI roles across 208 companies in our index reference Evals as of today.
The companies with the most active Evals listings are: Amazon (188 roles), Google (153 roles), OpenAI (95 roles), Microsoft (73 roles), JPMorgan Chase (70 roles).
Evals primarily belongs to the evaluation stage of the AI lifecycle. In current hiring, Evals roles concentrate at: agents (57%), evaluation (12%).
The sectors with the most active Evals hiring are: Big Tech, Enterprise, AI Frontier.
Designing benchmarks and automated scoring systems to measure model quality, safety, or capability — typically blending classical metrics, LLM-as-judge, and human review.
Primary AI lifecycle stage: evaluation.
As of today, 2,040 active AI roles across 208 companies in our index reference Evals. Hiring concentrates at the agents (57%) and evaluation (12%) stages. Most common sectors: Big Tech, Enterprise, AI Frontier.