Evals
73 AI roles tagged evals.
Sector
Status
FilteredsectorConsumer×
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Senior Machine Learning Engineer, GenAI Security | Consumer | 9 | Agent orchestration · Tool use · Guardrails · Fine-tuning · Model serving | |
| DoorDash | AI Research Fellowship, (Summer and Fall 2026) | Consumer | 9 | Agent orchestration · Tool use · Forecasting · Multimodal · Vision · Audio & speech · Frontier research · Synthetic data |
| Airbnb | Machine Learning Engineer, Customer Support Engineering | Consumer | 9 | Agent orchestration · Tool use · Guardrails · RAG · Fine-tuning · Model serving · RL post-training · Agent research |
| Principal Engineer, Agentic Engineering | Consumer | 9 | Agent orchestration · Agent research · Guardrails · LLM observability · Tool use | |
| Sr. Data Scientist, Responsible AI | Consumer | 9 | Guardrails · LLM observability · Agent research · Multimodal | |
| Roblox | Principal Machine Learning Engineer, Engineering Acceleration | Consumer | 9 | Agent orchestration · Agent research · Synthetic data · Fine-tuning · Model serving · Code gen |
| Airbnb | Senior Staff Machine Learning Engineer, Data & Eval | Consumer | 9 | LLM observability · Guardrails · RAG · Agent orchestration · Tool use · Fine-tuning · Synthetic data |
| Instacart | Machine Learning Engineer, PhD Intern | Consumer | 9 | LLM observability · RAG · Fine-tuning · Inference infra · Model serving · Recommender systems · Search & ranking · Agent research |
| Spotify | Senior Machine Learning Engineer, Personalization, Magenta | Consumer | 8 | Agent orchestration · Tool use · LLM observability · Model serving |
| Sr. Data Scientist, AI/ML Systems | Consumer | 8 | Recommender systems · Search & ranking | |
| Staff Machine Learning Engineer, Ads Content Understanding | Consumer | 8 | LLM observability · Fine-tuning · Model serving | |
| Roblox | Senior Data Scientist - Machine Intelligence (Creator Services) | Consumer | 8 | Multimodal · Fine-tuning |
| Spotify | Senior Research Scientist - Personalization | Consumer | 8 | Recommender systems · Frontier research |
| Sr. Machine Learning Engineer, Responsible AI– Applied Research Science | Consumer | 8 | Fine-tuning · Guardrails · LLM observability · Recommender systems · Search & ranking · Multimodal | |
| Spotify | Senior Machine Learning Engineer, Personalization, Magenta | Consumer | 8 | Agent orchestration · Tool use · LLM observability |
| Roblox | Senior Software Engineer, ML Infra | Consumer | 8 | Fine-tuning · LLM observability · Guardrails · Model serving |
| Staff Product Manager, AI Safety | Consumer | 8 | Guardrails · LLM observability · Multimodal · Agent research · RLHF | |
| Engineering Manager, Developer Agents | Consumer | 8 | Agent orchestration · LLM observability · Model serving | |
| Staff Software Engineer, Ads Measurement Signal | Consumer | 8 | Agent orchestration · RAG · Guardrails | |
| Airbnb | Machine Learning Engineer, Community Support Engineering | Consumer | 8 | Agent orchestration · Fine-tuning · RAG · Guardrails · LLM observability · Model serving |
| Airbnb | Senior/Staff Machine Learning Engineer, Community Support Engineering | Consumer | 8 | Agent orchestration · Fine-tuning · RAG · Guardrails · LLM observability |
| Roblox | [2026] Data Scientist, Foundation AI - PhD Early Career | Consumer | 8 | LLM observability · Fine-tuning · RL post-training · Multimodal |
| Airbnb | Senior Data Scientist, Platform | Consumer | 8 | Fine-tuning |
| Roblox | Senior Machine Learning Engineer, Economy | Consumer | 8 | Recommender systems · Search & ranking · Fine-tuning · Model serving · Inference infra · Guardrails · Vision |
| Roblox | Senior Data Scientist - Generative AI | Consumer | 8 | LLM observability · Fine-tuning · Agent orchestration · Multimodal · Code gen |
| Senior Machine Learning Engineer | Consumer | 8 | Recommender systems · Search & ranking · Model serving · Inference infra · Fine-tuning · RAG · LLM observability | |
| Machine Learning Engineer | Consumer | 8 | Recommender systems · Search & ranking · Model serving · Inference infra · Fine-tuning · LLM observability · RAG · Agent orchestration | |
| Machine Learning Engineer | Consumer | 8 | Recommender systems · Search & ranking · Model serving · Inference infra · Fine-tuning · RAG · LLM observability | |
| Roblox | Senior Analyst, AI Workflows & Automation | Consumer | 7 | Agent orchestration · LLM observability · Fine-tuning |
| Roblox | Scientist / Senior Scientist, Ads Systems | Consumer | 7 | Model serving · Recommender systems · Search & ranking |