2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Anthropic | Manager of Applied AI Architecture, Industries | AI Frontier | 8 | LLM observability · Guardrails · Model serving |
| Hex | AI Research Engineer | Data AI | 8 | Agent orchestration · Fine-tuning · Model serving |
| Microsoft | Research Intern - AI Safety & Reliability for LLM Systems | Big Tech | 8 | Agent orchestration · RAG · LLM observability · Guardrails |
| Roblox | [2026] Data Scientist, Foundation AI - PhD Early Career | Consumer | 8 | LLM observability · Fine-tuning · RL post-training · Multimodal |
| Microsoft | Applied Researcher 2/ Senior Applied Researcher | Big Tech | 8 | Fine-tuning · Agent research · Code gen · LLM observability · RAG |
| Harvey | Senior Software Engineer, Fullstack - New Verticals | AI Frontier | 8 | Agent orchestration · Tool use · Guardrails · RAG · LLM observability · Model serving |
| Snorkel AI | Senior Manager - Research | Data AI | 8 | Agent research · RL post-training |
| Amazon | Applied Scientist II, Foundation Model, Industrial Robotics Group | Big Tech | 8 | Multimodal · Fine-tuning · RL robotics · Embodied AI |
| Microsoft | Sr Research Scientist | Big Tech | 8 | Code gen · RAG · Fine-tuning · Inference infra · Model serving · LLM observability |
| LangChain | Software Engineering Manager, AI Observability & Evals Platform (San Francisco, CA) | Data AI | 8 | LLM observability |
| Anthropic | Applied AI Engineer, Life Sciences (Beneficial Deployments) | AI Frontier | 8 | Agent orchestration · LLM observability |
| Airbnb | Senior Data Scientist, Platform | Consumer | 8 | Fine-tuning |
| Disney | Senior Machine Learning Engineer, Ad Platforms | Media | 8 | Agent orchestration · Multimodal · Fine-tuning · Model serving · Audio & speech |
| Whatnot | LLM Platform Engineer | Consumer | 8 | Agent orchestration · RAG · LLM observability · Model serving · Inference infra |
| Apple | Sr. Machine Learning Engineer | Big Tech | 8 | Search & ranking · Recommender systems · Model serving · Fine-tuning |
| Roblox | Senior Machine Learning Engineer, Economy | Consumer | 8 | Recommender systems · Search & ranking · Fine-tuning · Model serving · Inference infra · Guardrails · Vision |
| Capital One | Sr. Lead AI Engineer | Banking | 8 | Model serving · Inference infra · Guardrails · Vector DB · Fine-tuning · LLM observability |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Agent orchestration |
| Capital One | Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Model serving · Inference infra · Guardrails · Vector DB · RAG · Fine-tuning · LLM observability · Agent orchestration |
| Capital One | Senior Lead AI Engineer (LLM Customization and Finetuning) | Banking | 8 | Fine-tuning · Inference infra · Model serving · Guardrails · Vector DB · LLM observability |
| Databricks | Staff Machine Learning Engineer | Data AI | 8 | Fine-tuning · RAG · Model serving |
| xAI | AI Tutor - Crypto | AI Frontier | 8 | Frontier research · Multimodal · Audio & speech |
| Snowflake | Principal Machine Learning Engineer- Search Quality | Data AI | 8 | Search & ranking · Recommender systems · RAG · Vector DB · Agent orchestration · Tool use · LLM observability · Inference infra · Model serving |
| Anthropic | Full Stack Software Engineer, Reinforcement Learning | AI Frontier | 8 | RL post-training · LLM observability |
| Amazon | Machine Learning Engineer II , AGI Customization | Big Tech | 8 | Fine-tuning · Model serving · Multimodal |
| Microsoft | Senior Researcher - AI & Society - Microsoft Research | Big Tech | 8 | Guardrails · Interpretability |
| Datadog | Staff AI Engineer - Notebooks | Enterprise | 8 | Agent orchestration · Tool use · Guardrails · RAG · Fine-tuning · Model serving |
| Datadog | Staff AI Engineer - Notebooks | Enterprise | 8 | Agent orchestration · Tool use · RAG · Guardrails · Fine-tuning · Model serving · Inference infra |
| Cresta | Senior Machine Learning Engineer - Automatic Speech Recognition (ASR) | Vertical AI | 8 | Audio & speech · Fine-tuning · Model serving |
| Amazon | Senior Software Development Engineer, US Prime and Marketing Tech | Big Tech | 8 | Agent orchestration · LLM observability |