2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| OpenAI | Data Scientist, Preparedness | AI Frontier | 8 | Guardrails · LLM observability |
| Amazon | Applied Scientist, Geospatial & Safety Science | Big Tech | 8 | Multimodal · Model serving · Fine-tuning |
| Amazon | Applied Scientist II, Foundation Model, Industrial Robotics Group | Big Tech | 8 | Multimodal · Fine-tuning · RL robotics |
| Amazon | AI Principal Product Manager-Technical, Alexa Responsible AI | Big Tech | 8 | Guardrails · RLHF · Reward modeling · LLM observability |
| Microsoft | Principal Product Manager | Big Tech | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB |
| Bank of America | VP - GenAI Quant Developer | Banking | 8 | Agent orchestration · Tool use · Guardrails · RAG · Vector DB · Fine-tuning · Model serving |
| Gusto | Head of AI-Native Talent Systems | Fintech | 8 | Agent orchestration · Guardrails · LLM observability · RAG · Vector DB · Fine-tuning · Model serving · Recommender systems · Search & ranking · Interpretability · Synthetic data · Agent research |
| Capital One | Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) | Banking | 8 | Agent orchestration · Fine-tuning · Inference infra · Model serving · Guardrails · LLM observability · RAG · Vector DB |
| Intercom | Senior Data Scientist AI Tooling | Enterprise | 8 | Agent orchestration · Tool use · RAG · Vector DB |
| Disney | Lead Data Scientist, Ad Research | Media | 8 | Agent orchestration · Multimodal · Vision |
| Amazon | Data Scientist, SPX AI Lab, SPX Science | Big Tech | 8 | Agent orchestration |
| Capital One | Senior Lead AI Engineer (Gen AI Platform Services) | Banking | 8 | Model serving · Inference infra · Fine-tuning · Guardrails · Vector DB · LLM observability |
| Datadog | Manager I, Engineering - CodeGen | Enterprise | 8 | Code gen · Agent orchestration · Model serving · Inference infra · LLM observability |
| Handshake | Senior Engineering Manager, Reinforcement Learning Environments (RLE) | Enterprise | 8 | RL post-training · Agent orchestration · Model serving · LLM observability |
| Datadog | Staff AI Engineer - Notebooks | Enterprise | 8 | Agent orchestration · Tool use · RAG · Guardrails · Fine-tuning · Model serving · Inference infra |
| Stripe | Machine Learning Engineer, Stripe Assistant | Fintech | 8 | Agent orchestration · Tool use · Fine-tuning · RAG · LLM observability · Code gen |
| Datadog | Staff AI Engineer - Notebooks | Enterprise | 8 | Agent orchestration · Tool use · RAG · Guardrails · Fine-tuning · Model serving |
| Datadog | Staff AI Engineer - Notebooks | Enterprise | 8 | Agent orchestration · Tool use · Guardrails · RAG · Fine-tuning · Model serving |
| Capital One | Senior Manager AI Engineer (GenAI Platform Services) | Banking | 8 | Model serving · Inference infra · Guardrails · Vector DB · Fine-tuning · LLM observability |
| Microsoft | Senior Applied AI Engineer | Big Tech | 8 | Agent orchestration · Fine-tuning · RAG · LLM observability |
| Canva | Machine Learning Engineering Manager - Evaluations | Enterprise | 8 | Model serving · Inference infra · LLM observability · Vision · Multimodal · Fine-tuning |
| Canva | Machine Learning Engineering Manager - Evaluations | Enterprise | 8 | LLM observability · Model serving · Vision · Multimodal |
| Grafana Labs | Staff AI Engineer | US | Remote | Data AI | 8 | Agent orchestration · LLM observability · RAG · Tool use |
| Walmart | Senior, Data Scientist | Retail | 8 | Vision · Multimodal · Fine-tuning · RLHF · Reward modeling |
| Perplexity | Member of Technical Staff (Data Scientist, Evals) | AI Frontier | 8 | LLM observability · Vision · RAG · Tool use |
| Anthropic | Applied AI Engineer | AI Frontier | 8 | Agent orchestration · LLM observability · Model serving |
| Datadog | Manager I, Engineering - AI Platform - Annotation & Evaluation | Enterprise | 8 | Synthetic data · Model serving |
| Apple | AIML - Sr Machine Learning Engineer, Responsible AI | Big Tech | 8 | Guardrails · Fine-tuning · Synthetic data · LLM observability · Multimodal |
| Cresta | Applied Data Scientist | Vertical AI | 8 | |
| Anthropic | Applied AI Engineer | AI Frontier | 8 | Agent orchestration · RAG · Fine-tuning · Model serving |