2097 AI roles tagged evals.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Anthropic | Research Engineer / Research Scientist, Biology & Life Sciences | AI Frontier | 9 | Fine-tuning · RL post-training · Frontier research · Agent research |
| Anthropic | Research Engineer / Scientist, Tool Use Safety | AI Frontier | 9 | Agent orchestration · Tool use · Guardrails · RL post-training · Agent research · Fine-tuning · LLM observability |
| Sierra | Software Engineer, Agent (New Grad) | AI Frontier | 9 | Agent orchestration · RAG · Model serving · LLM observability |
| OpenAI | Forward Deployed Engineer - Munich | AI Frontier | 9 | Agent orchestration · Model serving · Inference infra · LLM observability |
| OpenAI | Forward Deployed Engineer - Paris | AI Frontier | 9 | Model serving · Inference infra · LLM observability · Agent orchestration |
| OpenAI | Forward Deployed Engineer - Dublin | AI Frontier | 9 | Agent orchestration · Model serving · Inference infra · LLM observability |
| Perplexity | Member of Technical Staff (Software Engineer, Applied AI) | AI Frontier | 9 | Agent orchestration · Recommender systems · Search & ranking · Fine-tuning · LLM observability |
| Shield AI | Product Manager, AI Platforms (R4991) | Defense | 9 | Multimodal · Training infra · Synthetic data · Inference infra · Model serving |
| Scale AI | Machine Learning Research Scientist, Reasoning | Data AI | 9 | Agent orchestration · Agent research · Fine-tuning · LLM observability |
| Glean | Machine Learning Engineer, AI Assistant & Autonomous AI Agents | Enterprise | 9 | Agent orchestration · Agent research · Inference infra · Model serving |
| ByteDance | Senior Software Engineer - AI for Security, Data/Application | Big Tech | 9 | Interpretability · RAG · LLM observability |
| Vectara | Senior Machine Learning Engineer | Data AI | 9 | RAG · Agent orchestration · LLM observability · Multimodal · Fine-tuning |
| Datadog | AI Research Engineer - Datadog AI Research (DAIR) | Enterprise | 9 | Multimodal · RL post-training · Agent orchestration · Frontier research · Model serving · Inference infra |
| Decagon | Senior Research Engineer | Vertical AI | 9 | Agent orchestration · Fine-tuning · Model serving · RAG · LLM observability |
| Sierra | Software Engineer, Agent (Spanish speaking) | AI Frontier | 9 | Agent orchestration · Model serving · RAG · LLM observability |
| Sierra | Software Engineer, Agent (French speaking) | AI Frontier | 9 | Agent orchestration · Model serving · RAG · Agent research |
| Sierra | Software Engineer, Agent (German speaking) | AI Frontier | 9 | Agent orchestration · Model serving · RAG |
| Datadog | AI Research Engineer - Datadog AI Research (DAIR) | Enterprise | 9 | Multimodal · Frontier research · RL post-training · RLHF · Agent orchestration · Model serving · Inference infra · Synthetic data |
| Anthropic | Research Engineer / Scientist, Robustness | AI Frontier | 9 | RL post-training · Agent research · Guardrails · LLM observability · Frontier research |
| OpenAI | Forward Deployed Engineer (FDE) - SF | AI Frontier | 9 | LLM observability · Model serving |
| OpenAI | Research Engineer, Codex | AI Frontier | 9 | Agent orchestration · Code gen · Agent research · Inference infra · Model serving |
| Anthropic | Research Engineer / Scientist, Tool Use | AI Frontier | 9 | Agent orchestration · Tool use · RL robotics · Guardrails · Fine-tuning · Model serving |
| Anthropic | Research Engineer, Model Performance & Quality | AI Frontier | 9 | LLM observability · Fine-tuning · RL post-training · Model serving |
| Anthropic | Research Engineer, Virtual Collaborator | AI Frontier | 9 | RL post-training · Fine-tuning |
| Anthropic | Research Scientist / Engineer, Agentic Learning (Horizons) | AI Frontier | 9 | Fine-tuning · RL post-training · Synthetic data |
| Cerebras | LLM Inference Performance & Evals Engineer | Semiconductors | 9 | Inference infra · Model serving |
| Anthropic | Research Engineer / Scientist, Model Welfare | AI Frontier | 9 | Interpretability |
| Anthropic | Research Engineer, Model Performance & Quality | AI Frontier | 9 | LLM observability · Fine-tuning · RL post-training · Model serving |
| Abridge | Machine Learning Scientist (All Levels) | Vertical AI | 9 | Fine-tuning · Model serving |
| Cohere | Senior Research Engineer, Model Evaluation | AI Frontier | 9 | LLM observability · Fine-tuning |