Evals
324 AI roles tagged evals.
Filtered by sector: AI Frontier

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| OpenAI | Researcher, Misalignment Research | AI Frontier | 10 | Guardrails · Agent research · Frontier research |
| Mistral AI | AI Scientist - Zurich | AI Frontier | 10 | Frontier research · Pretraining · Agent research · Multimodal · Audio & speech · Code gen · Model serving · Fine-tuning |
| Mistral AI | AI Scientist - Paris/London - Onsite or Hybrid or Remote | AI Frontier | 10 | Frontier research · Pretraining · Fine-tuning · Model serving · Multimodal · Audio & speech · Agent research |
| Mistral AI | AI Scientist - Palo Alto | AI Frontier | 10 | Frontier research · Pretraining · Agent research · Multimodal · Audio & speech · Code gen · Model serving |
| OpenAI | Researcher, Loss of Control | AI Frontier | 10 | Agent orchestration · Tool use · Guardrails · LLM observability · Agent research |
| Anthropic | Research Engineer, Machine Learning (Reinforcement Learning) | AI Frontier | 10 | Agent orchestration · Tool use · RL post-training · Frontier research · Code gen |
| Anthropic | Research Engineer, Frontier Red Team (Autonomy) | AI Frontier | 10 | Agent orchestration · Tool use · Guardrails · Embodied AI · RL robotics · Agent research |
| OpenAI | Research Engineer, Frontier Evals & Environments - Finance | AI Frontier | 10 | Frontier research · Agent research |
| Anthropic | Anthropic AI Safety Fellow, UK | AI Frontier | 10 | Frontier research · Interpretability · Guardrails · RLHF |
| Anthropic | Anthropic AI Safety Fellow, US | AI Frontier | 10 | Frontier research · Interpretability · Guardrails · RL post-training |
| Anthropic | Staff Research Engineer, Discovery Team | AI Frontier | 10 | Frontier research · Pretraining · Fine-tuning · Inference infra · Model serving · Agent orchestration |
| OpenAI | Research Engineer, Frontier Evals & Environments | AI Frontier | 10 | RL robotics · Agent research · Frontier research · LLM observability · RL post-training |
| OpenAI | AI Deployment Engineer - Startups | AI Frontier | 9 | Agent orchestration · LLM observability · Fine-tuning |
| Mistral AI | AI Scientist - Warsaw | AI Frontier | 9 | Frontier research · Pretraining · Agent research · Multimodal · Audio & speech · Code gen · Fine-tuning · Model serving |
| Harvey | Staff Software Engineer, Agents | AI Frontier | 9 | Agent orchestration · Tool use · LLM observability · RAG |
| Harvey | Staff Software Engineer, Agents | AI Frontier | 9 | Agent orchestration · Tool use · LLM observability · RAG · Model serving |
| Harvey | Software Engineer, Agents | AI Frontier | 9 | Agent orchestration · Tool use · RAG · LLM observability · Model serving |
| Mistral AI | Model Behavior Architect | AI Frontier | 9 | Guardrails · LLM observability · Agent orchestration · Tool use · Fine-tuning · RL post-training |
| Anthropic | Research Engineer, Search and Knowledge Post-Training | AI Frontier | 9 | RL post-training · Search & ranking · RAG · Agent research · Frontier research · LLM observability |
| Sierra | Agent Engineer, TLM | AI Frontier | 9 | Agent orchestration · RAG · LLM observability |
| OpenAI | Researcher, Alignment Training | AI Frontier | 9 | Synthetic data · RL post-training · Frontier research · Interpretability |
| Anthropic | Technical Program Manager, Research | AI Frontier | 9 | RL post-training · Model serving |
| OpenAI | Researcher, Alignment Science | AI Frontier | 9 | RL post-training · LLM observability · Guardrails · Interpretability |
| Anthropic | Research Engineer, Model Evaluations | AI Frontier | 9 | LLM observability · Model serving · Agent research · Fine-tuning · RL post-training |
| Mistral AI | AI Engineer, Product | AI Frontier | 9 | Agent orchestration · Tool use · LLM observability · Model serving · Search & ranking · Multimodal |
| OpenAI | Researcher, Agentic Post-Training | AI Frontier | 9 | RL post-training · Agent orchestration · Tool use · Multi-agent · LLM observability · Fine-tuning |
| Anthropic | Research Engineer, RL Infrastructure (Knowledge Work) | AI Frontier | 9 | LLM observability · Inference infra · Model serving · RL post-training · Agent orchestration |
| OpenAI | Machine Learning Engineer, API Multicloud | AI Frontier | 9 | Fine-tuning · RL post-training · Model serving · Agent orchestration · Tool use · Audio & speech |
| Anthropic | Research Engineer, Safeguards Labs | AI Frontier | 9 | Guardrails · Agent orchestration · Agent research · Fine-tuning · RL post-training |
| xAI | Member of Technical Staff - Multimodal Understanding | AI Frontier | 9 | Multimodal · Pretraining · Fine-tuning · Model serving · Inference infra · Vision · Audio & speech · Agent orchestration · Tool use · Frontier research |