661 AI roles tagged agent_research.

| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Sierra | Software Engineer, Agent (French speaking) | AI Frontier | 9 | Agent orchestration · Model serving · Evals · RAG |
| Anthropic | Research Engineer / Scientist, Robustness | AI Frontier | 9 | Evals · RL post-training · Guardrails · LLM observability · Frontier research |
| OpenAI | Research Engineer, Codex | AI Frontier | 9 | Agent orchestration · Code gen · Inference infra · Model serving · Evals |
| Jane Street | Machine Learning Researcher | Quant | 9 | Pretraining · Frontier research · Recommender systems · RL robotics · Vision · Code gen |
| Cohere | Member of Technical Staff, Next Generation Agents | AI Frontier | 9 | Agent orchestration · Fine-tuning · RL post-training · Synthetic data |
| Amazon | Member of Technical Staff, Applied Science - People Leader, AGI Autonomy | Big Tech | 9 | Agent orchestration · RL post-training · Frontier research · Vision |
| Anthropic | Data Operations Manager, Knowledge | AI Frontier | 9 | Evals · Frontier research |
| ByteDance | Tech Lead, Research Scientist/Engineer - AI Infrastructure | Big Tech | 9 | Inference infra · Model serving · RL post-training · Frontier research |
| Datadog | AI Research Scientist – Datadog AI Research (DAIR) | Enterprise | 9 | Multimodal · Frontier research · RL post-training · Agent orchestration |
| Anthropic | Research Engineer / Scientist, Alignment Science | AI Frontier | 9 | RL post-training · Evals · Guardrails · LLM observability · Frontier research · Interpretability · Agent orchestration · RL robotics |
| Anthropic | Research Engineer / Scientist, Alignment Science - London | AI Frontier | 9 | RL post-training · Evals · RL robotics |
| OpenAI | Researcher, Safety Oversight | AI Frontier | 9 | Evals · Guardrails · RL post-training · Interpretability |
| Datadog | AI Research Scientist - Datadog AI Research (DAIR) | Enterprise | 9 | Frontier research · Multimodal · RL post-training · Evals |
| Anthropic | Research Scientist, Frontier Red Team (CBRN, Biosecurity) | AI Frontier | 9 | Evals · Guardrails · Fine-tuning |
| Anthropic | Research Scientist, Frontier Red Team (Autonomy) | AI Frontier | 9 | Evals · Agent orchestration · LLM observability |
| Anthropic | Research Engineer / Scientist, Safeguards | AI Frontier | 9 | RL post-training · Evals · Guardrails · Interpretability · Agent orchestration · RL robotics |
| Scale AI | Machine Learning Research Engineer, GenAI Applied ML | Data AI | 9 | Agent orchestration · Evals · LLM observability |
| Scale AI | Senior / Staff Machine Learning Research Scientist, Agents | Data AI | 9 | Agent orchestration · Fine-tuning · Evals |
| Anthropic | Research Manager, Horizons | AI Frontier | 9 | Frontier research · RL post-training · Code gen · Tool use |
| Anthropic | Research Engineer, Machine Learning (Horizons) | AI Frontier | 9 | RL post-training · Code gen · Agent orchestration |
| Moveworks | Senior Machine Learning Engineer II, NLU & Agentic AI | Enterprise | 9 | Agent orchestration · Fine-tuning · RLHF · Evals · Multimodal · Model serving · LLM observability |
| Anthropic | Research Engineer, Agents | AI Frontier | 9 | Agent orchestration · Tool use · Evals · Fine-tuning |
| Instacart | Machine Learning Engineer, PhD Intern | Consumer | 9 | LLM observability · RAG · Fine-tuning · Inference infra · Model serving · Recommender systems · Search & ranking · Evals |
| Jane Street | Machine Learning Researcher | Quant | 9 | Pretraining · Fine-tuning · RL post-training · Recommender systems · Search & ranking |
| Nuro | Machine Learning Research Scientist, Behavior Planning and Prediction | Robotics | 9 | Embodied AI · RL robotics · Fine-tuning |
| Scale AI | Senior Machine Learning Engineer, Public Sector | Data AI | 9 | Agent orchestration · Fine-tuning · Vision · Multimodal · LLM observability · Model serving |
| Jane Street | Machine Learning Researcher | Quant | 9 | Pretraining · Fine-tuning · Recommender systems · Multi-agent · Frontier research |
| | Software Engineer, Acceleration Platform Team | Big Tech | 8 | Agent orchestration · RAG · Guardrails · LLM observability |
| | Software Engineer II, Google Threat Intelligence | Big Tech | 8 | Agent orchestration · Tool use · Fine-tuning · LLM observability |