124 AI roles tagged agent_research.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| NVIDIA | Applied Deep Learning PhD Research Intern, Reinforcement Learning for LLMs - Fall 2026 | Semiconductors | 10 | RL post-training · Frontier research · LLM observability · Fine-tuning |
| OpenAI | Researcher, Misalignment Research | AI Frontier | 10 | Evals · Guardrails · Frontier research |
| Cohere | Research Internship Reinforcement Learning (Summer) | AI Frontier | 10 | RL post-training · Fine-tuning · Code gen · Agent orchestration · Frontier research |
| Mistral AI | AI Scientist - Zurich | AI Frontier | 10 | Frontier research · Pretraining · Multimodal · Audio & speech · Code gen · Evals · Model serving · Fine-tuning |
| Mistral AI | AI Scientist - Paris/London - Onsite or Hybrid or Remote | AI Frontier | 10 | Frontier research · Pretraining · Fine-tuning · Evals · Model serving · Multimodal · Audio & speech |
| Mistral AI | AI Scientist - Palo Alto | AI Frontier | 10 | Frontier research · Pretraining · Multimodal · Audio & speech · Code gen · Evals · Model serving |
| OpenAI | Researcher, Loss of Control | AI Frontier | 10 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability |
| MongoDB | Senior Research Scientist | Enterprise | 10 | Frontier research · RL post-training · Code gen · Agent orchestration |
| Cursor | Research Scientist | Coding AI | 10 | RL post-training · Frontier research · Code gen · Evals |
| Anthropic | Research Engineer, Frontier Red Team (Autonomy) | AI Frontier | 10 | Agent orchestration · Tool use · Evals · Guardrails · Embodied AI · RL robotics |
| OpenAI | Researcher, Synthetic RL | AI Frontier | 10 | RL post-training · Synthetic data · Frontier research |
| NVIDIA | Research Scientist, Generalist Embodied Agent Research - PhD New College Grad 2026 | Semiconductors | 10 | Embodied AI · Multimodal · RL robotics · Model serving · Inference infra · Synthetic data |
| Cohere | Senior Research Scientist, Cohere Labs | AI Frontier | 10 | Frontier research · Multimodal |
| OpenAI | Research Engineer, Frontier Evals & Environments - Finance | AI Frontier | 10 | Evals · Frontier research |
| OpenAI | Research Engineer/Research Scientist, RL/Reasoning | AI Frontier | 10 | RL post-training · Frontier research · Model serving · Agent orchestration |
| OpenAI | Research Engineer, Frontier Evals & Environments | AI Frontier | 10 | Evals · RL robotics · Frontier research · LLM observability · RL post-training |
| Anthropic | Research Engineer, Machine Learning (Reinforcement Learning) | AI Frontier | 10 | RL post-training · Agent orchestration · Tool use · Frontier research · Code gen · Inference infra · Model serving |
| Scale AI | Machine Learning Engineer, Global Public Sector | Data AI | 10 | Agent orchestration · Frontier research · Guardrails · RL post-training |
| GE Healthcare | Staff AI Scientist | Healthcare | 9 | Agent orchestration · Tool use · Evals · Guardrails · LLM observability · RAG · Fine-tuning · Model serving · Multimodal |
| GE Healthcare | Staff AI Scienitist | Healthcare | 9 | Agent orchestration · Tool use · Fine-tuning · RAG · LLM observability · Guardrails · Evals · Multimodal · Frontier research · Model serving |
| Autodesk | Research Lead / Principal Scientist & Manager Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab: Toronto · Remote (CA) | Enterprise | 9 | RL post-training · Agent orchestration · Evals · LLM observability |
| NVIDIA | Senior AI Security Researcher | Semiconductors | 9 | Evals · Guardrails · Model serving |
| Booking | Senior Machine Learning Scientist | Hospitality | 9 | Agent orchestration · RAG · LLM observability · Evals · Tool use |
| NVIDIA | Developer Relations Manager, Higher Education and Research - AI Agents | Semiconductors | 9 | Agent orchestration · Tool use · Evals · LLM observability · RAG · Multimodal · Embodied AI |
| Snorkel AI | AI Advocate, Open-Source & Research | Data AI | 9 | Evals · Fine-tuning · RL post-training · Agent orchestration · Tool use · Frontier research · LLM observability |
| Mistral AI | AI Scientist - Warsaw | AI Frontier | 9 | Frontier research · Pretraining · Multimodal · Audio & speech · Code gen · Evals · Fine-tuning · Model serving |
| Cognition | Research, Post-Training | Coding AI | 9 | RL post-training · RLHF · Reward modeling · Evals · Agent orchestration |
| Anthropic | Research Engineer, Search and Knowledge Post-Training | AI Frontier | 9 | RL post-training · Evals · Search & ranking · RAG · Frontier research · LLM observability |
| Visa | Head of Generative AI Research | Fintech | 9 | Frontier research · Pretraining · Multimodal · Agent orchestration · LLM observability · Model serving |
| CrowdStrike | Vice President, Agentic Systems | Enterprise | 9 | Agent orchestration |