Currently tracking 124 active AI roles, with 106 new openings in the last 4 weeks. Primary focus: Agent · Engineering. Salary range $46k–$850k (avg $405k).
Anthropic has 145 active AI-related job listings. The majority of these roles are focused on agents, comprising 28% of the total. Engineering is the most frequent function, with 74 listings, followed by Research with 51. The company is primarily hiring in the United States, with 118 positions, and the United Kingdom, with 22. Frequent tech tags include model_serving, evals, and agent_orchestration, suggesting a focus on deployment and evaluation of AI systems. In the last 30 days, Anthropic posted 16 new AI roles, a 47% decrease compared to the previous 30-day period.
Anthropic currently has 132 active AI-related roles in our index. The most common open titles are: Applied AI Architect, Industries (2); Regional Research Economist, Economic Research (2); Research Engineer, Machine Learning (RL Velocity) (2); Research Engineer, Production Model Post-Training (2); Staff Software Engineer, AI Reliability Engineering (2). Most positions are in Engineering and Research.
Anthropic's active AI hiring is concentrated in: agents (28%), serving infrastructure (17%), post-training (14%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Anthropic is hiring AI talent in: United States (106 roles), United Kingdom (20 roles), Canada (6 roles), Ireland (5 roles).
Job postings at Anthropic most frequently reference model serving, evals, LLM observability, agent orchestration, and inference infrastructure.
In the past 30 days, Anthropic has posted 29 new AI-related roles. That is a +61% change versus the prior 30 days (18 → 29).
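The +61% figure is consistent with the standard percent-change formula applied to the two counts quoted above (18, then 29). A minimal sketch in Python, for illustration only: the helper name `percent_change` is hypothetical and is not taken from the index itself.

```python
def percent_change(previous: int, current: int) -> float:
    """Return the change from previous to current as a percentage of previous."""
    if previous == 0:
        # Assumption: treat a zero baseline as an error rather than returning infinity.
        raise ValueError("percent change is undefined when the previous count is 0")
    return (current - previous) / previous * 100


# Counts quoted above: 18 roles in the prior 30 days, 29 in the latest 30 days.
change = percent_change(18, 29)
print(f"{change:+.0f}%")  # prints +61%
```

The unrounded value is about 61.1%, so the rounded +61% shown above matches.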
| Title | Description | Stage | AI score |
|---|---|---|---|
| Research Engineer, RL Infrastructure (Knowledge Work) | Research Engineer focused on the reliability, observability, and infrastructure of training environments and evaluation systems for AI models, ensuring stability and quality as they scale. The role involves proactive hardening, building tooling for early problem detection, and serving as a dedicated owner for environment health and evaluation integrity. | Eval Gate, Data | 9 |
| Research Engineer, Safeguards Labs | Research Engineer focused on AI safety, investigating novel methods for detecting misuse, strengthening model safeguards, and building evaluation methodologies for AI systems, particularly in agentic workflows. The role involves leading research projects, designing offline analyses, developing prototypes, and collaborating with production teams. | Eval Gate | 9 |
| Research Lead, Training Insights | Research Lead focused on developing and executing strategies for measuring and characterizing model capabilities across training and deployment. This role involves driving original research into new evaluation methodologies, leading a team, and spanning the full lifecycle of model development, from pretraining to deployment. The work includes creating long-horizon evaluations, measuring emerging capabilities, and understanding their development during RL training and post-training. The role also involves cross-organizational collaboration to map evaluation landscapes and identify gaps, shaping the evaluation narrative for model releases, and contributing to the broader research community. | Eval Gate, Post-train | 9 |
| Model Quality Software Engineer, Claude Code | Staff Software Engineer to set technical direction at the intersection of engineering and research on the Claude Code team. Architect systems, tooling, and evaluation infrastructure to measure, understand, and improve Claude's coding capabilities. Drive architecture, mentor engineers, and influence the direction of Claude Code. | Eval Gate, Agent | 9 |
| ML Infrastructure Engineer, Safeguards | ML Infrastructure Engineer focused on building and scaling critical infrastructure for AI safety systems, including real-time and batch classifier/safety evaluations, monitoring, and optimizing inference for safety-critical applications. | Eval Gate, Serve | 9 |
| Product Manager, Safeguards Rare Harms | Product Manager for Anthropic's Safeguards team, focusing on building and deploying systems to ensure AI safety and prevent misuse. This role involves ideation, design, development, and UX for safeguards, working closely with research and product teams to mitigate risks associated with frontier models across various platforms. | Eval Gate, Agent | 8 |
| Engineering Manager, Agent Prompts & Evals | Engineering Manager to lead the Agent Prompts & Evals team, responsible for the infrastructure that enables shipping model and prompt changes with confidence. This includes eval frameworks, system prompt pipelines, and regression-detection systems. The team acts as a platform for model behavior, sitting between product engineering and research, and partners with other evals groups and product teams. The role requires leading and growing a team, owning the product-side eval platform and system prompt infrastructure, managing model launches, fostering collaboration, recruiting engineers, and shaping team investment in areas like frontier eval development and launch automation. | Eval Gate, Agent | 8 |
| Biological Safety Research Scientist | Research Scientist focused on biological safety for AI systems, applying technical skills to design and develop safety systems that detect harmful behaviors and prevent misuse. This role involves designing and executing capability evaluations, collaborating on training data and safety system training, analyzing performance, and stress-testing safeguards. The goal is to ensure biological safety is embedded throughout the model development lifecycle, balancing AI's potential in life sciences with preventing misuse. | Eval Gate, Post-train | 8 |
| Data Scientist, Safeguards | This role focuses on building and scaling a data-driven culture within an AI company, specifically for safeguards. The Data Scientist will analyze user behavior, define key metrics, identify opportunities for product improvement, design and analyze experiments, and establish data best practices to inform product and commercial strategy for safe, frontier AI deployment. | Eval Gate | 7 |
| Safeguards Enforcement Analyst, Safety Evaluations | This role focuses on evaluating AI models against safety and policy standards, running and monitoring evaluations, driving mitigations, and coordinating the creation of new evaluation frameworks. It involves cross-functional collaboration with policy experts and engineering teams to ensure model behavior meets required standards and to build scalable processes for evaluation. | Eval Gate | 5 |
| Technical Program Manager, Safeguards (Infrastructure & Evals) | Technical Program Manager for Safeguards Infrastructure and Evals at Anthropic. This role focuses on owning the operational health, reliability, and forward momentum of AI safety infrastructure, including classifiers, detection pipelines, evaluation platforms, and monitoring systems. Responsibilities include driving incident response, post-mortem execution, establishing and maintaining SLOs with partner teams, maintaining runbook quality, managing platform migrations, and coordinating improvements to the evals platform. Requires technical depth in production ML systems and strong program management skills in operational and infrastructure-heavy environments. | Eval Gate, Serve | 5 |