AI Frontier · AI lab
Currently tracking 124 active AI roles, with 106 new openings in the last 4 weeks. Primary focus: Agent · Engineering. Salary range $46k–$850k (avg $405k).
Anthropic has 145 active AI-related job listings. The majority of these roles are focused on agents, comprising 28% of the total. Engineering is the most frequent function, with 74 listings, followed by Research with 51. The company is primarily hiring in the United States, with 118 positions, and the United Kingdom, with 22. Frequent tech tags include model_serving, evals, and agent_orchestration, suggesting a focus on deployment and evaluation of AI systems. In the last 30 days, Anthropic posted 16 new AI roles, a 47% decrease compared to the previous 30-day period.
Anthropic currently has 132 active AI-related roles in our index. The most common open titles are: Applied AI Architect, Industries (2), Regional Research Economist, Economic Research (2), Research Engineer, Machine Learning (RL Velocity) (2), Research Engineer, Production Model Post-Training (2), Staff Software Engineer, AI Reliability Engineering (2). Most positions are in Engineering and Research.
Anthropic's active AI hiring is concentrated in: agents (28%), serving infrastructure (17%), post-training (14%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Anthropic is hiring AI talent in: United States (106 roles), United Kingdom (20 roles), Canada (6 roles), Ireland (5 roles).
Job postings at Anthropic most frequently reference: model serving, evals, llm observability, agent orchestration, inference infra.
In the past 30 days, Anthropic has posted 29 new AI-related roles. That is a +61% change versus the prior 30 days (18 → 29).
| Title | Stage | AI score |
|---|---|---|
| Research Scientist, Interpretability Research Scientist focused on mechanistic interpretability of LLMs, aiming to understand how trained models work by reverse-engineering their parameters and algorithms. The role involves developing methods, designing experiments, creating interpretability features, building infrastructure, and collaborating with other teams. Requires strong scientific research background with some interpretability work, comfort with experimental science, and proficiency in Python. | Post-train | 10 |
| [Expression of Interest] Research Manager, Interpretability Research Manager for the Interpretability team, focusing on mechanistic interpretability to understand how large language models work internally and ensure AI safety. The role involves partnering with a research lead on direction, project planning, execution, hiring, and people development, translating research ideas into tangible goals, and overseeing their execution. This is a management role, distinct from individual contributor research scientist or engineer roles. |
| Post-train |
| 10 |
| Research Engineer, Pretraining Scaling Research Engineer focused on training production pretrained models at scale, involving performance optimization, debugging, experimental design, and incident response during model launches. The role bridges research and engineering, working across the full training stack. | Pretrain | 10 |
| Research Engineer/Research Scientist, Pre-training Research Engineer/Scientist focused on pre-training large language models, involving research in model architecture, algorithms, data processing, and optimizer development, as well as optimizing and scaling training infrastructure. | Pretrain | 10 |
| Staff Research Engineer, Discovery Team Staff Research Engineer focused on building AI systems capable of scientific discovery and long-horizon reasoning, working across the full model stack from training to inference and agentic systems. | PretrainAgent | 10 |
| Research Engineer, Machine Learning (Reinforcement Learning) Research Engineer focused on Reinforcement Learning to advance capabilities and safety of large language models. This role involves implementing novel approaches, contributing to research direction, creating agentic models via tool use for tasks like computer use and autonomous software generation, and improving reasoning abilities. Projects include architecting RL infrastructure, designing training environments and evaluations for RL agents, driving performance improvements, and developing automated testing frameworks. | Post-trainAgent | 10 |
| Research Engineer, Domain Scaling Research Engineer focused on scaling AI models for real-world knowledge work in domains like finance, healthcare, and legal. This role involves owning the end-to-end data strategy, from sourcing tasks to RL training, including designing reward signals, managing external data vendors, and developing QA frameworks to ensure environment quality and prevent reward hacking. It combines applied research with hands-on data work. | DataPost-train | 9 |
| Staff+ Software Engineer, Inference Runtime Staff+ Software Engineer for Anthropic's Inference Runtime team, focusing on the accelerator-agnostic core of their AI inference serving stack. The role involves setting technical direction, owning the architecture and roadmap, hands-on coding in Rust/Python, optimizing accelerator usage, and building validation systems. Requires deep systems engineering or ML infrastructure background with experience in performance optimization and large-scale distributed systems. | Serve | 9 |
| Research Engineer, Code RL (Reinforcement Learning) Research Engineer focused on Reinforcement Learning for code generation, aiming to improve models' ability to write, edit, test, debug, and ship software. This role involves designing RL environments, building reward signals, running training experiments, and improving pipeline efficiency, blending research with engineering implementation. | Post-trainAgent | 9 |
| Software Engineer, Safeguards Evals Software Engineer role focused on building and owning the evaluation infrastructure for an agentic investigation system. This involves designing experiments, constructing high-quality eval datasets, measuring agent performance, analyzing coverage gaps, and productionizing research into release pipelines. The role also involves building tooling for policy experts and constructing RL environments to improve safety investigation capabilities. | AgentEval Gate | 9 |
| Product Manager, Claude Code Model Performance Product Manager for Anthropic's Claude Code Model Performance team, responsible for driving model launches, building agentic evals, and translating research improvements into developer-facing outcomes. Requires experience building agentic evals, a systems thinking approach, and comfort with both research and engineering. | AgentEval Gate | 9 |
| Research Scientist, Life Sciences Research Scientist role focused on improving AI model capabilities for life sciences tasks. This involves building agentic tools, designing evaluation benchmarks, and applying post-training techniques to enhance model performance on scientific workflows like bioinformatics, database queries, and literature synthesis. The role bridges ML, software engineering, and biology to make AI a better research assistant in life sciences. | Post-trainAgent | 9 |
| Technical Program Manager, Research This role is a Technical Program Manager for Anthropic's research organization. The TPM will define and build programs for research teams, focusing on areas like compute, evals, and RL environments. They will drive end-to-end execution of complex research initiatives, establish processes, and ensure operational health of RL environments. The role requires a background in ML research or engineering, experience building technical programs from scratch, and the ability to navigate ambiguity in fast-moving research environments. | Post-trainData | 9 |
| Research Engineer, RL Infrastructure (Knowledge Work) Research Engineer focused on the reliability, observability, and infrastructure of training environments and evaluation systems for AI models, ensuring stability and quality as they scale. The role involves proactive hardening, building tooling for early problem detection, and serving as a dedicated owner for environment health and evaluation integrity. | Eval GateData | 9 |
| Research Engineer, Machine Learning (RL Velocity) Research Engineer focused on building and improving the RL training infrastructure and tooling at Anthropic. The role involves identifying and removing bottlenecks in the RL stack, partnering with researchers and other engineering teams, and owning the reliability and performance of research runs to enable faster iteration and shipping of better models at scale. | DataPost-train | 9 |
| Research Engineer, Safeguards Labs Research Engineer focused on AI safety, investigating novel methods for detecting misuse, strengthening model safeguards, and building evaluation methodologies for AI systems, particularly in agentic workflows. The role involves leading research projects, designing offline analyses, developing prototypes, and collaborating with production teams. | Eval GatePost-train | 9 |
| Anthropic Fellows Program — Reinforcement Learning This is a research fellowship program focused on Reinforcement Learning (RL) within AI safety. Fellows will work on empirical projects, potentially using external infrastructure, with the goal of producing public outputs like paper submissions. The program emphasizes mentorship from Anthropic researchers and provides a stipend and compute funding. Key activities include building model-based tools for data quality, understanding generalization, and creating RL environments for capabilities and safety tasks. | Post-train | 9 |
| Anthropic Fellows Program — AI Safety This is a research fellowship program focused on AI safety, aiming to foster talent in empirical AI research. Fellows will work on projects aligned with Anthropic's research priorities, using external infrastructure and external models, with the goal of producing public outputs like paper submissions. Key research areas include Scalable Oversight, Adversarial Robustness and AI Control, Model Organisms, Model Internals / Mechanistic Interpretability, and AI Welfare. | Post-train | 9 |
| Research Engineer, Performance RL Research Engineer focused on Reinforcement Learning for code generation and accelerator performance, aiming to improve model reasoning and coding capabilities. The role involves inventing RL environments, conducting experiments, shaping research roadmaps, and delivering work into training runs, with a strong emphasis on collaboration and scaling research innovations. | Post-trainData | 9 |
| Security Labs Engineer This role focuses on executing security R&D projects end-to-end, building novel security infrastructure, and driving successful experiments toward production scale. It involves working with research teams to test security controls, evaluating new security technologies, and documenting results to inform future security architecture. The role spans from initial project scoping to potential production deployment, with a focus on high-assurance environments and AI-assisted security tooling. | ServeShip | 9 |
| Research Lead, Training Insights Research Lead focused on developing and executing strategies for measuring and characterizing model capabilities across training and deployment. This role involves driving original research into new evaluation methodologies, leading a team, and spanning the full lifecycle of model development, from pretraining to deployment. The work includes creating long-horizon evaluations, measuring emerging capabilities, and understanding their development during RL training and post-training. The role also involves cross-organizational collaboration to map evaluation landscapes and identify gaps, shaping the evaluation narrative for model releases, and contributing to the broader research community. | Eval GatePost-train | 9 |
| Model Quality Software Engineer, Claude Code Staff Software Engineer to set technical direction at the intersection of engineering and research on the Claude Code team. Architect systems, tooling, and evaluation infrastructure to measure, understand, and improve Claude's coding capabilities. Drive architecture, mentor engineers, and influence the direction of Claude Code. | Eval GateAgent | 9 |
| Research Engineer / Scientist, Frontier Red Team (Cyber) Research Engineer/Scientist focused on AI-enabled cybersecurity, developing tools and frameworks for autonomous vulnerability discovery, remediation, malware detection, and pentesting. Designs and runs experiments to evaluate AI cyber capabilities and builds infrastructure for AI systems operating in security environments. Translates findings into demonstrations for policymakers and collaborates with external experts. Senior candidates will set research strategy and own the technical roadmap. | AgentEval Gate | 9 |
| Research Engineer, Universes Research Engineer role focused on building next-generation agentic environments for training AI models. This role involves implementing novel approaches, contributing to research direction, designing training environments and methodologies, and building evaluations for capable and safe agentic AI. It blends research and engineering, with a focus on reinforcement learning and complex, long-horizon agentic tasks. | AgentPost-train | 9 |
| Anthropic Fellows Program — AI Security This is a research fellowship program focused on AI safety and security, aiming to produce public outputs like paper submissions. Fellows will use external infrastructure and open-source models, working on empirical projects with mentorship from Anthropic researchers. | Post-train | 9 |
| Anthropic Fellows Program Anthropic's Fellows Program offers a 4-month full-time research opportunity focused on AI safety and related areas. Fellows will use external infrastructure and open-source models to conduct empirical projects, aiming for public outputs like paper submissions, with mentorship from Anthropic researchers. The program is designed to foster AI research and engineering talent, regardless of previous experience, and emphasizes safety, interpretability, and steerability of AI systems. | Pretrain | 9 |
| Research Engineer, Cybersecurity Reinforcement Learning Research Engineer role focused on applying reinforcement learning to cybersecurity tasks like secure coding and vulnerability remediation, blending research and engineering to train safe AI models. Requires cybersecurity domain expertise and ML/software engineering skills. | Post-trainData | 9 |
| Research Engineer, Interpretability Research Engineer focused on building and maintaining specialized infrastructure for interpretability research in AI systems. This involves developing tools for model analysis, optimizing training and inference pipelines, and ensuring reliability for safety audits, with a strong emphasis on understanding and controlling model behavior. | Post-trainServe | 9 |
| Machine Learning Systems Engineer, RL Engineering ML Systems Engineer focused on Reinforcement Learning Engineering to build, maintain, and improve the algorithms and infrastructure for training AI models like Claude using RLHF and other advanced techniques. The role emphasizes improving system performance, robustness, and usability to accelerate research breakthroughs in AI capabilities and safety. | Post-train | 9 |
| Machine Learning Systems Engineer, Research Tools Machine Learning Systems Engineer focused on developing and optimizing encodings and tokenization systems for Anthropic's Finetuning workflows. This role acts as a bridge between Pretraining and Finetuning teams, building infrastructure crucial for model learning and data interpretation, impacting research progress and efficiency. | DataPost-train | 9 |
| Research Engineer / Research Scientist, Tokens Research Engineer/Scientist role focused on building large-scale ML systems, touching all parts of code and infrastructure, from cluster reliability and job efficiency to running scientific experiments and improving dev tooling. The role involves optimizing ML systems, comparing model variants, scaling training jobs, and designing fault tolerance strategies, with a focus on safe, steerable, and trustworthy AI. | PretrainServe | 9 |
| ML/Research Engineer, Safeguards ML/Research Engineer focused on detecting and mitigating misuse of AI systems, building classifiers, monitoring for harms, evaluating agentic product safety, and conducting research on red-teaming and adversarial robustness. | AgentData | 9 |
| Performance Engineer, GPU This role focuses on optimizing GPU performance and systems engineering for large language models, specifically improving utilization and efficiency for inference and training at scale. It involves deep work in GPU programming, custom kernel development, and distributed systems. | ServePretrain | 9 |
| ML Infrastructure Engineer, Safeguards ML Infrastructure Engineer focused on building and scaling critical infrastructure for AI safety systems, including real-time and batch classifier/safety evaluations, monitoring, and optimizing inference for safety-critical applications. | Eval GateServe | 9 |
| Engineering Manager, GPU (ML Accelerator) Engineering Manager for Anthropic's performance and scaling teams, focusing on optimizing compute resources for inference and training systems. The role involves leadership, technical contribution, bottleneck identification, and ensuring efficiency in large-scale ML systems, with a strong emphasis on GPU/accelerator programming and ML/OS internals. | ServeData | 9 |
| TPU Kernel Engineer TPU Kernel Engineer responsible for identifying and addressing performance issues across ML systems (research, training, inference), with a focus on designing and optimizing kernels for TPUs. Provides feedback to researchers on model performance impact. | ServePost-train | 9 |
| Research Engineer, Discovery Research Engineer focused on building and optimizing infrastructure for AI scientist training, evaluation, and inference. The role involves identifying and resolving infra blockers, developing evaluation frameworks, managing data pipelines, and optimizing training/inference for reinforcement learning in distributed environments. | ServeData | 9 |
| Research Engineer / Scientist, Alignment Science Research Engineer/Scientist focused on AI safety and alignment, conducting experiments to understand and steer AI behavior, with a focus on risks from powerful future systems. Involves collaboration with interpretability, fine-tuning, and red teaming teams. Explores scalable oversight, AI control, stress-testing, automated alignment research, alignment assessments, safeguards research, and model welfare. | Post-trainAgent | 9 |
| Research Engineer, Production Model Post-Training Research Engineer focused on post-training of large language models, including techniques like Constitutional AI and RLHF, to enhance model capabilities, alignment, and safety for production Claude models. Involves implementing, scaling, and optimizing these processes, conducting research for improvements, and developing evaluation tools. | Post-train | 9 |
| [Expression of Interest] Research Scientist / Engineer, Honesty Research Scientist/Engineer focused on honesty in language models, developing techniques to minimize hallucinations and enhance truthfulness. This involves data curation, classifier development, evaluation frameworks, RAG implementation, human feedback collection, prompting pipelines, RL environments, and tools for human evaluators. | Post-trainAgent | 9 |
| Research Engineer, Knowledge Team Research Engineer focused on redesigning how LLMs interact with external data sources by designing new information architectures and training models to use them. Responsibilities include implementing information architecture strategies, performing finetuning and RL, building knowledge base eval sets, and designing agentic search capabilities. Requires strong Python, ML research experience, and experience with LLMs. Experience with complex agentic systems, RAG, and distributed information retrieval is a plus. | AgentPost-train | 9 |
| Product Manager, Safeguards Rare Harms Product Manager for Anthropic's Safeguards team, focusing on building and deploying systems to ensure AI safety and prevent misuse. This role involves ideation, design, development, and UX for safeguards, working closely with research and product teams to mitigate risks associated with frontier models across various platforms. | Eval GateAgent | 8 |
| Engineering Manager, Cloud Safety Engineering Manager to lead the Cloud Safety team, responsible for scaling and optimizing Claude's serving infrastructure across Cloud Service Providers (CSPs). The role involves owning end-to-end safety, including API, inference, classifiers, fraud detection, data management, and operations, to ensure safe usage and enable the launch of new models and features at scale. | Serve | 8 |
| Product Engineer, Computer Use Product Engineer role focused on building and shipping AI-powered computer-use and browser-control product surfaces. This involves full-stack development, agent harness, and working with LLM APIs and agent frameworks. The role requires end-to-end ownership and iteration based on user feedback, with a focus on reliability and robustness of the agent harness. | Agent | 8 |
| Software Engineer, RL Data Software Engineer on the RL Data team responsible for building systems that produce high-quality reinforcement learning data for Claude. This includes data collection pipelines, human feedback tooling, execution environments, and quality assurance. The role involves end-to-end ownership of stack components, iterating on prompts and evals, developing QA frameworks, hardening execution environments, and collaborating with domain experts and operations partners. | DataPost-train | 8 |
| Engineering Manager, Cybersecurity Products Engineering Manager for AI-powered cybersecurity products, leading a team to prototype and ship products using frontier models. The role involves setting technical direction, partnering with research, and staying close to customers. It requires hands-on technical involvement, product instincts, and scaling the team. | AgentShip | 8 |
| Software Engineer, Claude Design Software Engineer to build and shape Claude Design, a product that lets users collaborate with Claude to create visual work. This is a frontend-leaning role focused on creating intuitive AI-generated design experiences, working closely with researchers and users to iterate and validate product concepts. | ShipAgent | 8 |
| Manager of Applied AI Architecture, Startups Manager of Applied AI Architecture for startups, leading a team of technical architects to drive adoption of frontier AI and help startups build AI-native products using the Claude Developer Platform. Focuses on technical partnership, building teams, defining playbooks, and influencing product roadmap. | Agent | 8 |
| Engineering Manager, Research Tools Engineering Manager for Anthropic's Research Tools team, focusing on building and improving systems for large-scale, distributed finetuning runs and enhancing researcher productivity. The role involves prioritizing team work, designing operational processes, coaching reports, and managing recruiting efforts to support rapid growth in AI model development and research. | Post-train | 8 |
| Manager of Applied AI Architecture, Enterprise Tech (Cyber) Manager of Applied AI Architecture, Enterprise Tech (Cyber) at Anthropic, responsible for leading a team that drives the adoption of Anthropic's AI products (Claude for Enterprise, Claude Code, API) within Enterprise Tech companies. This role involves technical guidance, pre-sales engagements, customer strategy, and ensuring the safe and reliable deployment of AI systems. | ShipAgent | 8 |