Anthropic has 145 active AI-related job listings. The largest share of these roles, 28% of the total, focuses on agents. Engineering is the most common function, with 74 listings, followed by Research with 51. The company is hiring primarily in the United States (118 positions) and the United Kingdom (22). Frequent tech tags include model_serving, evals, and agent_orchestration, suggesting a focus on deploying and evaluating AI systems. In the last 30 days, Anthropic posted 16 new AI roles, a 47% decrease from the previous 30-day period.
Currently tracking 124 active AI roles, with 106 new openings in the last 4 weeks. Primary focus: Agent · Engineering. Salary range $46k–$850k (avg $405k).
Anthropic currently has 132 active AI-related roles in our index. The most common open titles are: Applied AI Architect, Industries (2); Regional Research Economist, Economic Research (2); Research Engineer, Machine Learning (RL Velocity) (2); Research Engineer, Production Model Post-Training (2); Staff Software Engineer, AI Reliability Engineering (2). Most positions are in Engineering and Research.
Anthropic's active AI hiring is concentrated in: agents (28%), serving infrastructure (17%), post-training (14%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
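As a rough illustration of how stage shares like these could be computed, the sketch below tallies roles per lifecycle stage and converts the counts to percentages. The seven stage names come from the text above; the `Stage` enum, the `stage_shares` function, and the sample data are hypothetical, and the sketch assumes each role is counted once under a single primary stage, which the index may not do.

```python
from collections import Counter
from enum import Enum


class Stage(Enum):
    """The seven lifecycle stages named in the text."""
    DATA = "data"
    PRE_TRAINING = "pre-training"
    POST_TRAINING = "post-training"
    SERVING = "serving infrastructure"
    AGENTS = "agents"
    EVALUATION = "evaluation"
    APPLICATION = "application"


def stage_shares(stages):
    """Return {stage: whole-number percent of roles}, largest share first."""
    counts = Counter(stages)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {stage: round(100 * n / total) for stage, n in counts.most_common()}


# Hypothetical sample: one primary stage per role (not real index data).
sample = [Stage.AGENTS, Stage.AGENTS, Stage.SERVING, Stage.POST_TRAINING]

for stage, pct in stage_shares(sample).items():
    print(f"{stage.value}: {pct}%")
```

Because each share is rounded independently, the percentages need not sum to exactly 100.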
Anthropic is hiring AI talent in: United States (106 roles), United Kingdom (20 roles), Canada (6 roles), Ireland (5 roles).
Job postings at Anthropic most frequently reference: model serving, evals, LLM observability, agent orchestration, inference infra.
In the past 30 days, Anthropic has posted 29 new AI-related roles. That is a +61% change versus the prior 30 days (18 → 29).
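The period-over-period figure can be checked with a simple relative-change calculation. The sketch below assumes the change is computed as (current − previous) / previous; the `percent_change` helper is a hypothetical name, not part of the index.

```python
def percent_change(previous: int, current: int) -> float:
    """Relative change from `previous` to `current`, in percent."""
    if previous == 0:
        raise ValueError("percent change is undefined when the previous count is 0")
    return (current - previous) / previous * 100


# Counts quoted in the text: 18 roles in the prior 30 days, 29 in the latest.
change = percent_change(18, 29)
print(f"{change:+.0f}%")  # prints "+61%"
```

The explicit sign in the format string keeps increases and decreases visually distinct.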
| Title | Description | Stage | AI score |
|---|---|---|---|
| Staff+ Software Engineer, Inference Runtime | Staff+ Software Engineer for Anthropic's Inference Runtime team, focusing on the accelerator-agnostic core of their AI inference serving stack. The role involves setting technical direction, owning the architecture and roadmap, hands-on coding in Rust/Python, optimizing accelerator usage, and building validation systems. Requires deep systems engineering or ML infrastructure background with experience in performance optimization and large-scale distributed systems. | Serve | 9 |
| Software Engineer, Safeguards Evals | Software Engineer role focused on building and owning the evaluation infrastructure for an agentic investigation system. This involves designing experiments, constructing high-quality eval datasets, measuring agent performance, analyzing coverage gaps, and productionizing research into release pipelines. The role also involves building tooling for policy experts and constructing RL environments to improve safety investigation capabilities. | Agent, Eval Gate | 9 |
| Research Engineer, Machine Learning (RL Velocity) | Research Engineer focused on building and improving the RL training infrastructure and tooling at Anthropic. The role involves identifying and removing bottlenecks in the RL stack, partnering with researchers and other engineering teams, and owning the reliability and performance of research runs to enable faster iteration and shipping of better models at scale. | Data, Post-train | 9 |
| Security Labs Engineer | This role focuses on executing security R&D projects end-to-end, building novel security infrastructure, and driving successful experiments toward production scale. It involves working with research teams to test security controls, evaluating new security technologies, and documenting results to inform future security architecture. The role spans from initial project scoping to potential production deployment, with a focus on high-assurance environments and AI-assisted security tooling. | Serve, Ship | 9 |
| Prompt Engineer, Agent Prompts & Evals | This role focuses on prompt engineering and evaluation development for AI-first products and features, bridging model capabilities with user experience. It involves designing, testing, and optimizing prompts, building evaluation suites, supporting model launches, and contributing to prompt development frameworks. The role requires strong software engineering skills, LLM and prompt engineering experience, and understanding of evaluation methodologies. | Agent, Eval Gate | 9 |
| Model Quality Software Engineer, Claude Code | Staff Software Engineer to set technical direction at the intersection of engineering and research on the Claude Code team. Architect systems, tooling, and evaluation infrastructure to measure, understand, and improve Claude's coding capabilities. Drive architecture, mentor engineers, and influence the direction of Claude Code. | Eval Gate, Agent | 9 |
| Applied AI Engineer, Startups | Applied AI Engineer role focused on advising and partnering with AI-native startups to build on the Claude Developer Platform. Responsibilities include technical guidance, developing evaluation frameworks, designing scalable architectures, and creating technical resources to help startups succeed with Claude. Requires production experience with LLM-powered applications, agent architectures, and evaluation frameworks. | Agent | 9 |
| Research Engineer, Reward Models Platform | Research Engineer focused on building platforms and infrastructure to automate and accelerate the reward model development and evaluation workflows for ML researchers at Anthropic. The role involves creating tools for rubric development, human feedback analysis, reward robustness evaluation, and detecting reward hacks, with the goal of enabling rapid iteration and improving reward signal quality for training AI models. | Post-train | 9 |
| Research Engineer, Interpretability | Research Engineer focused on building and maintaining specialized infrastructure for interpretability research in AI systems. This involves developing tools for model analysis, optimizing training and inference pipelines, and ensuring reliability for safety audits, with a strong emphasis on understanding and controlling model behavior. | Post-train, Serve | 9 |
| Machine Learning Systems Engineer, RL Engineering | ML Systems Engineer focused on Reinforcement Learning Engineering to build, maintain, and improve the algorithms and infrastructure for training AI models like Claude using RLHF and other advanced techniques. The role emphasizes improving system performance, robustness, and usability to accelerate research breakthroughs in AI capabilities and safety. | Post-train | 9 |
| Machine Learning Systems Engineer, Research Tools | Machine Learning Systems Engineer focused on developing and optimizing encodings and tokenization systems for Anthropic's Finetuning workflows. This role acts as a bridge between Pretraining and Finetuning teams, building infrastructure crucial for model learning and data interpretation, impacting research progress and efficiency. | Data, Post-train | 9 |
| Performance Engineer, GPU | This role focuses on optimizing GPU performance and systems engineering for large language models, specifically improving utilization and efficiency for inference and training at scale. It involves deep work in GPU programming, custom kernel development, and distributed systems. | Serve, Pretrain | 9 |
| ML Infrastructure Engineer, Safeguards | ML Infrastructure Engineer focused on building and scaling critical infrastructure for AI safety systems, including real-time and batch classifier/safety evaluations, monitoring, and optimizing inference for safety-critical applications. | Eval Gate, Serve | 9 |
| Engineering Manager, GPU (ML Accelerator) | Engineering Manager for Anthropic's performance and scaling teams, focusing on optimizing compute resources for inference and training systems. The role involves leadership, technical contribution, bottleneck identification, and ensuring efficiency in large-scale ML systems, with a strong emphasis on GPU/accelerator programming and ML/OS internals. | Serve, Data | 9 |
| TPU Kernel Engineer | TPU Kernel Engineer responsible for identifying and addressing performance issues across ML systems (research, training, inference), with a focus on designing and optimizing kernels for TPUs. Provides feedback to researchers on model performance impact. | Serve, Post-train | 9 |
| Engineering Manager, Cloud Safety | Engineering Manager to lead the Cloud Safety team, responsible for scaling and optimizing Claude's serving infrastructure across Cloud Service Providers (CSPs). The role involves owning end-to-end safety, including API, inference, classifiers, fraud detection, data management, and operations, to ensure safe usage and enable the launch of new models and features at scale. | Serve | 8 |
| Product Engineer, Computer Use | Product Engineer role focused on building and shipping AI-powered computer-use and browser-control product surfaces. This involves full-stack development, agent harness, and working with LLM APIs and agent frameworks. The role requires end-to-end ownership and iteration based on user feedback, with a focus on reliability and robustness of the agent harness. | Agent | 8 |
| Software Engineer, RL Data | Software Engineer on the RL Data team responsible for building systems that produce high-quality reinforcement learning data for Claude. This includes data collection pipelines, human feedback tooling, execution environments, and quality assurance. The role involves end-to-end ownership of stack components, iterating on prompts and evals, developing QA frameworks, hardening execution environments, and collaborating with domain experts and operations partners. | Data, Post-train | 8 |
| Engineering Manager, Cybersecurity Products | Engineering Manager for AI-powered cybersecurity products, leading a team to prototype and ship products using frontier models. The role involves setting technical direction, partnering with research, and staying close to customers. It requires hands-on technical involvement, product instincts, and scaling the team. | Agent, Ship | 8 |
| Software Engineer, Claude Design | Software Engineer to build and shape Claude Design, a product that lets users collaborate with Claude to create visual work. This is a frontend-leaning role focused on creating intuitive AI-generated design experiences, working closely with researchers and users to iterate and validate product concepts. | Ship, Agent | 8 |
| Engineering Manager, Research Tools | Engineering Manager for Anthropic's Research Tools team, focusing on building and improving systems for large-scale, distributed finetuning runs and enhancing researcher productivity. The role involves prioritizing team work, designing operational processes, coaching reports, and managing recruiting efforts to support rapid growth in AI model development and research. | Post-train | 8 |
| Manager of Applied AI Architecture, Enterprise Tech (Cyber) | Manager of Applied AI Architecture, Enterprise Tech (Cyber) at Anthropic, responsible for leading a team that drives the adoption of Anthropic's AI products (Claude for Enterprise, Claude Code, API) within Enterprise Tech companies. This role involves technical guidance, pre-sales engagements, customer strategy, and ensuring the safe and reliable deployment of AI systems. | Ship, Agent | 8 |
| Full-Stack Software Engineer, Reinforcement Learning | Full-Stack Software Engineer to build platforms, tools, and interfaces for environment creation, data collection, and training observability for RL. The role involves owning product surfaces end-to-end, iterating on data collection strategies, and partnering with researchers to ship reliable products. | Data, Eval Gate | 8 |
| Staff Software Engineer, Cloud Inference Safeguards | Staff Software Engineer to build and operate safety, oversight, and intervention mechanisms for AI models (Claude) on third-party cloud service provider (CSP) platforms. This role ensures requests are monitored for misuse, enforced against policy, and compliant with data residency and privacy commitments. The engineer will integrate Safeguards into the CSP inference serving path, focusing on real-time enforcement, telemetry, and privacy architecture, while maintaining serving-path latency and scale. The work directly impacts the ability to ship frontier models on CSP platforms safely. | Serve, Eval Gate | 8 |
| Engineering Manager, Agent Prompts & Evals | Engineering Manager to lead the Agent Prompts & Evals team, responsible for the infrastructure that enables shipping model and prompt changes with confidence. This includes eval frameworks, system prompt pipelines, and regression-detection systems. The team acts as a platform for model behavior, sitting between product engineering and research, and partners with other evals groups and product teams. The role requires leading and growing a team, owning the product-side eval platform and system prompt infrastructure, managing model launches, fostering collaboration, recruiting engineers, and shaping team investment in areas like frontier eval development and launch automation. | Eval Gate, Agent | 8 |
| Engineering Manager, Cloud Inference AWS | Engineering Manager to lead the Cloud Inference team for AWS, responsible for scaling and optimizing Claude's inference, API, load balancing, capacity, and operations on AWS. The role ensures LLMs meet performance, safety, and security standards, and enhances global inference technology deployment. It focuses on increasing operational scale and accelerating the launch of new models and features. | Serve | 8 |
| Engineering Manager, Vertical AI Products (Multiple Roles) | Engineering Manager for Vertical AI Products at Anthropic, focusing on building and shipping AI products for specific industries like financial services, life sciences, and healthcare. The role involves leading engineering teams, defining products, working with enterprise customers, and collaborating with research to improve AI models. The teams are often building products from scratch (0->1) or scaling existing ones. | Ship, Agent | 8 |
| Senior Staff Software Engineer, API | Senior Staff Software Engineer for Anthropic's Claude Developer Platform team, focusing on API Engineering. This role involves setting technical strategy for systems that make Claude accessible to developers at scale, ensuring reliability, capability, and growth. Responsibilities include defining multi-year technical strategy, leading complex engineering initiatives, making major architectural decisions, partnering with Research, Inference, Platform, Infrastructure, and Safeguards, and mentoring engineers. The role spans API Core, Capabilities, Knowledge, Distributability, and Agents, with a focus on shipping frontier model capabilities and agentic workflows. | Ship, Agent | 8 |
| Staff Software Engineer, People Products | Staff Software Engineer focused on building AI-native workflows and LLM-native features for internal people products at Anthropic. The role involves full-stack development, designing and implementing AI tools, evals, and prompts, and working directly with internal stakeholders to iterate quickly. Emphasis on autonomy, shipping products rapidly, and making independent product and architecture decisions in a low-structure environment. | Agent | 8 |
| Applied AI Engineer, Life Sciences (Beneficial Deployments) | Applied AI Engineer role focused on deploying Claude in life sciences to accelerate scientific progress. The role involves partnering with research institutions, building agents integrated into scientific workflows, and developing ecosystem infrastructure like MCP servers, benchmarks, and agent skills. The goal is to make Claude a go-to tool for the life sciences ecosystem, from discovery to pharma pipelines. | Agent | 8 |
| Manager, Forward Deployed Engineering | Manager for a new Forward Deployed Engineering team at Anthropic, focused on helping enterprise customers adopt Claude. This player-coach role involves hiring, developing, and leading a team of FDEs to ship production AI applications, build playbooks, and influence product direction. Requires strong leadership, technical mentorship, and customer-facing experience. | Ship | 8 |
| Design Engineer, AI Capability Development (Education Labs) | This role focuses on building and shipping AI-native product features that enhance human capability and skill development, operating as a technical lead within a research-focused team. The engineer will prototype, define technical direction, and collaborate across functions to integrate skill development principles into Anthropic's broader product strategy, measuring success by user capability growth. | Ship | 8 |
| Forward Deployed Engineer, Federal Civilian | Forward Deployed Engineer for Anthropic's Applied AI team, embedding with federal civilian customers to drive AI adoption and ship advanced AI applications built on Claude models. Responsibilities include building production applications, delivering technical artifacts like sub-agents, providing deployment support, identifying repeatable patterns, and building customer relationships. Requires strong programming skills, production LLM experience (prompt engineering, agent development, evaluation, scaling), and experience with government agencies. | Agent | 8 |
| Applied AI Engineer, Beneficial Deployments | Applied AI Engineer focused on deploying AI to mission-driven organizations, advising on AI applications like evals and agent architectures, and building infrastructure to scale impact. Requires production experience with LLM applications and a builder mindset. | Agent | 8 |
| Staff+ Software Engineer, Claude App Infrastructure | Staff+ Software Engineer role focused on building the agentic layer and infrastructure for Claude App, enabling task execution, tool use, and safe interaction with external services. This involves designing and building sandboxed compute environments, state management for agent tasks, authentication/authorization, and observability tools for agent execution at scale. | Agent, Serve | 8 |
| Applied AI Architect, Startups | This role involves partnering with startups to help them build and scale LLM solutions on the Claude Developer Platform. The architect will guide them through technical evaluations, design architectures, and ensure successful deployment of AI systems, acting as a technical advisor and builder. | Agent | 8 |
| Forward Deployed Engineer, Applied AI | Forward Deployed Engineer to embed with strategic customers, drive AI adoption, and ship advanced AI applications using Claude models. Responsibilities include building production applications, delivering technical artifacts like sub-agents, providing deployment support, identifying deployment patterns, and maintaining knowledge of LLM capabilities. Requires 3+ years in a technical, customer-facing role with production LLM experience, strong Python skills, and high agency. | Agent | 8 |
| Staff + Sr. Software Engineer, Inference | The Inference team at Anthropic is responsible for building and maintaining the systems that serve Claude to millions of users. This involves managing the entire stack from request routing to fleet-wide orchestration across diverse AI accelerators, with a dual mandate of maximizing compute efficiency and enabling research breakthroughs. The role requires significant software engineering experience, particularly with distributed systems, and experience with LLM inference optimization. | Serve | 8 |
| Engineering Manager, Inference | Engineering Manager for Anthropic's performance and scaling teams, focusing on improving model performance and scaling inference and training systems. Responsibilities include front-line leadership, managing day-to-day execution, prioritizing work, and coaching reports. Requires management experience in technical environments, background in ML/AI, and interest in safe AI development. | Serve, Data | 8 |
| Performance Engineer | This role focuses on optimizing the performance, throughput, and robustness of large-scale distributed machine learning systems. The engineer will identify and solve novel systems problems, implement low-latency sampling, adapt models for low-precision inference, optimize serving efficiency, and design fault-tolerant distributed systems. While not directly building ML models, the role is critical for enabling ML algorithms to run efficiently at scale. | Serve | 8 |
| Engineering Manager, Safeguards Review Tooling | Engineering Manager for Anthropic's Safeguards Review Tooling team, focusing on building and scaling systems for AI safety investigation and enforcement. This role involves leading a team to develop tooling that supports human reviewers and integrates AI (Claude) for automation, with a strong emphasis on privacy, analytics, and a sandbox environment for rapid iteration. | Agent, Serve | 7 |
| Staff + Senior Software Engineer, Inference | Software Engineer focused on building and maintaining the distributed systems that serve large language models (like Claude) to millions of users. The role involves maximizing compute efficiency, enabling research through high-performance inference infrastructure, and integrating new AI hardware and model architectures. | Serve | 7 |
| Staff+ Software Engineer, GRC Platform | Software Engineer to build the platform for governance, risk, and compliance (GRC) at Anthropic. This role involves integrating data from various systems, creating automated checks, dashboards, and evidence for decision-making. The engineer will design and build data pipelines, integrations, and agentic workflows using Claude for tasks like evidence collection and analysis, translating policies into code, and developing real-time visibility dashboards. The goal is to turn manual compliance processes into scalable, reliable systems. | Agent | 7 |
| Staff + Sr. Software Engineer, Cloud Inference | This role focuses on building and optimizing backend services and infrastructure for serving large language models (LLMs) like Claude across multiple cloud service providers (CSPs). The engineer will be responsible for API integration, intelligent request routing, inference execution, capacity management, and day-to-day operations, ensuring reliability, cost-effectiveness, and performance at massive scale. The role involves cross-functional collaboration with internal teams and CSP partners, CI/CD automation, and analyzing observability data. | Serve | 7 |
| Performance Engineer, Inference Systems | Performance Engineer for Anthropic's inference fleet (Claude), focusing on throughput, latency, reliability, and correctness. The role involves cross-layer performance investigations, improving correctness evaluation pipelines, building observability tools, and partnering with component teams to implement optimizations. Requires strong performance engineering, Python, and data analysis skills, with a genuine interest in correctness as an engineering discipline. | Serve, Eval Gate | 7 |
| Staff + Sr. Software Engineer, Cloud Inference Launch Engineering | Staff + Sr. Software Engineer role focused on scaling and optimizing Claude's inference on cloud platforms (AWS, GCP, Azure). The role involves owning the end-to-end product of Claude on each cloud, including API integration, request routing, inference execution, capacity management, and day-to-day operations. Key responsibilities include validating inference server and load balancer changes, ensuring correctness, performance, and reliability across platforms, and driving down cycle times for model launches and feature integrations. The role requires strong software engineering experience in distributed systems and experience with cloud platforms, with a focus on building automation and test infrastructure for inference services. | Serve | 7 |
| Data Scientist, Supply | This role focuses on optimizing compute allocation for AI systems by building testing frameworks, connecting compute decisions to user outcomes, and partnering with infrastructure and research teams. The goal is to ensure efficient use of scarce AI resources and translate data-driven insights into operational changes that impact how AI reaches users at scale. | Serve | 7 |
| Staff+ Software Engineer, Public Sector | Staff+ Software Engineer for Anthropic's public sector team, focusing on building and scaling AI applications for governments. This role involves full-stack development, customer collaboration, and adapting Claude for government workflows, particularly in national security and public services. The position requires experience with AI/ML models and shipping enterprise/government-grade products, with a strong emphasis on public sector experience and adapting AI for critical operations. | Ship | 7 |
| Productivity Engineer | This role focuses on building AI-powered productivity tools for sales teams, specifically for top-of-funnel activities like lead processing, prospecting, and routing. It involves developing Claude-powered automations and alert systems, integrating various data sources, and analyzing engagement data to improve sales processes. The goal is to enhance sales effectiveness and pipeline generation using AI. | Agent | 7 |
| Partner Business Systems & AI Operations Lead | This role focuses on building and operating an AI automation layer within business systems, specifically for partner operations. It involves creating agentic workflows, LLM-driven processes, and AI-augmented tools to improve efficiency and reduce manual work. The role requires hands-on experience with LLMs and AI agents, integrating them into production processes, and ensuring responsible AI deployment with evals and guardrails. The domain is enterprise AI operations, with a primary focus on developing and deploying AI agents and automated workflows. | Agent | 7 |