AI Frontier · AI lab
Anthropic has 145 active AI-related job listings. The largest share of these roles focuses on agents, at 28% of the total. Engineering is the most frequent function, with 74 listings, followed by Research with 51. The company is primarily hiring in the United States, with 118 positions, and the United Kingdom, with 22. Frequent tech tags include model_serving, evals, and agent_orchestration, suggesting a focus on deployment and evaluation of AI systems. In the last 30 days, Anthropic posted 16 new AI roles, a 47% decrease compared to the previous 30-day period.
Currently tracking 124 active AI roles, with 106 new openings in the last 4 weeks. Primary focus: Agent · Engineering. Salary range $46k–$850k (avg $405k).
Anthropic currently has 132 active AI-related roles in our index. The most common open titles are: Applied AI Architect, Industries (2); Regional Research Economist, Economic Research (2); Research Engineer, Machine Learning (RL Velocity) (2); Research Engineer, Production Model Post-Training (2); Staff Software Engineer, AI Reliability Engineering (2). Most positions are in Engineering and Research.
Anthropic's active AI hiring is concentrated in: agents (28%), serving infrastructure (17%), post-training (14%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Anthropic is hiring AI talent in: United States (106 roles), United Kingdom (20 roles), Canada (6 roles), Ireland (5 roles).
Job postings at Anthropic most frequently reference: model serving, evals, LLM observability, agent orchestration, and inference infrastructure.
In the past 30 days, Anthropic has posted 29 new AI-related roles. That is a +61% change versus the prior 30 days (18 → 29).
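The percentage quoted above is a plain relative change between the two 30-day counts. A minimal sketch of that arithmetic follows; the function name `pct_change` is illustrative and is not taken from the index itself.

```python
def pct_change(previous: int, current: int) -> float:
    """Relative change from previous to current, in percent."""
    if previous == 0:
        # Relative change is undefined when the baseline is zero.
        raise ValueError("previous count must be non-zero")
    return (current - previous) / previous * 100


# Counts quoted above: 18 roles in the prior window, 29 in the latest one.
change = pct_change(18, 29)
print(f"{change:+.0f}%")  # prints "+61%"
```
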
| Title | Stage | AI score |
|---|---|---|
| Staff+ Software Engineer, Inference Runtime Staff+ Software Engineer for Anthropic's Inference Runtime team, focusing on the accelerator-agnostic core of their AI inference serving stack. The role involves setting technical direction, owning the architecture and roadmap, hands-on coding in Rust/Python, optimizing accelerator usage, and building validation systems. Requires deep systems engineering or ML infrastructure background with experience in performance optimization and large-scale distributed systems. | Serve | 9 |
| Software Engineer, Safeguards Evals Software Engineer role focused on building and owning the evaluation infrastructure for an agentic investigation system. This involves designing experiments, constructing high-quality eval datasets, measuring agent performance, analyzing coverage gaps, and productionizing research into release pipelines. The role also involves building tooling for policy experts and constructing RL environments to improve safety investigation capabilities. | Agent, Eval Gate | 9 |
| Research Engineer, Machine Learning (RL Velocity) Research Engineer focused on building and improving the RL training infrastructure and tooling at Anthropic. The role involves identifying and removing bottlenecks in the RL stack, partnering with researchers and other engineering teams, and owning the reliability and performance of research runs to enable faster iteration and shipping of better models at scale. | Data, Post-train | 9 |
| Security Labs Engineer This role focuses on executing security R&D projects end-to-end, building novel security infrastructure, and driving successful experiments toward production scale. It involves working with research teams to test security controls, evaluating new security technologies, and documenting results to inform future security architecture. The role spans from initial project scoping to potential production deployment, with a focus on high-assurance environments and AI-assisted security tooling. | Serve, Ship | 9 |
| Model Quality Software Engineer, Claude Code Staff Software Engineer to set technical direction at the intersection of engineering and research on the Claude Code team. Architect systems, tooling, and evaluation infrastructure to measure, understand, and improve Claude's coding capabilities. Drive architecture, mentor engineers, and influence the direction of Claude Code. | Eval Gate, Agent | 9 |
| Research Engineer, Interpretability Research Engineer focused on building and maintaining specialized infrastructure for interpretability research in AI systems. This involves developing tools for model analysis, optimizing training and inference pipelines, and ensuring reliability for safety audits, with a strong emphasis on understanding and controlling model behavior. | Post-train, Serve | 9 |
| Machine Learning Systems Engineer, RL Engineering ML Systems Engineer focused on Reinforcement Learning Engineering to build, maintain, and improve the algorithms and infrastructure for training AI models like Claude using RLHF and other advanced techniques. The role emphasizes improving system performance, robustness, and usability to accelerate research breakthroughs in AI capabilities and safety. | Post-train | 9 |
| Machine Learning Systems Engineer, Research Tools Machine Learning Systems Engineer focused on developing and optimizing encodings and tokenization systems for Anthropic's Finetuning workflows. This role acts as a bridge between Pretraining and Finetuning teams, building infrastructure crucial for model learning and data interpretation, impacting research progress and efficiency. | Data, Post-train | 9 |
| Performance Engineer, GPU This role focuses on optimizing GPU performance and systems engineering for large language models, specifically improving utilization and efficiency for inference and training at scale. It involves deep work in GPU programming, custom kernel development, and distributed systems. | Serve, Pretrain | 9 |
| ML Infrastructure Engineer, Safeguards ML Infrastructure Engineer focused on building and scaling critical infrastructure for AI safety systems, including real-time and batch classifier/safety evaluations, monitoring, and optimizing inference for safety-critical applications. | Eval Gate, Serve | 9 |
| Engineering Manager, GPU (ML Accelerator) Engineering Manager for Anthropic's performance and scaling teams, focusing on optimizing compute resources for inference and training systems. The role involves leadership, technical contribution, bottleneck identification, and ensuring efficiency in large-scale ML systems, with a strong emphasis on GPU/accelerator programming and ML/OS internals. | Serve, Data | 9 |
| TPU Kernel Engineer TPU Kernel Engineer responsible for identifying and addressing performance issues across ML systems (research, training, inference), with a focus on designing and optimizing kernels for TPUs. Provides feedback to researchers on model performance impact. | Serve, Post-train | 9 |
| Engineering Manager, Cloud Safety Engineering Manager to lead the Cloud Safety team, responsible for scaling and optimizing Claude's serving infrastructure across Cloud Service Providers (CSPs). The role involves owning end-to-end safety, including API, inference, classifiers, fraud detection, data management, and operations, to ensure safe usage and enable the launch of new models and features at scale. | Serve | 8 |
| Applied AI Engineer Applied AI Engineer role focused on being a technical advisor to customers deploying Claude (LLM). Responsibilities include guiding architecture, developing evaluation frameworks, and implementing cutting-edge LLM patterns via API. Requires strong Python skills and production experience with LLMs, including agent development and retrieval frameworks. | Agent | 8 |
| Product Engineer, Computer Use Product Engineer role focused on building and shipping AI-powered computer-use and browser-control product surfaces. This involves full-stack development, agent harness, and working with LLM APIs and agent frameworks. The role requires end-to-end ownership and iteration based on user feedback, with a focus on reliability and robustness of the agent harness. | Agent | 8 |
| Applied AI Architect (Startups) This role focuses on partnering with startups to help them build and scale AI solutions using Anthropic's Claude Developer Platform. The architect will guide technical decisions, win evaluations, and provide feedback to product and engineering teams. Requires strong technical expertise in LLM application development and deployment, with a customer-facing background. | Agent | 8 |
| Software Engineer, RL Data Software Engineer on the RL Data team responsible for building systems that produce high-quality reinforcement learning data for Claude. This includes data collection pipelines, human feedback tooling, execution environments, and quality assurance. The role involves end-to-end ownership of stack components, iterating on prompts and evals, developing QA frameworks, hardening execution environments, and collaborating with domain experts and operations partners. | Data, Post-train | 8 |
| Engineering Manager, Cybersecurity Products Engineering Manager for AI-powered cybersecurity products, leading a team to prototype and ship products using frontier models. The role involves setting technical direction, partnering with research, and staying close to customers. It requires hands-on technical involvement, product instincts, and scaling the team. | Agent, Ship | 8 |
| Software Engineer, Claude Design Software Engineer to build and shape Claude Design, a product that lets users collaborate with Claude to create visual work. This is a frontend-leaning role focused on creating intuitive AI-generated design experiences, working closely with researchers and users to iterate and validate product concepts. | Ship, Agent | 8 |
| Engineering Manager, Research Tools Engineering Manager for Anthropic's Research Tools team, focusing on building and improving systems for large-scale, distributed finetuning runs and enhancing researcher productivity. The role involves prioritizing team work, designing operational processes, coaching reports, and managing recruiting efforts to support rapid growth in AI model development and research. | Post-train | 8 |
| Manager of Applied AI Architecture, Enterprise Tech (Cyber) Manager of Applied AI Architecture, Enterprise Tech (Cyber) at Anthropic, responsible for leading a team that drives the adoption of Anthropic's AI products (Claude for Enterprise, Claude Code, API) within Enterprise Tech companies. This role involves technical guidance, pre-sales engagements, customer strategy, and ensuring the safe and reliable deployment of AI systems. | Ship, Agent | 8 |
| Full-Stack Software Engineer, Reinforcement Learning Full-Stack Software Engineer to build platforms, tools, and interfaces for environment creation, data collection, and training observability for RL. The role involves owning product surfaces end-to-end, iterating on data collection strategies, and partnering with researchers to ship reliable products. | Data, Eval Gate | 8 |
| Engineering Manager, Agent Prompts & Evals Engineering Manager to lead the Agent Prompts & Evals team, responsible for the infrastructure that enables shipping model and prompt changes with confidence. This includes eval frameworks, system prompt pipelines, and regression-detection systems. The team acts as a platform for model behavior, sitting between product engineering and research, and partners with other evals groups and product teams. The role requires leading and growing a team, owning the product-side eval platform and system prompt infrastructure, managing model launches, fostering collaboration, recruiting engineers, and shaping team investment in areas like frontier eval development and launch automation. | Eval Gate, Agent | 8 |
| Staff Software Engineer, Inference Staff Software Engineer on the Inference team responsible for building and maintaining systems that serve Claude to millions of users. Focuses on maximizing compute efficiency and providing high-performance inference infrastructure for research, tackling complex distributed systems challenges across diverse AI accelerators. | Serve | 8 |
| Senior Staff Software Engineer, API Senior Staff Software Engineer for Anthropic's Claude Developer Platform team, focusing on API Engineering. This role involves setting technical strategy for systems that make Claude accessible to developers at scale, ensuring reliability, capability, and growth. Responsibilities include defining multi-year technical strategy, leading complex engineering initiatives, making major architectural decisions, partnering with Research, Inference, Platform, Infrastructure, and Safeguards, and mentoring engineers. The role spans API Core, Capabilities, Knowledge, Distributability, and Agents, with a focus on shipping frontier model capabilities and agentic workflows. | Ship, Agent | 8 |
| Staff Software Engineer, People Products Staff Software Engineer focused on building AI-native workflows and LLM-native features for internal people products at Anthropic. The role involves full-stack development, designing and implementing AI tools, evals, and prompts, and working directly with internal stakeholders to iterate quickly. Emphasis on autonomy, shipping products rapidly, and making independent product and architecture decisions in a low-structure environment. | Agent | 8 |
| Design Engineer, AI Capability Development (Education Labs) This role focuses on building and shipping AI-native product features that enhance human capability and skill development, operating as a technical lead within a research-focused team. The engineer will prototype, define technical direction, and collaborate across functions to integrate skill development principles into Anthropic's broader product strategy, measuring success by user capability growth. | Ship | 8 |
| Staff+ Software Engineer, Claude App Infrastructure Staff+ Software Engineer role focused on building the agentic layer and infrastructure for Claude App, enabling task execution, tool use, and safe interaction with external services. This involves designing and building sandboxed compute environments, state management for agent tasks, authentication/authorization, and observability tools for agent execution at scale. | Agent, Serve | 8 |
| Engineering Manager, Inference Engineering Manager for Anthropic's performance and scaling teams, focusing on improving model performance and scaling inference and training systems. Responsibilities include front-line leadership, managing day-to-day execution, prioritizing work, and coaching reports. Requires management experience in technical environments, background in ML/AI, and interest in safe AI development. | Serve, Data | 8 |
| Senior Software Engineer, Inference Senior Software Engineer on the Inference team responsible for building and maintaining systems that serve Claude models to millions of users. Focuses on maximizing compute efficiency and providing high-performance inference infrastructure for research. | Serve | 8 |
| Performance Engineer This role focuses on optimizing the performance, throughput, and robustness of large-scale distributed machine learning systems. The engineer will identify and solve novel systems problems, implement low-latency sampling, adapt models for low-precision inference, optimize serving efficiency, and design fault-tolerant distributed systems. While not directly building ML models, the role is critical for enabling ML algorithms to run efficiently at scale. | Serve | 8 |
| Engineering Manager, Safeguards Review Tooling Engineering Manager for Anthropic's Safeguards Review Tooling team, focusing on building and scaling systems for AI safety investigation and enforcement. This role involves leading a team to develop tooling that supports human reviewers and integrates AI (Claude) for automation, with a strong emphasis on privacy, analytics, and a sandbox environment for rapid iteration. | Agent, Serve | 7 |
| Staff + Senior Software Engineer, Inference Software Engineer focused on building and maintaining the distributed systems that serve large language models (like Claude) to millions of users. The role involves maximizing compute efficiency, enabling research through high-performance inference infrastructure, and integrating new AI hardware and model architectures. | Serve | 7 |
| Staff+ Software Engineer, GRC Platform Software Engineer to build the platform for governance, risk, and compliance (GRC) at Anthropic. This role involves integrating data from various systems, creating automated checks, dashboards, and evidence for decision-making. The engineer will design and build data pipelines, integrations, and agentic workflows using Claude for tasks like evidence collection and analysis, translating policies into code, and developing real-time visibility dashboards. The goal is to turn manual compliance processes into scalable, reliable systems. | Agent | 7 |
| Staff + Sr. Software Engineer, Cloud Inference This role focuses on building and optimizing backend services and infrastructure for serving large language models (LLMs) like Claude across multiple cloud service providers (CSPs). The engineer will be responsible for API integration, intelligent request routing, inference execution, capacity management, and day-to-day operations, ensuring reliability, cost-effectiveness, and performance at massive scale. The role involves cross-functional collaboration with internal teams and CSP partners, CI/CD automation, and analyzing observability data. | Serve | 7 |
| Performance Engineer, Inference Systems Performance Engineer for Anthropic's inference fleet (Claude), focusing on throughput, latency, reliability, and correctness. The role involves cross-layer performance investigations, improving correctness evaluation pipelines, building observability tools, and partnering with component teams to implement optimizations. Requires strong performance engineering, Python, and data analysis skills, with a genuine interest in correctness as an engineering discipline. | Serve, Eval Gate | 7 |
| Staff+ Software Engineer, Public Sector Staff+ Software Engineer for Anthropic's public sector team, focusing on building and scaling AI applications for governments. This role involves full-stack development, customer collaboration, and adapting Claude for government workflows, particularly in national security and public services. The position requires experience with AI/ML models and shipping enterprise/government-grade products, with a strong emphasis on public sector experience and adapting AI for critical operations. | Ship | 7 |
| Applied AI Architect, Commercial This role is a Pre-Sales architect focused on becoming a trusted technical advisor helping customers understand the value of Claude and how they can successfully integrate and deploy Claude into their technology stack. The role involves being a hands-on builder, creating reusable blueprints, demos, and enablement. Responsibilities include partnering with account executives, serving as the primary technical advisor, supporting customers building with the Claude API, shipping working code, building prototypes and proof-of-concepts, developing eval frameworks, writing near-production examples, building reusable blueprints and demos, guiding technical architecture decisions, helping customers integrate Claude, and helping customers develop evaluation frameworks. | Agent | 7 |
| Global Applied AI Architecture Lead, Beneficial Deployments Lead a global team of Applied AI Architects who partner with mission-driven organizations to deploy Claude, focusing on responsible and effective adoption to accelerate their missions. This role involves setting strategy, scaling the team, influencing product roadmaps, and representing Anthropic as a senior technical leader in high-impact partnerships. | Agent | 7 |
| Software Engineer, Research Data Platform Software Engineer to build and operate data pipelines and tooling for AI researchers managing data from training runs, exploring datasets, and analyzing experiments. Focus on data products supporting the research workflow. | Data | 7 |
| Technical Enablement Lead, Claude Code This role focuses on creating and delivering technical training and content for go-to-market teams, partners, and customers to enable them to effectively demonstrate and support Claude Code, an AI coding assistant. The lead will build demos, labs, and competitive positioning materials, and translate new features into field-ready content. The role requires strong programming skills and extensive use of AI coding tools. | Agent | 7 |
| Staff+ Software Engineer, Full-stack Staff+ Software Engineer, Full-stack at Anthropic, focusing on building and scaling AI products for enterprise customers. This role involves end-to-end ownership of products like Claude.ai, the Anthropic API, enterprise deployments, and specialized industry applications. Responsibilities include developing developer tools, enhancing enterprise workflows with features like plugins and retrieval, ensuring security and compliance, and driving user growth and monetization. The role requires a product-oriented mindset and technical leadership across the full stack. | Agent | 7 |
| Staff+ Software Engineer, Backend Experienced backend engineers to own and scale the backend systems powering Anthropic's user-facing products like API, Claude Code, and Claude.ai. The role involves scoping and leading complex projects, making architectural decisions for reliability and scalability, and translating frontier model improvements into shipped products. Specific teams focus on API core, capabilities (vision, tool use), knowledge integration, developer experience, agent infrastructure, enterprise foundations (security, compliance, analytics), identity/verification, and marketplace platforms. | Ship, Agent | 7 |
| Staff + Sr. Software Engineer, AI Reliability This role focuses on improving the reliability of AI serving systems, including infrastructure, API layers, and accelerators. Responsibilities include developing SLOs, designing monitoring and observability systems, assisting with high-availability infrastructure, leading incident response for critical AI services, and supporting safeguard model serving. The role requires strong distributed systems and reliability backgrounds, with experience in large-scale model serving infrastructure being a plus. | Serve | 7 |
| Technical Program Manager, Infrastructure Technical Program Manager for Anthropic's Infrastructure organization, focusing on coordinating complex programs across developer productivity, tooling, reliability, and operations for AI systems. The role involves driving strategic initiatives, improving developer workflows, ensuring system reliability, and bridging communication between research, engineering, and product teams. | Serve | 7 |
| Staff + Sr. Software Engineer, Inference Deployment This role focuses on building and maintaining the infrastructure for deploying AI inference code to production across various accelerator fleets (GPU, TPU, Trainium). The core responsibility is to create a continuous, unattended deployment system that optimizes for resource constraints, minimizes cycle time, and ensures reliability at scale. It involves capacity-aware scheduling, deployment observability, and self-service onboarding for new models. | Serve | 7 |
| Staff Software Engineer, AI Reliability Engineering Staff Software Engineer focused on AI Reliability Engineering for large language model serving systems. Responsibilities include developing SLOs, designing monitoring and observability systems, implementing high-availability infrastructure, and leading incident response for critical AI services. This role partners with teams across Anthropic to improve reliability across serving paths. | Serve | 7 |
| Technical Program Manager, Inference Performance Technical Program Manager focused on inference performance and efficiency for AI models, coordinating launches, managing dependencies, and optimizing runtime and accelerator performance across multiple hardware targets. | Serve | 7 |
| Staff Software Engineer, AI Reliability Engineering Staff Software Engineer, AI Reliability Engineering at Anthropic. This role focuses on improving the reliability, robustness, and resilience of AI serving systems, specifically for large language models like Claude. Responsibilities include developing SLOs, designing monitoring and observability, assisting with high-availability infrastructure, leading incident response for critical AI services, and supporting the reliability of safeguard model serving. | Serve | 7 |
| Software Engineer, Safeguards Infrastructure Software Engineer focused on building foundational systems for AI safety, including infrastructure for data management, metric and evaluation systems, and tooling for human and agentic review. The role involves ensuring the day-to-day running of Safeguards systems and building robust, reliable multi-layered defenses for real-time improvement of safety mechanisms at scale. | Eval Gate, Agent | 7 |