Staff+ Software Engineer, Claude App Infrastructure

Anthropic Anthropic · AI Frontier · San Francisco, CA · Software Engineering - Infrastructure

Staff+ Software Engineer role focused on building the agentic layer and infrastructure for Claude App, enabling task execution, tool use, and safe interaction with external services. This involves designing and building sandboxed compute environments, state management for agent tasks, authentication/authorization, and observability tools for agent execution at scale.

What you'd actually do

  1. Design and build sandboxed compute environments where Claude can safely execute code, access tools, and interact with external services
  2. Build state management systems for long-running agent tasks, handling checkpoints, recovery, and resumption across failures
  3. Develop authentication and authorization frameworks for delegated access, so Claude can act on behalf of users securely
  4. Create observability and debugging tools for agent execution, so we understand what Claude did, why, and how to make it better
  5. Partner closely with product and research teams to define what's possible and ship it

Skills

Required

  • Experience building distributed systems, infrastructure, or platform services at scale
  • Comfort building cloud native infrastructure on GCP, AWS, or Azure
  • Experience with containers, sandboxing, or secure execution environments (e.g., gVisor, Firecracker, V8 isolates)
  • Write clean, maintainable code in Python, Go, Rust, or a similar language
  • Care about security, isolation, and building systems that fail safely

Nice to have

  • Comfort with ambiguity and greenfield work where you help shape the architecture
  • Interest in problems that don't have existing playbooks
  • Experience building multi-tenant execution platforms or serverless infrastructure
  • Background in security engineering, sandboxing, or isolation technologies
  • Familiarity with workflow orchestration systems (Temporal, Airflow, Step Functions)
  • Experience with state machines, checkpointing, or durable execution patterns
  • Low-level systems experience (Linux internals, eBPF, container runtimes)

What the JD emphasized

  • building distributed systems, infrastructure, or platform services at scale
  • cloud native infrastructure
  • containers, sandboxing, or secure execution environments
  • security, isolation, and building systems that fail safely

Other signals

  • building the agentic layer
  • task execution
  • personalization
  • browser use
  • server-side tools
  • primitives that let Claude act in the world
  • reliability and velocity
  • sandboxed compute environments
  • execute code
  • access tools
  • interact with external services
  • state management systems for long-running agent tasks
  • authentication and authorization frameworks for delegated access
  • observability and debugging tools for agent execution