IT Systems Engineer, Corporate Systems & Infrastructure

Anthropic · AI Frontier · San Francisco, CA · Security

This role is for an IT Systems Engineer who builds and operates corporate infrastructure, including cloud services, CI/CD pipelines, and observability stacks. The engineer develops internal tooling, automates systems, and manages infrastructure as code, supporting the broader IT organization as the company scales.

What you'd actually do

  1. Build and operate the cloud infrastructure that hosts IT's internal services
  2. Design CI/CD pipelines that let IT Engineering ship through code review and automated testing
  3. Own observability for corporate infrastructure — monitoring, alerting, dashboards, and SLOs
  4. Write cross-system automation to integrate third-party systems and internal services
  5. Partner with network, audiovisual, and physical security to deliver robust infrastructure solutions

Skills

Required

  • Python
  • Go
  • Terraform and Infrastructure as Code
  • Cloud platforms (AWS, GCP, Azure)
  • CI/CD pipeline design
  • Observability tooling (e.g., Prometheus, Grafana, Datadog, Honeycomb, or equivalent)
  • Linux systems administration
  • Strong networking skills
  • Configuration management

Nice to have

  • Transformed traditional IT operations into engineering-driven organizations
  • Built strong partnerships with Security and Engineering teams
  • Practice modern development methods (code reviews, testing, CI/CD)
  • Work effectively in distributed teams
  • Experience with ECS, Kubernetes, or other container orchestration for internal services
  • Automated physical-world infrastructure deployment (e.g., network configuration, office technology, physical security systems)
  • Worked with enterprise integration or workflow automation platforms (e.g., Workato, n8n, Tines, or equivalents)

What the JD emphasized

  • 8+ years building secure IT systems in complex environments
  • Shipped Infrastructure as Code in production (Terraform or similar), with modules and state you maintained
  • Run services with SLOs, on-call rotations, and post-incident reviews