Staff Platform Engineer

Abridge Abridge · Vertical AI · San Francisco, CA · Builder

Staff Platform Engineer to scale cloud infrastructure, developer platform, and operational maturity for an AI-powered healthcare platform. Focus on multi-tenant, multi-cloud infrastructure, CI/CD, security, and observability, ensuring reliability, scalability, and compliance (SoC2, HIPAA).

What you'd actually do

  1. Design, build, and evolve cloud infrastructure platforms including networking, IAM, Kubernetes, databases, streaming and pubsub platforms, storage, distribution, observability, and more.
  2. Lead the architecture and operational evolution of multi-tenant, multi-region, and multi-cloud infrastructure with strong reliability, scalability, and security boundaries.
  3. Design and implement build pipelines, branching strategies, release management tooling, and self-service platform workflows that will serve an engineering organization that is rapidly growing in both size and operational complexity.
  4. Design, implement, and scale secure-by-default cloud infrastructure practices including CI and deployment scans, least privileged access controls, auditing, policy enforcement, and maintaining SoC2 and HIPAA compliance.
  5. Build reusable infrastructure abstractions, Terraform modules, golden paths, and developer platform capabilities that allow engineering teams to move quickly while maintaining operational consistency and governance.

Skills

Required

  • 10+ years of software and infrastructure engineering experience
  • significant experience operating infrastructure-as-code platforms in cloud-first organizations
  • Deep understanding of Kubernetes platform architecture and operations
  • Experience designing and maintaining CI/CD systems for both infrastructure-as-code deployments and application delivery workflows

Nice to have

  • Experience designing and operating large-scale Kubernetes platforms and scaling compute services on Kubernetes
  • experience with related cloud-native technologies including ArgoCD, Argo Rollouts, Istio, etc.
  • Experience building scalable infras
  • Terragrunt, Atlas, ArgoCD, Octopus Deploy, Travis CI

What the JD emphasized

  • regulatory requirements
  • SoC2 and HIPAA compliance
  • AI-first, cloud-native, security-first infrastructure at scale

Other signals

  • AI-powered platform
  • generative AI for healthcare
  • AI-first, cloud-native, security-first infrastructure at scale
  • scale our cloud infrastructure, developer platform, and operational maturity