Senior Infrastructure Engineer

Twilio Twilio · Enterprise · United States · Remote · Engineering

Senior Infrastructure Engineer to build and scale core product infrastructure for identity and agentic use cases, ensuring security, reliability, and performance. Responsibilities include designing, implementing, and operating scalable cloud infrastructure, partnering with leadership, and improving production quality through observability and automation.

What you'd actually do

  1. Build and own the infrastructure and platform capabilities that power Stytch’s identity platform as it scales across Twilio—ensuring security, reliability, and performance for every customer.
  2. Design, implement, and operate scalable cloud infrastructure (AWS/EKS, ECS, networking, data stores), balancing uptime, cost, and developer velocity.
  3. Partner closely with Product and Engineering leadership to set infrastructure direction, translate platform needs into technical plans, and deliver high-impact roadmap work.
  4. Collaborate across Twilio and Stytch teams to align on architecture, integrate platform capabilities, and unblock cross-team initiatives.
  5. Operate with deep technical ownership: author design docs, drive key technical decisions, review code, and stay close to the systems you ship.

Skills

Required

  • 6+ years of experience as an Infrastructure or Platform Engineer building and operating high-scale, mission-critical cloud production systems.
  • Strong experience with containerization and orchestration (Kubernetes/EKS, Docker), Infrastructure as Code (Terraform, GitOps, or similar) and AWS.
  • Hands-on proficiency in at least one modern programming language used in production.
  • Experience designing and running observability and on-call systems (e.g., Datadog, ELK, Prometheus/Grafana).
  • Experience scaling cloud infrastructure for distributed systems, including relational databases and high-availability service architectures.
  • Excellent written and verbal communication skills; comfortable writing design docs and leading technical discussions.
  • Bachelor’s degree in Computer Science or equivalent practical experience.
  • Schedule: ability to work non-standard, on-call rotation weekend and holiday hours.

Nice to have

  • Experience with multi-region or global infrastructure, including disaster recovery and data replication strategies.
  • Familiarity with enterprise-scale platform challenges: multi-tenant infrastructure, compliance, and cost/performance optimization.
  • Builder at heart. Through a hobby or your profession, you are passionate about being hands on and seeing your work come to life.

What the JD emphasized

  • high-scale, mission-critical cloud production systems
  • containerization and orchestration (Kubernetes/EKS, Docker)
  • Infrastructure as Code (Terraform, GitOps, or similar)
  • AWS
  • observability and on-call systems (e.g., Datadog, ELK, Prometheus/Grafana)
  • scaling cloud infrastructure for distributed systems
  • multi-region or global infrastructure, including disaster recovery and data replication strategies