Senior Manager, Site Reliability Engineering, Follow up Boss

Zillow Zillow · Consumer · United States · Remote

This role is for a Senior Engineering Manager leading a multidisciplinary team of SREs, SDEs, and security engineers responsible for the infrastructure, reliability, and developer experience that underpin Follow Up Boss. The manager will own team execution, quality, and innovation, drive cross-org alignment, and lead modernization efforts. While the role mentions AI agents for automation, the core focus is on SRE and infrastructure management, not direct AI/ML model development.

What you'd actually do

  1. Own team execution, quality, and innovation across multiple systems and workloads that support the entire FUB+ org.
  2. Move the infra team into a strong posture for proactive, roadmap-driven investments in reliability, scalability, security, and developer productivity.
  3. Drive cross-org alignment and strategy with ZG platform teams (ZGCP, SRE, networking, database, security) as well as FUB product teams to modernize FUB infrastructure and adopt shared platform capabilities where appropriate.
  4. Own execution for the FUB infra & security roadmap, turning strategic goals (e.g., DB scalability, ZGCP adoption, infra cost and reliability targets) into a sequenced, realistic plan with clear milestones and measures of success.
  5. Be accountable for reliability, performance, operability, and cost of core FUB services and infrastructure (EC2, RDS/Aurora, Redis/Valkey, networking, queues, SRE tooling).

Skills

Required

  • SRE leadership
  • Infrastructure management
  • AWS
  • Terraform/Ansible
  • Kubernetes/ZGCP
  • Observability
  • Security
  • Databases
  • CI/CD
  • Incident management
  • SLOs and error budgets
  • Capacity management
  • Developer experience tooling

Nice to have

  • AI agents for automation, diagnostics, and operations

What the JD emphasized

  • infrastructure, reliability, and developer experience
  • proactive, roadmap-driven investments
  • modernize FUB infrastructure
  • Own execution for the FUB infra & security roadmap
  • reliability, performance, operability, and cost
  • database scaling and performance
  • Infrastructure Modernization & Platform Strategy
  • Developer Experience & Environments
  • Team Building, Coaching & Talent
  • Cross-Org Alignment & Strategy
  • infra-focused AI agents for automation, diagnostics, and operations