Principal Product Manager, Reliability Platform (Observability, SRE, AIM)

GEICO · Insurance · Palo Alto, CA +1

Product Manager for GEICO's core reliability platforms and services, focusing on observability, SRE, and incident management to empower engineering teams and ensure high-quality software delivery.

What you'd actually do

  1. Building and scaling foundational developer platforms that serve as the backbone for our engineering organization.
  2. Defining and executing a clear product strategy for the Observability, BCDR (business continuity and disaster recovery), and Incident Management areas within our internal developer engineering team.
  3. Leading cross-functional teams to deliver high-impact, developer-facing products in an agile environment.
  4. Deeply understanding the entire developer workflow—from coding and testing to deployment and operations—and identifying opportunities to remove friction and improve efficiency.
  5. Owning and prioritizing the product roadmap for a suite of platform services, such as our metrics platform, logging pipelines, alerting systems, on-call and incident response tooling, and BCDR orchestration platform.

Skills

Required

  • technical product management
  • developer tools
  • platform engineering
  • SRE
  • Observability
  • cloud infrastructure
  • product lifecycle management
  • data-driven product decisions
  • measuring success of developer-facing tools

Nice to have

  • MBA
  • internal developer platforms (IDPs)
  • service catalogs
  • Paved Road engineering
  • Grafana
  • Azure
  • AWS
  • Kubernetes
  • Site Reliability Engineering (SRE) principles
  • SLOs
  • error budgets
  • incident management
  • communication skills

What the JD emphasized

  • core reliability platforms and services
  • Developer Engineering organization
  • ensure our software is delivered quickly, safely, and with the highest quality
  • developer-facing products
  • developer workflow
  • Owning and prioritizing the product roadmap
  • culture of reliability and ownership