Sr. Manager, Engineering Configuration Platform Team

Databricks · Data AI · Bellevue, WA +1 · Engineering

Senior Engineering Manager to lead a team within the Configuration Platform organization at Databricks. This role focuses on managing systems that handle configuration and feature flags across all Databricks services, ensuring safe and efficient rollout of changes, experiments, and incident mitigation. The platform supports thousands of internal engineers and handles millions of evaluations per second with low latency. The manager will drive strategy, reduce incidents, improve developer experience, and grow the team.

What you'd actually do

  1. You will lead a team that directly shapes how thousands of Databricks engineers ship, ramp, and roll back changes every day
  2. You will reduce the rate and blast radius of customer-impacting incidents caused by configuration and feature changes, through stronger guardrails, validation, and graduated rollout patterns
  3. You will close the loop between "bad change" and automated mitigation, driving company-wide MTTR down through investments in health-mediated releases, auto-rollback, and zero-downtime configuration delivery
  4. You will evolve the Configuration Platform user experience into a frictionless, opinionated set of surfaces (UI, CLI, agentic) where engineers author, review, and roll out changes safely without thinking about the internals
  5. You will grow the team into an org that attracts Staff and Principal engineers and whose technical decisions shape how the company balances speed and reliability

Skills

Required

  • Backend, infrastructure, or platform team management
  • Distributed systems
  • Infrastructure background
  • Java/Scala services
  • RPC and routing layers
  • Configuration systems
  • Kubernetes-based microservices
  • Multi-cloud or multi-region deployments
  • Hands-on EM disposition
  • Managing, growing, and retaining high-performing teams
  • Partnering with senior+ engineers
  • Operational experience owning a high-availability platform
  • On-call
  • Incident response
  • Product thinking for developers
  • BS in Computer Science

Nice to have

  • Master's degree or higher

What the JD emphasized

  • 5+ years managing backend, infrastructure, or platform teams of roughly 7 to 15 engineers
  • Strong distributed systems and infrastructure background
  • Hands-on EM disposition
  • Demonstrated track record managing, growing, and retaining high-performing teams
  • Operational experience owning a high-availability platform with on-call, incident response, and measurable reliability improvements
  • Demonstrated product thinking for developers