Sr Manager, Engineering- Cloud Intelligence & Infrastructure Economics

Databricks Databricks · Data AI · San Francisco, CA · Executive Engineering - Pipeline

This role is for a Senior Engineering Manager at Databricks to lead a team responsible for the Intelligence and Economics of their infrastructure. The focus is on building a governance and efficiency layer to optimize costs in a multi-cloud environment, particularly for Generative AI and Serverless technologies. The mission is to evolve from cost reporting to Autonomous Governance, managing, enforcing, and optimizing resources globally.

What you'd actually do

  1. Lead the engineering team responsible for the Intelligence and Economics of our infrastructure.
  2. Build the Intelligent Governance and Efficiency layer that optimizes billions of dollars spent across all Databricks products in a complex multi-cloud environment.
  3. Own the roadmap to build platform-native systems that automatically manage, enforce, and optimize resources globally.
  4. Build high-scale data pipelines and attribution models that provide a "Source of Truth" for demand and capacity planning.
  5. Engineer the automated enforcement layer—including budget quotas, anomaly detection, and self-healing remediation—to manage exploding GenAI and serverless costs.

Skills

Required

  • Engineering management experience (7+ years)
  • Leading high-performance teams in Infrastructure, Production Engineering, or Cloud Systems
  • Distributed systems architecture (7+ years)
  • Kubernetes and Cloud-native architectures (AWS, Azure, or GCP)
  • Designing, building, and operating large-scale distributed systems
  • High-availability SaaS platforms or services with millions of users
  • Professional software development (Java, Scala, or C++)
  • Architectural leadership in transitioning fragmented technical environments into unified, automated, and opinionated platforms

Nice to have

  • Master’s degree or PhD in Computer Science or a related field
  • Infrastructure-as-Code (Terraform, CloudFormation)
  • Large-scale data orchestration
  • Cloud Economics, Capacity Planning, or Fleet Efficiency at a global scale

What the JD emphasized

  • billions of dollars spent
  • complex multi-cloud environment
  • Generative AI and Serverless
  • Autonomous Governance
  • manage, enforce, and optimize resources globally
  • exploding GenAI and serverless costs