Senior Engineering Manager - Metrics Platform

Confluent Confluent · Data AI · India · Remote · Engineering

Senior Engineering Manager for Confluent's Metrics Platform team, responsible for leading a high-performing engineering organization focused on building and scaling a best-in-class observability foundation for real-time data streaming infrastructure. The role involves defining technical strategy, scaling infrastructure for 10x data volume, evolving the metrics API, ensuring operational excellence (99.99%+ availability, sub-second query performance), and collaborating with other teams.

What you'd actually do

  1. Define and execute the multi-year technical roadmap for the Metrics Platform, including Data infrastructure cluster evolution, data retention strategies, and query optimization
  2. Build, mentor, and grow a world-class engineering team
  3. Partner with Product Management to define and prioritize the Metrics API roadmap based on customer needs and business impact
  4. Align with Confluent's broader observability strategy across Cloud and Platform offerings
  5. Establish metrics and KPIs to measure system performance, system reliability, and customer satisfaction

Skills

Required

  • Software development and engineering
  • Engineering management
  • Large-scale distributed systems
  • Cloud-native environments
  • Distributed systems design
  • Replication protocols
  • High-availability production operations
  • Kafka or similar high-scale event streaming platforms
  • Public cloud platforms (AWS, GCP, Azure)
  • SaaS or platform engineering

Nice to have

  • Apache Druid
  • Prometheus
  • OpenMetrics
  • OpenTelemetry

What the JD emphasized

  • 14+ years of overall experience in software development and engineering
  • 4+ years of engineering management experience, leading productive, high-performing teams
  • Experience operating large-scale distributed systems in production environments (preferably cloud-native)
  • Demonstrated ability to hire and retain top engineering talent, provide impactful coaching, and drive high-performance results.
  • Proven track record of shipping features consistently and meeting aggressive deadlines with a high degree of urgency.
  • Exceptional prioritization skills with the ability to balance short-term execution with a long-term strategic vision for technical evolution.
  • Exceptional communication and collaboration skills, with a focus on building a positive, inclusive team culture aligned with organizational goals.
  • Solid fundamentals in distributed systems design, replication protocols, and high-availability production operations.
  • Deep familiarity with Kafka or similar high-scale event streaming platforms (Pulsar, Flink, etc.) in cloud environments.
  • Experience operating complex architectures across large public clouds (AWS, GCP, Azure) or private cloud-native infrastructures.
  • Strong engineering background with a hands-on approach to technology and a passion for architectural deep-dives.