Director -software Engineering, Infrastructure

ServiceTitan ServiceTitan · Enterprise · Bengaluru, KA, India

Director of Infrastructure Software Engineering focused on SQL Reliability & Observability to lead an engineering organization at the intersection of Infrastructure, SRE, and Product Engineering. The role aims to improve uptime, customer experience, and cost efficiency by transforming systems from reactive operations to proactive, data-driven reliability engineering and building a world-class team. Responsibilities include defining strategy for SQL reliability, data platform stability, and observability; overseeing performance, scaling, and availability of various databases; leading development of a unified observability platform; implementing proactive monitoring and automation including AI-driven anomaly detection and SRE Agent/AI-assisted debugging systems; and building/leading a high-performing team.

What you'd actually do

  1. Lead Reliability & Observability Strategy
  2. Lead Database Reliability & Performance
  3. Build the Observability Platform
  4. Drive Proactive Monitoring & Automation
  5. Build and Lead a High-Performing Team

Skills

Required

  • 10–15+ years of experience in software engineering, SRE, or infrastructure roles
  • 5+ years in engineering leadership at the Manager or Director level
  • Proven track record managing large-scale distributed systems and databases
  • BS or MS in Computer Science or equivalent experience
  • Deep expertise in SQL and relational database systems
  • Strong hands-on experience with cloud platforms, Azure preferred
  • Solid understanding of observability platforms (metrics, logs, tracing), Kubernetes, and performance engineering

Nice to have

  • Familiarity with NoSQL systems (Cosmos DB, MongoDB, Redis, Kafka), OpenTelemetry, and CI/CD reliability practices
  • Exposure to AI/ML-driven monitoring or root cause analysis systems is a plus

What the JD emphasized

  • SQL Reliability & Observability
  • AI-driven anomaly detection
  • SRE Agent / AI-assisted debugging systems