Director Pcs Cloud Operations Sre

GE Healthcare GE Healthcare · Healthcare · Bengaluru, Karnātaka, India · Digital Technology / IT

Director of Cloud Operations for GE Healthcare's PCS division, focusing on SRE and CloudOps for mission-critical applications. The role involves establishing operating foundations, metrics, and automation to ensure high reliability and security, meeting product SLAs, and scaling Cloud Operations. Key responsibilities include implementing SRE practices, building end-to-end observability, driving change management, establishing DR/BCP, leading FinOps, standardizing CI/CD, and developing a high-performing team.

What you'd actually do

  1. Own Cloud Operations for PCS cloud applications; stand up and scale CloudOps capabilities to support multiple products while adhering to committed SLAs.
  2. Institutionalize SRE practices: implement SLI/SLO/SLA frameworks, error budgets, incident/post‑mortem processes, and reliability runbooks; champion automation to reduce toil and improve service health and monitoring.
  3. Build end‑to‑end observability (APM/RUM, logs, metrics, traces, health dashboards, proactive alerting) and evolve toward auto‑healing and AIOps for anomaly detection and closed‑loop remediation.
  4. Drive change, incident, and problem management with clear RACI and stakeholder communications; reduce MTTR through streamlined L1–L4 escalation.
  5. Establish and test DR/BCP posture; conduct AWS Well‑Architected and operational readiness reviews for services (AWS‑first, with multi‑cloud considerations as needed).

Skills

Required

  • Bachelor’s degree in computer science or a STEM field
  • 10+ years of experience leading technical teams
  • 5+ years of Cloud Ops and SRE leadership experience
  • DevSecOps expertise
  • Day-2 Ops expertise
  • APM/RUM expertise
  • Cloud Operations expertise
  • Public cloud service building and operation (AWS-first)
  • CI/CD proficiency
  • Infrastructure-as-Code proficiency (e.g., Terraform/CloudFormation)
  • SLI/SLO/SLA establishment experience
  • Observability establishment experience
  • Incident management establishment experience at scale
  • Leadership and team management skills
  • Project management skills
  • SaaS technologies knowledge
  • Cloud computing knowledge
  • Medical device development processes knowledge

Nice to have

  • Experience scaling CloudOps/SRE for multiple products and customer deployments
  • Deep fluency in SLI/SLO/SLA design, error budgets, runbooks, and auto-healing patterns
  • Strong AWS architecture and operations
  • Well-Architected reviews
  • Capacity and cost optimization (FinOps)
  • Modern observability (APM/RUM/logs/metrics/traces)
  • AIOps for predictive analytics/anomaly detection
  • Security by design (DevSecOps, policy-as-code)
  • DR/BCP planning/testing
  • Clear, decisive communication
  • Influence across product, platform, and security stakeholders
  • Builder-coach mindset
  • Change agent
  • Ownership, bias for action, and strong judgment

What the JD emphasized

  • high reliability
  • high security
  • secure, reliable deployments
  • secure-by-default pipelines
  • medical device development processes