Program Manager 4-proddev

Oracle · Enterprise · BENGALURU, KARNATAKA, India

This role is for a Principal Technical Program Manager within Oracle Cloud Infrastructure (OCI), focused on AI Infrastructure Repair Operations and Fleet Health. The primary responsibility is to lead complex, cross-functional programs that improve AI infrastructure availability, repair efficiency, and operational excellence. This involves translating operational gaps into structured programs, establishing operational mechanisms, driving KPI governance, and collaborating with engineering, operations, and partner teams. The role requires strong technical program management, operational rigor, and the ability to influence across organizational boundaries in a rapidly scaling environment. Although the role deals with AI infrastructure, its core function is program management and operational improvement, not direct AI/ML model development or research.

What you'd actually do

  1. Lead one or more major AI Infrastructure Repair domains, including strategic customer availability, partner repair execution, RMA/spares governance, repair workflow optimization, data and reporting frameworks, or engineering platform tooling.
  2. Translate availability and operational gaps into structured programs with clearly defined scope, milestones, KPIs, owners, dependencies, risks, and success criteria.
  3. Establish and drive operational mechanisms that ensure accountability, visibility, and execution excellence across repair programs.
  4. Own program reviews, executive reporting, and operational readiness assessments.
  5. Drive programs focused on improving AI Infrastructure fleet availability and repair performance.

Skills

Required

  • Bachelor’s degree in Computer Science, Engineering, Information Technology, or related technical field.
  • 8+ years of experience in Technical Program Management, Program Operations, Release Management, Infrastructure Operations, or related disciplines.
  • 5+ years leading large-scale, cross-functional technical programs from conception through execution.
  • Demonstrated experience managing complex programs involving multiple stakeholders, dependencies, and operational risks.
  • Strong analytical skills with experience developing metrics, dashboards, KPIs, and executive reporting.
  • Proven ability to drive execution across engineering, operations, and business teams.
  • Excellent verbal and written communication skills, including executive-level communication.
  • Demonstrated ability to influence and drive accountability across matrixed organizations.
  • Strong organizational skills with the ability to manage multiple competing priorities simultaneously.

Nice to have

  • Master’s degree in Engineering, Computer Science, Business Administration, or related field.
  • Experience with cloud infrastructure, AI/ML infrastructure, GPU operations, fleet management, or large-scale distributed systems.

What the JD emphasized

  • critical to customer success
  • operational excellence
  • customer availability
  • operational governance
  • technical program management capabilities
  • operational rigor
  • rapidly scaling environment
  • KPIs
  • executive reporting
  • cloud infrastructure
  • AI/ML infrastructure
  • GPU operations
  • fleet management
  • large-scale distributed systems