Senior Manager, Data Center Site Operations

Oracle · Enterprise · Thailand

This role manages the operations of OCI data center facilities, with a focus on high availability, reliability, safety, and operational excellence. Responsibilities include leading teams, driving site readiness for deployments (including AI infrastructure and liquid cooling), overseeing infrastructure deployment, managing critical electrical and mechanical systems, leading commissioning and readiness reviews, monitoring metrics, managing vendor relationships, leading incident response, and managing KPIs and budgets. The role requires extensive experience in hyperscale data center operations, critical facilities management, and leadership, along with a strong understanding of electrical and mechanical systems and experience with liquid-cooled environments. Although AI infrastructure is mentioned, the core function is data center operations management.

What you'd actually do

  1. Lead overall operations of OCI data center facilities, ensuring high availability, reliability, safety, and operational excellence.
  2. Manage and develop teams of Data Center Operations Engineers, Facility Engineers, Technicians, and vendors.
  3. Drive site readiness for new deployments, capacity expansions, AI infrastructure rollouts, and liquid-cooling implementations.
  4. Partner with Engineering, Network Operations, Construction, Capacity Planning, Security, and Global Operations teams to support business growth.
  5. Oversee deployment of server, storage, and network infrastructure at hyperscale.

Skills

Required

  • Bachelor’s degree in Engineering, Facilities Management, Data Center Operations, or a related technical field (or equivalent experience).
  • 10+ years of experience in hyperscale data center operations, critical facilities, or mission-critical infrastructure management.
  • 5+ years of leadership experience managing technical operations teams.
  • Experience working in major cloud or hyperscale environments (OCI, AWS, Azure, Google Cloud, Meta, etc.).
  • Strong knowledge of electrical and mechanical systems supporting data centers.
  • Experience with liquid-cooled and high-density computing environments.
  • Experience leading large-scale infrastructure deployment and capacity expansion projects.
  • Experience managing colocation providers, vendors, and service delivery partners.
  • Knowledge of incident management, operational governance, commissioning, and operational readiness processes.
  • Strong communication, stakeholder management, and leadership skills.

Nice to have

  • Experience with AI, GPU, or HPC infrastructure.
  • Knowledge of ASHRAE guidelines, liquid cooling technologies, and power redundancy architectures (N, N+1, 2N).
  • Experience with BMS, DCIM, EPMS, and industrial control systems.
  • Sustainability and energy-efficiency program experience.
  • Certifications such as ITIL, CDCP/CDCS/DCEP, Uptime Institute credentials, or PMP.

What the JD emphasized

  • AI infrastructure rollouts
  • liquid-cooling implementations
  • high-density computing environments