Principal Data Center Facilities Engineer

Oracle Oracle · Enterprise · United States

This role is for a Principal Data Center Facilities Engineer responsible for technical leadership in managing and expanding data center growth, focusing on electrical, mechanical, and controls systems, including liquid-to-chip solutions. The role involves supporting new capacity deployment, representing operations in due diligence and design, establishing processes, collaborating with colocation providers, and ensuring reliability, safety, compliance, and efficiency across global sites. Key responsibilities include critical environment maintenance, incident response, site commissioning, operations management, engineering design, governance, training, and colocation vendor management.

What you'd actually do

  1. Define and govern enterprise standards for preventive and predictive maintenance across electrical, mechanical,liquid to chip, and life-safety systems; ensure consistent execution and documentation across all sites.
  2. Lead advanced diagnostics for systemic reliability issues; drive remediation programs that eliminate recurring failure modes and materially improve site availability and energy efficiency.
  3. Lead high-severity incident response across regions; coordinate rapid stabilization, stakeholder communications, and executive updates; ensure durable corrective and preventive actions are implemented at scale.
  4. Spearheads collaboration with design, construction, facility engineering, and operations teams to validate designs, influence decision-making to integrate new systems with existing infrastructure and tooling alignment with operations objectives.
  5. Serve as principal SME for complex and novel scenarios in operations of mission-critical systems (power, cooling, BAS, fire/life safety)

Skills

Required

  • Data center facilities engineering
  • Electrical systems
  • Mechanical systems
  • Controls systems
  • Mission-critical environments
  • Change management
  • Preventive and predictive maintenance
  • Incident response
  • Root cause analysis (RCA)
  • Corrective and Preventive Actions (CAPA)
  • Site commissioning
  • Design reviews
  • Uptime requirements
  • Integrated systems testing
  • Operations management
  • Power systems
  • Cooling systems
  • BAS (Building Automation Systems)
  • Fire/life safety systems
  • Engineering Change Advisory Board (CAB) process
  • Operational risk management
  • Work practices
  • Audit readiness
  • Capacity planning
  • Performance optimization
  • Sustainability objectives
  • PUE reduction
  • Utilization improvement
  • Operating cost control
  • Contract negotiation
  • Due diligence validation
  • Vendor management
  • Service Level Agreements (SLAs)
  • Continuous improvement
  • Cost optimization
  • Technical leadership
  • Problem-solving
  • Communication

Nice to have

  • Liquid-to-chip solutions
  • Colocation vendor management
  • Global site operations
  • Cross-site knowledge sharing

What the JD emphasized

  • mission-critical environments
  • senior technical leader
  • strong background in electrical, mechanical, and controls systems
  • experience in administering and maintaining mission-critical environments
  • highly self-directed
  • strong experience in data center design and critical infrastructure operations
  • Exceptional customer focus
  • effective collaboration
  • Critical Environment Maintenance Support
  • advanced diagnostics for systemic reliability issues
  • Incident Management and Operation Improvement
  • high-severity incident response
  • Institutionalize standardized root cause analysis (RCA) methodologies and CAPA governance
  • Site Commissioning and Build
  • technical oversight for site assessments, design reviews, uptime requirements, commissioning strategy, and integrated systems testing
  • Data Center Operations Management
  • principal SME for complex and novel scenarios in operations of mission-critical systems
  • Engineering Design, Leadership, and Governance
  • Define and maintain best-in-class policies, technical standards, and playbooks
  • senior engineering guidance for complex design challenges, lifecycle strategies, and modernization roadmaps
  • Colocation Vendor Management
  • SME technical support for contract negotiations