Commissioning & Reliability Lead

Oracle Oracle · Enterprise · United States

This role focuses on the commissioning and reliability of mission-critical power systems for Oracle Cloud Infrastructure (OCI) data centers. The Lead will ensure a smooth and stable operational startup by developing and executing integrated commissioning plans, identifying and resolving system-level gaps, and establishing reliability readiness. While the team overview mentions AI integration, the core responsibilities of this role are centered on power systems engineering and commissioning, not direct AI/ML development.

What you'd actually do

  1. Develop and execute the integrated commissioning plan spanning generation, substations, protection, controls/EMS, and data center power distribution.
  2. Define and manage hold points, witness points, readiness gates, and acceptance criteria; ensure test evidence is complete and auditable.
  3. Lead system-level commissioning activities and critical tests, including:
  4. Black-start rehearsals and restoration sequencing validation
  5. Islanding and re-synchronization tests (as applicable)
  6. Power quality acceptance testing at data center PCCs (voltage/frequency regulation, harmonics/flicker where applicable, transient response)
  7. Identify and close system-level gaps during commissioning (integration issues across OEM boundaries, settings conflicts, undocumented dependencies, sequencing errors).
  8. Drive structured troubleshooting and root-cause analysis for:
  9. Control hunting/instability
  10. Protection miscoordination
  11. Nuisance trips and unexpected interactions across integrated assets
  12. Coordinate OEMs, EPCs, utilities, and operators in live/mission-critical environments; align stakeholders on safe sequencing and clear “go/no-go” criteria.
  13. Establish operational reliability readiness for turnover, including:
  14. RAM targets/assumptions and early-life reliability approach
  15. Spares recommendations and inventory readiness
  16. Maintenance readiness, procedures, and runbooks
  17. Training needs and turnover package expectations
  18. Maintain commissioning risk register, punch list, retest plans, and corrective action tracking through stabilization and turnover to O&M.

Skills

Required

  • Commissioning large-scale generation, T&D/substations, microgrids, or equivalent mission-critical power systems
  • System-level commissioning
  • Troubleshooting and root-cause analysis
  • Stakeholder coordination (OEMs, EPCs, utilities, operators)
  • Reliability readiness establishment
  • Documentation and communication

Nice to have

  • Commissioning integrated systems with inverter-based resources (BESS, fuel cells, etc.), thermal generation, and utility interconnections
  • Protection and controls commissioning workflows
  • PCC power quality testing
  • Building repeatable commissioning governance
  • Familiarity with large-scale cloud provider / hyperscale data center power environments

What the JD emphasized

  • 10+ years commissioning large-scale generation, T&D/substations, microgrids, or equivalent mission-critical power systems.
  • Demonstrated track record bringing complex systems to COD and stabilizing operations during the early-life period (first 90 days).
  • Strong experience coordinating OEMs, EPCs, utilities, and operators in live environments with strict safety and uptime expectations.
  • Proven ability to detect and resolve system-level integration gaps (beyond component-level commissioning).