Lead Data Center Support Services Technician

Oracle Oracle · Enterprise · Santa Clara, CA +1

This role leads complex data center hardware support services, focusing on ensuring resilience, reliability, and availability. Responsibilities include managing change activities, serving as the highest operational escalation point for incidents, partnering with engineering teams for improvements and automation, and mentoring team members. The role also involves overseeing network configuration, installation, and augmentation, technical support, troubleshooting, safety, security, compliance, and continuous improvement initiatives.

What you'd actually do

  1. Leads complex, data center hardware support services to ensure resilience, reliability, and availability at scale.
  2. Designs, governs, and signs off on advanced change activities and site augmentations, driving standardization of SOPs/MOPs across regions.
  3. Serves as the highest operational escalation point for incidents, leading cross-functional triage, root-cause analysis, and long-term corrective actions.
  4. Partners with engineering, capacity, networking, and security teams to anticipate risks, implement architectural improvements, and automate repeatable workflows.
  5. Mentors and develops team mates, uplifts best practices, and influences policy and process changes that improve performance across the group and organization.

Skills

Required

  • Data center hardware support
  • Incident management
  • Root-cause analysis
  • Change management
  • Network configuration
  • Technical troubleshooting
  • Safety and compliance
  • Mentoring
  • Process improvement
  • Automation

Nice to have

  • Power distribution practices
  • Hardware interventions
  • Cabling
  • Upstream and downstream systems integration
  • Ticket taxonomy
  • Triage playbooks
  • Automation triggers
  • Physical security procedures
  • Audit requirements
  • Functional testing of security controls
  • Documentation
  • Audit trails
  • Change records
  • Compliance reviews
  • Post-incident reviews
  • Enterprise-level RCAs
  • Availability risks
  • Single points of failure mitigation
  • Observability
  • Alert fidelity
  • Vendor relations
  • On-call readiness plans

What the JD emphasized

  • highest operational escalation point
  • highest operational escalation point
  • highest operational escalation point