Data Center Operations Manager, Server Operations, Hardware Operations

Google Google · Big Tech · Bridgeport, AL +1

This role manages a team of Data Center Technicians responsible for the installation, configuration, testing, troubleshooting, and maintenance of server hardware and components, as well as server software and networking equipment. The role involves overseeing quality installation, managing complex installations/troubleshooting, leading small project teams, and developing project contingency plans. It also includes managing a team of Machine Learning (ML) travelers remotely and contributing to 24/7 initiatives.

What you'd actually do

  1. Lead a team of individuals, communicate individual and team priorities that support organizational goals to repair, fix, and perform preventative maintenance on equipment, servers, machines, or infrastructure based on issues.
  2. Partner with teams to meet goals and stakeholders to manage facility activities and set/implement strategies.
  3. Maintain, monitor, and execute security and operational procedures and analyze trends to identify opportunities for improvements ensuring alignment with organizational policies.
  4. Support and contribute to the implementation of Environmental Health and Safety (EHS) and other compliance programs and initiatives in collaboration with other teams to ensure environmental and safety incidents are investigated, resolved, and reported.
  5. Manage a team of Machine Learning (ML) travelers remotely and contribute and support 24/7 initiatives.

Skills

Required

  • technical leadership
  • hardware installation and maintenance
  • server hardware and components
  • server software
  • networking equipment
  • Linux/Unix system administration
  • team management
  • vendor management
  • contract management
  • service delivery
  • ability to work non-standard hours

Nice to have

  • data center operations
  • large-scale infrastructure building and operation
  • network and compute architecture and lifecycle
  • strategic initiative execution
  • global environment experience
  • data gathering, analysis, and presentation
  • EHS initiative leadership and improvement