Global FOC Shift Supervisor - Data Center Infrastructure

ByteDance · Big Tech · San Jose, CA · Infrastructure

This role supports global hyperscale data center facility infrastructure operations, focusing on end-to-end lifecycle management, scalability, reliability, emergency response, risk management, and operational governance. The candidate will drive implementation of O&M systems, monitor metrics, and optimize workflows.

What you'd actually do

  1. Oversee overall operation, maintenance and emergency response for global data center infrastructure. Provide remote emergency support, coordinate with on-site facility managers and local contacts, and liaise with business teams when service disruptions occur.
  2. Escalate and report major risks in a timely manner, follow up on rectification progress, and coordinate internal and external resources to ensure full closed-loop management of O&M risks.
  3. Drive the implementation, rollout and supervision of O&M management systems. Unify operational specifications across all data centers, and guide on-site teams on daily routine work.
  4. Regularly monitor and analyze regional O&M metrics, conduct quality supervision and process audits. Identify potential operational risks, and continuously optimize workflows and service quality.
  5. Support capability building for data center infrastructure platforms, including data ingestion, data governance and alarm management.

Skills

Required

  • 3+ years of working experience in data center infrastructure O&M
  • Familiar with relevant emergency management procedures
  • Effective communication and collaboration with global teams, vendors and partners
  • Strong ability to respond to unexpected faults and emergencies
  • Solid skills in cross-team planning, problem breakdown and resource coordination
  • Sound logical thinking
  • Strong sense of responsibility
  • Proficient with office and data tools
  • Consistent, standardized documentation habits
  • Able to compile and update workflows and technical documents

Nice to have

  • Bachelor's degree or above preferred, in Electrical Engineering, Mechanical Engineering, Facilities Management, Data Center Operations, Computer Engineering, or related technical disciplines
  • Equivalent hands-on O&M experience may also be considered
  • Good risk identification capability
  • Able to accurately assess major risks and complete standardized escalation and reporting
  • Awareness of closed-loop management, tracking issues through to resolution
  • Strong comprehensive analysis and problem-solving skills
  • Familiar with management workflows and O&M systems for data center infrastructure
  • Experience in developing large-scale infrastructure O&M systems is a plus
  • Strong data awareness and O&M quality control skills
  • Proficient in KPI tracking, data analysis and fault troubleshooting
  • Experience working in large enterprises
  • Experience with global cross-team collaboration
  • Experience with end-to-end fault coordination