Data Center Controls Network Engineer

OpenAI OpenAI · AI Frontier · San Francisco, CA · Scaling

OpenAI is seeking a mid to senior OT Network Engineer to design, validate, and scale the controls and OT network architectures for high-density AI data centers. This role involves defining requirements, developing reference architectures, and ensuring the resilience, security, and scalability of OT network designs, partnering with various engineering and operations teams.

What you'd actually do

  1. Define controls, automation, and OT network requirements for AI data center campuses.
  2. Develop reference architectures, engineering standards, and reusable design templates.
  3. Review and develop basis-of-design and functional design documents, including OT network diagrams, IP/VLAN schemes, telemetry architectures, data flow diagrams, and commissioning requirements.
  4. Design OT and infrastructure network architectures, including physical topology, logical topology, IP addressing, subnetting, VLANs, routing, switching, redundancy, segmentation, firewall policy coordination, out-of-band management, monitoring, and remote access patterns.
  5. Develop day-two network operations requirements, including change management, configuration backups, golden configurations, monitoring thresholds, firmware lifecycle, rollback plans, and post-change validation.

Skills

Required

  • 8+ years of relevant experience in controls engineering, industrial automation, OT networking, mission-critical facilities, or similar critical infrastructure environments.
  • Strong expertise in resilient OT network architecture, implementation, troubleshooting, and lifecycle support.
  • Experience with OT/IT boundary design, secure enterprise integration, firewall policy design, redundant topologies, out-of-band management, and monitoring.
  • Hands-on experience with Layer 3 OT network design, including IP addressing, subnetting, routing, VRFs, ACLs, inter-VLAN traffic control, and network segmentation.
  • Hands-on experience with Layer 2 security and controls, including MACsec, port security, loop prevention, and switch-level access control.
  • Hands-on experience in designing resilient OT network topologies using industrial redundancy protocols and architectures such as PRP, HSR, Cisco REP, RSTP/MSTP, and ring or star topologies.
  • Hands-on experience in designing resilient infrastructure network architectures using HSRP/VRRP, spine-leaf topologies, redundant uplinks, and failure-domain isolation.
  • Hands-on experience with industrial and infrastructure network equipment such as Cisco switches/routers, Juniper switches/routers, Palo Alto firewalls, Rockwell Automation Stratix switches, Siemens Ruggedcom or comparable industrial networking platforms.
  • Experience with network management and observability platforms such as Cisco Catalyst Center (DNA Center), Palo Alto Panorama, Juniper Mist, industrial NMS tools, packet brokers, and OT monitoring platforms.
  • Hands-on experience with industrial Ethernet, VPN tunneling, IPsec-based connectivity, and secure remote access.
  • Hands-on experience with virtualized OT or controls server environments such as VMware vSAN, Microsoft Azure Stack HCI / Hyper-V, or comparable infrastructure platforms.
  • Experience with industrial communication and OT infrastructure protocols, including BACnet/IP, BACnet MSTP, Modbus TCP/RTU, OPC UA, IEC-61850 MMS/GOOSE, MQTT, SNMP, syslog, NTP/PTP, IRIG-B, and vendor-specific interfaces, and strong understanding of their behavior across OT network architectures.
  • Experience reviewing and producing technical design documentation, commissioning plans, and acceptance test procedures.
  • Experience with factory witnessed testing, site acceptance testing, failover testing, telemetry validation, protocol compatibility testing, and root-cause analysis.