AI Infrastructure Operations Engineer

Cerebras · Semiconductors · Headquarters +1 · Development Infrastructure

Entry-level AI Infrastructure Operations Engineer responsible for deploying, monitoring, and troubleshooting Cerebras AI infrastructure in data center environments. Supports CS systems, cluster server hardware, networking hardware, and telemetry tools.

What you'd actually do

  1. Assist with deployment and bring-up of CS-X systems, cluster servers, and networking hardware
  2. Monitor hardware telemetry, alerts, and dashboards
  3. Perform first-line troubleshooting and structured escalation
  4. Collect logs, telemetry, and observations during incidents
  5. Use existing monitoring, telemetry, and incident tracking tools

Skills

Required

  • Bachelor’s degree in a relevant engineering field or equivalent experience
  • 0–3 years experience in hardware operations, systems engineering, or datacenter environments
  • basic familiarity with server hardware
  • networking fundamentals
  • Linux systems

Nice to have

  • Internship or early-career experience in datacenter or hardware lab environments
  • exposure to monitoring or telemetry systems
  • comfort working in data centers