Senior Support Engineer - Dublin

OpenAI OpenAI · AI Frontier · Dublin, Ireland · User Operations

Senior Support Engineer role focused on providing technical guidance, resolving complex issues, and supporting enterprise customers using OpenAI's API platform. The role involves designing and running operational processes, monitoring, incident response, and leveraging AI/automation to scale support operations. It acts as the last line of defense before core engineering.

What you'd actually do

  1. Be among the foremost technical and troubleshooting experts for our API platform at OpenAI. You are the last line of defense before the core Engineering team.
  2. Proactively identify and implement opportunities to scale support operations by leveraging automation and advancements in AI technologies. Contribute to shaping the future of technical support in an AI-driven era.
  3. Configure and use advanced monitoring and alerting workflows to proactively detect customer impacting issues in real time.
  4. In partnership with engineering, contribute to reliability reviews and preparedness for new features, launches, or strategic customer requirement updates. Ensure that operational readiness (monitoring, alerting, and fallback plans) is in place for any such changes.
  5. Design and refine incident response processes and documentation across strategic customers, engineering and support teams.

Skills

Required

  • Bachelor’s degree in Computer Science or a related field
  • 5+ years of experience in technical operations roles such as SRE/NOC
  • designing monitoring systems
  • resolving production issues in fast-paced and mission-critical environments
  • troubleshooting complex technical problems at the systems level
  • modern monitoring, alerting, and observability practices
  • Hands‑on experience setting up or managing metrics, logging, and tracing for distributed systems
  • SLIs/SLOs, alert tuning, dashboard creation
  • leading incident response for high‑severity outages or service disruptions
  • real‑time incident coordination, root cause analysis, and drive follow‑ups
  • scripting or software engineering (e.g., Python or similar)
  • solid understanding of cloud infrastructure and distributed systems fundamentals
  • cloud services, load balancers, databases, and containerized applications
  • working cross‑functionally
  • communication skills

Nice to have

  • automation-first mindset
  • advancements in AI technologies

What the JD emphasized

  • last line of defense before the core Engineering team
  • low volume, high difficulty
  • last line of defense
  • highest-severity outages or service disruptions