Senior Site Reliability Engineer

Oracle Oracle · Enterprise · United States

Seeking a Senior Site Reliability Engineer to build and operate reliable, scalable cloud-native platforms and services for Oracle Health's healthcare technology initiatives. The role focuses on platform engineering, cloud infrastructure, distributed systems, and operational tooling, with collaboration across various teams to ensure service health, respond to incidents, and support modernization efforts including AI-driven capabilities.

What you'd actually do

  1. Design, build, test, and operate reliable cloud infrastructure, platform capabilities, and services on Oracle Cloud Infrastructure and legacy deployment models.
  2. Partner with software engineering teams to develop scalable, resilient services, APIs, integrations, and distributed systems.
  3. Forecast capacity needs, analyze service trends, and take proactive steps to ensure systems can support current and future workloads.
  4. Monitor service health, availability, latency, performance, and capacity using observability and reporting tools.
  5. Participate in incident response, troubleshooting, root cause analysis, postmortems, and follow-up remediation.

Skills

Required

  • cloud infrastructure
  • distributed systems
  • automation
  • observability
  • incident response
  • CI/CD
  • DevOps

Nice to have

  • healthcare interoperability
  • large-scale healthcare data platforms
  • AI-enabled capabilities