Principal Site Reliability Engineer (database) — Oracle Health, Platform Engineering

Oracle Oracle · Enterprise · United States

This role focuses on Site Reliability Engineering for Oracle databases and Exadata infrastructure, with an emphasis on using AI-assisted tools to enhance developer productivity, code quality, and operational improvements. The role involves supporting production and non-production environments, troubleshooting, capacity management, automation, and contributing to cloud migration initiatives. While AI tools are encouraged for productivity, the core function is not AI model development but rather leveraging AI to improve existing engineering processes.

What you'd actually do

  1. Support day-to-day operations of Oracle databases and Exadata (Prod and Non-Prod), including incident response and on-call support as needed (in alignment with local regulations).
  2. Triage database alerts and issues; perform deep-dive troubleshooting, root cause analysis, and implement corrective/preventative actions.
  3. Perform capacity management, performance analysis, and reliability improvements for database platforms.
  4. Develop and improve automation, tooling, and scripts to reduce toil and improve operational consistency.
  5. Contribute to roadmap projects, including migration planning/execution for OCI and Autonomous Database.

Skills

Required

  • 6+ years of experience as an Oracle DBA, Site Reliability Engineer, or Oracle Database Architect.
  • 6+ years of experience managing scalable on-prem and/or cloud-native distributed systems.
  • Hands-on experience with PL/SQL and Python, Perl, and/or Shell scripting.
  • Experience supporting production databases running on Exadata.
  • Oracle Database administration and operations
  • Oracle Grid Infrastructure, ASM & RAC
  • Oracle Cloud (OCI) fundamentals (migration and/or operations)
  • Scripting/automation (PL/SQL, shell, Python/Perl)
  • Observability and operational readiness (monitoring/alerting, runbooks, incident response)

Nice to have

  • Oracle Maximum Availability Architecture (MAA) and Exadata best practices
  • High availability & replication technologies (e.g., Data Guard, GoldenGate)
  • Advanced scripting/coding and automation engineering (Shell/Perl/Python)
  • Advanced compression
  • Security Technical Implementation Guides (STIGs) and secure operations practices
  • Oracle Autonomous Database experience

What the JD emphasized

  • U.S. citizenship required due to security clearance requirements.
  • Must be able to obtain and maintain the required security clearance.