Principal Site Reliability Developer

Oracle · Enterprise · United States

This Principal Site Reliability Developer role focuses on the infrastructure behind Oracle GoldenGate and Autonomous Database (ADB) within Oracle Cloud Infrastructure (OCI). Responsibilities include solving complex infrastructure problems, building automation to prevent recurrence, designing and deploying software that improves availability, scalability, and efficiency, and managing large-scale distributed systems. The role involves full-stack ownership of services, capacity planning, performance analysis, and partnering with development teams to improve service architecture and capabilities. Key areas include real-time data migration, replication, and cross-domain orchestration.

What you'd actually do

  1. Solve complex problems related to GoldenGate and ADB infrastructure in OCI and build automation to prevent problem recurrence.
  2. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
  3. Develop designs, architectures, standards, and methods for large-scale distributed systems.
  4. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
  5. Work with the Site Reliability Engineering (SRE) team on shared full-stack ownership of a collection of services and/or technology areas.

Skills

Required

  • Site Reliability Engineering (SRE)
  • distributed systems
  • automation
  • performance tuning
  • capacity planning
  • system architecture
  • cloud infrastructure (OCI)
  • data replication technologies (GoldenGate)
  • database infrastructure (ADB, MSSQL, MySQL)

Nice to have

  • software development
  • security best practices
  • troubleshooting complex issues
  • cross-domain orchestration

What the JD emphasized

  • mission critical stack
  • security, resiliency, scale, and performance
  • end-to-end performance and operability
  • automation and orchestration principles
  • complex or critical issues
  • deep understanding of service topologies and their dependencies
  • distributed systems
  • real-time CDC and replication
  • GoldenGate Microservices Architecture
  • cross-domain / tenancy orchestration