Production Service Systems Administrator 3

Oracle Oracle · Enterprise · Romania

This role is responsible for the operation, reliability, and performance of production environments supporting critical business operations, leveraging expertise in Oracle Cloud Infrastructure (OCI). It involves troubleshooting, monitoring, automation, and incident management for large-scale systems, applications, and databases, with a focus on improving availability, scalability, and operational efficiency. Experience with SRE and DevOps practices is preferred.

What you'd actually do

  1. Monitor, administer, and support large-scale production environments.
  2. Troubleshoot and resolve complex infrastructure, application, and database issues.
  3. Serve as an escalation point for critical production incidents.
  4. Perform root cause analysis and implement preventive solutions.
  5. Recommend and drive improvements to system availability, performance, and operational efficiency.

Skills

Required

  • production environments support
  • cloud infrastructure platforms
  • OCI
  • Linux/Unix systems
  • databases
  • distributed systems
  • troubleshooting
  • monitoring
  • automation
  • incident management
  • analytical skills
  • problem-solving skills

Nice to have

  • Kubernetes
  • Terraform
  • Site Reliability Engineering (SRE)
  • DevOps practices