Software Developer 4

Oracle · Enterprise · Nashville, TN +1

This role focuses on ensuring the availability, performance, and security of Oracle Cloud Infrastructure (OCI) services by managing and mitigating major incidents, automating tasks, and improving operational processes. It requires a deep understanding of cloud computing, incident management, and distributed systems, with an emphasis on minimizing customer impact and driving continuous improvement in a mature cloud environment.

What you'd actually do

  1. Solve complex problems related to infrastructure cloud services and automate common tasks to ensure continuous availability with minimal human intervention.
  2. Command and coordinate SMEs and service leaders to restore services as quickly as possible during major incidents, while keeping accurate and timely data on the progress of such incidents.
  3. Utilize a deep understanding of cloud computing design patterns and their dependencies to mitigate complex major incidents.
  4. Apply a methodical approach to troubleshooting the large, complex, interconnected systems used in incident detection and orchestration.
  5. Document pertinent information related to incidents that aids process improvement, identifies deviations, and enables the creation of an incident knowledge base.

Skills

Required

  • public cloud operations experience (e.g., AWS, Azure, GCP, OCI)
  • Extensive experience with Major Incident Management in a cloud-based environment
  • automation and orchestration principles
  • modern object-oriented programming language
  • professional software engineering standard methodologies such as Agile project management, coding standards, code reviews, source control management, build processes, testing, and operations
  • infrastructure automation tools such as Chef, Ansible, Jenkins, Terraform
  • Infrastructure-as-a-Service
  • CI/CD systems
  • Docker
  • RESTful APIs
  • log analysis tools
  • debugging tools

Nice to have

  • Bachelor’s degree or higher in Computer Science or relevant work experience.

What the JD emphasized

  • public cloud operations experience
  • Extensive experience with Major Incident Management in a cloud-based environment
  • Clear understanding of automation and orchestration principles