Site Reliability Developer 3

Oracle Oracle · Enterprise · Seattle, WA +1

This role focuses on Site Reliability Engineering for cloud infrastructure services, specifically within Oracle's National Security Realms. Responsibilities include building automation, improving availability and scalability of services, capacity planning, and providing cloud operations support. The role involves working in a 24/7 shift rotation and requires experience with Linux, Kubernetes, Terraform, and scripting languages. It is not directly involved in AI/ML model development but supports the infrastructure that may host such services.

What you'd actually do

  1. Solve complex problems related to infrastructure cloud services and build automation to prevent problem recurrence.
  2. Design, write, and deploy software to improve the availability, scalability, and efficiency of Oracle products and services.
  3. Design and develop designs, architectures, standards, and methods for large-scale distributed systems.
  4. Facilitate service capacity planning and demand forecasting, software performance analysis, and system tuning.
  5. Provide cloud operations for Oracle National Security Realms.

Skills

Required

  • Linux and Unix operating systems
  • Docker, Kubernetes, and Terraform
  • Scripting languages such as Shell, Perl, Python, Java, and Go
  • Proficient with writing services/task automation in Python, Bash, Ruby, Perl, JavaScript, or Java
  • Deep knowledge of Linux internals and host-based networking
  • Knowledge of Linux and/or Unix operating systems
  • Familiarity with configuration management solutions such as Chef, Puppet, etc
  • Experience with devising, managing, and extending monitoring solutions for large scale environments.
  • Knowledge of cloud computing concepts
  • Experience working in a mission-critical environment (Operations, Technical Support, NOC etc)
  • Proficient with communication skills (writing, organization, learning exchange)
  • Experience executing tasks under change management procedures
  • Experience resolving auto-cut and manual alarms following runbooks
  • A focus on customer satisfaction
  • US Citizenship
  • U.S. Citizenship and possess and maintain TS/SCI w/Poly security clearance

Nice to have

  • A desire to learn and keep up with modern technologies
  • Familiarity with core protocols (DNS, DHCP, HTTP, TCP)

What the JD emphasized

  • US Citizenship
  • TS/SCI w/Poly security clearance