Lead Principal Platform Software Engineer, Shepherd — Ic5

Oracle Oracle · Enterprise · United States

Lead Principal Platform Software Engineer for Oracle Cloud Infrastructure's (OCI) release orchestration platform (Shepherd). This role focuses on the architecture, design, and implementation of distributed systems for safe and reliable software deployment across cloud environments, including sovereign and regulated ones. The position requires deep expertise in cloud platform architecture, deployment orchestration, and operational excellence, with a focus on building highly available and resilient systems.

What you'd actually do

  1. Lead the architecture, design, and implementation of Shepherd platform capabilities for release orchestration, deployment safety, rollback automation, dependency modeling, and operational workflows.
  2. Design and build highly available distributed systems that operate reliably across cloud-scale, multi-region, and partially connected environments.
  3. Drive long-term platform architecture, including APIs, service boundaries, persistence models, workflow execution, resiliency, compatibility, and extensibility.
  4. Partner across engineering, SRE, security, compliance, infrastructure, and OCI service teams to deliver cross-organizational technical initiatives.
  5. Establish engineering standards for code quality, testing, documentation, observability, operational readiness, and maintainability.

Skills

Required

  • designing, building, and operating large-scale distributed systems in production cloud environments
  • cloud platform architecture
  • orchestration systems
  • APIs
  • resiliency
  • observability
  • operational safety
  • Java, Python, Go, TypeScript, or similar languages
  • leading complex cross-functional technical initiatives
  • influencing architecture across teams
  • software engineering fundamentals
  • testing
  • code quality
  • design patterns
  • maintainability
  • databases
  • distributed systems
  • API design
  • production debugging
  • AI-assisted software engineering
  • engineering automation

Nice to have

  • Oracle Cloud Infrastructure (OCI) or another hyperscale cloud platform
  • deployment orchestration
  • release automation
  • cloud control planes
  • internal developer platforms
  • compliance-sensitive or sovereign cloud environments
  • defining engineering standards, SLOs, rollout policies, and observability frameworks
  • developing reusable platform libraries, automation frameworks, and reference implementations

What the JD emphasized

  • AI-assisted engineering workflows