Principal Software Engineer

Oracle Oracle · Enterprise · Nashville, TN +1

This Principal Software Engineer role on the Oracle Cloud Infrastructure (OCI) Multi-cloud team focuses on building and operating large-scale cloud services. The role emphasizes using AI-assisted engineering tools (like Codex, Copilot) to enhance productivity, automate tasks, and improve system quality. Responsibilities include designing, developing, deploying, and operating Java-based cloud services, APIs, and containerized workloads, with a strong focus on reliability, scalability, and operational excellence. The role also involves leading technical projects, mentoring engineers, and participating in an on-call rotation.

What you'd actually do

  1. lead the design, development, deployment, and operation of Java-based cloud services, RESTful APIs, and containerized workloads running on Kubernetes
  2. work across OCI, Oracle, and partner teams to deliver reliable integration platforms and automation capabilities
  3. own complex features and service improvements from architecture through production, including design, implementation, testing, debugging, observability, operational readiness, and long-term maintainability
  4. help guide technical direction, review designs, mentor engineers, and raise the engineering bar for reliability, scalability, security, and supportability
  5. use AI-assisted development tools such as Codex, Claude Code, GitHub Copilot, or similar tools to improve engineering productivity, code quality, documentation, testing, and troubleshooting

Skills

Required

  • 5+ years of experience designing, building, and operating production-scale distributed systems or cloud services
  • BS/MS in Computer Science or equivalent experience
  • Deep hands-on experience with Java is strongly preferred; strong experience in another systems or application programming language, such as Python, Go, C/C++, or C#, will also be considered
  • Demonstrated ability to apply AI-assisted development tools such as Codex, Claude Code, GitHub Copilot, or similar tools to improve engineering productivity, code quality, documentation, testing, debugging, and operational workflows
  • Ability to design, create, and guide adoption of reusable prompts, workflows, tools, or AI skills that improve team execution and engineering quality
  • Deep experience designing, deploying, operating, and troubleshooting Linux-based containerized services using Docker and Kubernetes, including service configuration, networking, scaling, health checks, observability, and production incident debugging
  • Deep experience designing, building, operating, and debugging RESTful APIs over HTTPS, including API contracts, authentication, authorization, error handling, idempotency, retries, timeouts, observability, backward compatibility, and production troubleshooting
  • Experience building or operating cloud services, distributed systems, or integration platforms
  • Strong foundation in computer science fundamentals, including data structures, algorithms, operating systems, networking, and distributed systems
  • Strong experience improving CI/CD, infrastructure automation, service observability, metrics, logging, tracing, alerting, and operational readiness for production services
  • Working knowledge of cloud infrastructure, identity, networking, security, or multi-cloud environments
  • Demonstrated ownership of production reliability, including incident response, root cause analysis, operational reviews, automation, testing, and durable corrective actions
  • Demonstrated ability to lead technical projects across multiple engineers or teams
  • Experience mentoring engineers through design reviews, code reviews, debugging, operational readiness, and production support
  • Willingness to participate in an on-call rotation and support production services

What the JD emphasized

  • AI-assisted development tools
  • designing, building, and operating production-scale distributed systems or cloud services
  • Deep hands-on experience with Java is strongly preferred
  • Deep experience designing, deploying, operating, and troubleshooting Linux-based containerized services using Docker and Kubernetes
  • Deep experience designing, building, operating, and debugging RESTful APIs over HTTPS
  • Strong experience improving CI/CD, infrastructure automation, service observability, metrics, logging, tracing, alerting, and operational readiness for production services
  • Demonstrated ownership of production reliability, including incident response, root cause analysis, operational reviews, automation, testing, and durable corrective actions
  • Demonstrated ability to lead technical projects across multiple engineers or teams
  • Experience mentoring engineers through design reviews, code reviews, debugging, operational readiness, and production support