Senior Manager, Application Software Engineering

Oracle Oracle · Enterprise · United States

Senior Manager, Site Reliability Engineering for Oracle Health AI, leading a team responsible for the reliability, performance, security, and operational excellence of healthcare cloud services for Federal customers. The role focuses on transforming operations to an AI-assisted, automation-first model, eliminating toil through software engineering and AI agents, and ensuring high availability and security in a regulated environment.

What you'd actually do

  1. Lead and develop a team of software engineers, Site Reliability Engineers (SREs), and technical leaders responsible for the performance, availability, security, reliability, and operational excellence of Oracle Health Federal customer environments.
  2. Drive the organization's transformation to an SRE-first and DevOps operating model through adoption of Infrastructure as Code (IaC), Configuration as Code, Policy as Code, GitOps, progressive delivery, automated rollback strategies, canary deployments, self-healing infrastructure, and measurable operational toil reduction.
  3. Build AI-native operational capabilities using Oracle-approved AI technologies and secure data handling practices to accelerate software development, production support, incident response, troubleshooting, change execution, knowledge retrieval, engineering productivity, and customer operations.
  4. Eliminate repetitive operational work through software engineering, intelligent automation, AI agents, reusable runbooks, validation frameworks, self-service platforms, and exception-based operational workflows.
  5. Own operational excellence across the complete service lifecycle, including Day 0 platform deployment, Day 1 customer onboarding, and Day 2 production operations, reliability, maintenance, and continuous improvement.

Skills

Required

  • Software engineering leadership
  • Site Reliability Engineering (SRE)
  • DevOps
  • Platform engineering
  • Production engineering
  • Cloud operations
  • Cloud-native platforms
  • Infrastructure as Code (IaC)
  • AI-assisted engineering
  • AI-enabled automation
  • Agentic workflows
  • Observability
  • Incident management
  • Resilience engineering
  • Disaster recovery
  • Executive communication
  • Organizational leadership
  • Stakeholder management

Nice to have

  • Oracle Cloud Infrastructure (OCI)
  • Kubernetes
  • Containerized platforms
  • Terraform
  • CI/CD pipelines
  • Modern observability platforms
  • FedRAMP
  • DoD
  • VA
  • HIPAA
  • HITRUST

What the JD emphasized

  • AI-assisted operating model
  • AI-enabled operational excellence
  • AI-native operational capabilities
  • AI agents
  • AI-assisted engineering
  • regulated, security-sensitive, or high-availability environments
  • Federal or healthcare customers

Other signals

  • AI-assisted operating model
  • AI-enabled operational excellence
  • AI-native operational capabilities
  • AI agents
  • AI-assisted engineering