Senior Software Engineer - AI Infrastructure

Oracle · Enterprise · Austin, TX +1

Senior Software Engineer role focused on developing and owning software for Oracle's Cloud Infrastructure, specifically within the Compute Bare Metal Provisioning team. Responsibilities include automating server lifecycle management, hardware bring-up, instance provisioning, and firmware management, working with bare metal hardware and full-stack orchestration. The role involves building high-performance, scalable micro-services and tooling for provisioning, configuring, securing, and validating server platforms across OCI's fleet, with a focus on cutting-edge GPU infrastructure.

What you'd actually do

  1. own the software design and development for major components of Oracle's Cloud Infrastructure
  2. automate the full server lifecycle, from new platform shape (AMD/Intel/Arm/Nvidia) creation and hardware bring-up to customer-ready instance provisioning and firmware management
  3. build high-performance, scalable micro-services and tooling that provision, configure, secure, and validate server platforms across OCI’s massive fleet of Compute and GPU Infrastructure
  4. partner closely with other teams in Compute, Networking, Security, Data Center Engineering, and Hardware Development to ensure OCI can launch, scale, and maintain new server platforms with minimal operational overhead and high reliability
  5. work directly with cutting-edge GPU hardware and see the direct impact of your work on the business

Skills

Required

  • distributed systems
  • Linux engineering
  • software design
  • software development
  • Object Oriented programming (C++ or Java)
  • scripting languages (Python)
  • data structures
  • algorithms
  • operating systems
  • distributed systems fundamentals
  • Terraform
  • networking protocols (TCP/IP, HTTP)
  • network architectures
  • databases
  • NoSQL systems
  • storage
  • distributed persistence technologies
  • troubleshooting
  • performance tuning

Nice to have

  • multi-tenant, virtualized infrastructure

What the JD emphasized

  • Systems triage experience
  • Systems experience
  • Systems debugging