Senior Software Engineer AI Infra (joinoci-sde)

Oracle Oracle · Enterprise · Austin, TX +1

Senior Software Engineer AI Infra role focused on building and scaling bare-metal provisioning systems for Oracle Cloud Infrastructure (OCI), specifically supporting AI and GPU workloads. The role involves designing and developing highly reliable and secure infrastructure for server lifecycle management, from hardware bring-up to instance provisioning and firmware management.

What you'd actually do

  1. design, build, and scale the next generation of bare-metal provisioning systems powering millions of servers worldwide
  2. develop highly reliable and secure infrastructure, tackle complex distributed systems challenges, and help deliver the foundation for OCI’s most performant compute services
  3. The Compute Bare Metal Provisioning team owns the critical infrastructure responsible for automating the full server lifecycle from new platform shape (AMD/Intel/Arm/Nvidia) creation, hardware bring-up to customer-ready instance provisioning and firmware management
  4. The team builds high performance, scalable micro-services and tooling that provision, configure, secure, and validate server platforms across OCI’s massive fleet of Compute and GPU Infrastructure
  5. own the software design and development for major components of Oracle’s Cloud Infrastructure

Skills

Required

  • 5-8 years' experience delivering and operating large scale, highly available distributed systems, Linux development and Systems debugging
  • Strong knowledge of Object Oriented programming such as C++ or Java, and experience with scripting languages such as Python
  • Strong knowledge of data structures, algorithms, operating systems, and distributed systems fundamentals
  • Experience with tools such as Terraform for Infrastructure as Code
  • Working familiarity with networking protocols (TCP/IP, HTTP) and standard network architectures
  • Strong understanding of databases, storage and distributed persistence technologies
  • Strong troubleshooting and performance tuning skills

Nice to have

  • Experience building multi-tenant, virtualized infrastructure a strong plus

What the JD emphasized

  • next generation of bare-metal provisioning systems
  • AI infrastructure
  • GPU infrastructure
  • cutting edge GPU hardware