Software Development Director, AI Infrastructure

Oracle · Enterprise · Austin, TX +1

Director of Software Engineering to lead validation for New Product Introduction of GPU SKUs and to own their health across the lifecycle, driving technical strategy and mentoring a team. The role is hands-on with GPU servers, covering configuration, validation, benchmarking, debugging, and repair. It focuses on using AI for automation, diagnosis, and repair, and on implementing DevOps and CI/CD practices.

What you'd actually do

  1. Define and execute the technical and product strategy for GPU Validation across existing and new GPU SKUs.
  2. Ensure smooth and reliable delivery of new capacity via Validation.
  3. Work with customers to align on quality expectations and maximize customer fleet health.
  4. Constantly monitor and improve procedures and systems, pushing for greater automation.
  5. Leverage AI to build feedback loops that accelerate diagnosis and repair.

Skills

Required

  • 12+ years of software engineering leadership experience
  • Experience operating at fleet infrastructure level
  • Experience delivering high-availability cloud services
  • Strong focus on security, compliance, and operational excellence
  • Ability to lead, inspire, and develop world-class engineering teams
  • Experience collaborating with cross-functional, geographically distributed teams and stakeholders
  • Excellent communication skills
  • Strategic thinking capabilities
  • Analytical problem-solving abilities

Nice to have

  • Experience with Oracle Cloud Infrastructure (OCI) stack
  • Familiarity with regulatory requirements and compliance frameworks

What the JD emphasized

  • GPU servers
  • validation
  • benchmarking
  • debugging
  • diagnosis
  • repairs
  • AI

Other signals

  • Automate diagnosis with increasing reliance on AI
  • innovative AI solutions