Program Manager, Operations and Sustainment

Anduril · Defense · Costa Mesa, CA · Connected Warfare : Edge Compute & Communications Engineering : ECC Programs and Partnerships

Program Manager for a defense technology company's AI-powered ground station product (TITAN). The role focuses on sustaining deployed systems, diagnosing and resolving hardware/software issues, improving system observability, analyzing failure data, and supporting global customers. Requires strong problem-solving, technical system mastery, and communication skills, with a focus on operational readiness and reliability in a defense context.

What you'd actually do

  1. Sustain Anduril's TITAN deployments by combining an understanding of our customers' missions with familiarity with our products and delivered capabilities
  2. Triage, diagnose, and root-cause product incidents, driving postmortem actions and providing status visibility through resolution. Then generalize from individual incidents to improve wider system and product reliability. This includes conducting log reviews and diagnostics, and using Linux and command-line tools for system management and issue resolution
  3. Consistently assess, and seek to improve, the quality of the fleet's observability and health telemetry in partnership with multiple functions across the TITAN team
  4. Collect, organize, and analyze system failure data to define trends, drive proactive sustainment processes, and support resource allocation
  5. Support Anduril's global customers through proactive communications and detail-oriented execution. You will track and manage customer-facing incidents and concerns, ensuring timely response, mitigation, and communication of resolution

Skills

Required

  • Demonstrated experience as a self-starter, able to find and resolve issues on your own
  • Strong aptitude for problem solving in unstructured situations at the interface of hardware, software, and networking. Ability to drive challenging and vague technical problems to clarity and resolution
  • Proven ability to master a technical system and support it in operational environments. Must demonstrate an innate drive to be self-sufficient across the depth and breadth of a technical system
  • Comfortable navigating ambiguity and crafting new ways of doing things
  • Excellent written, visual, and verbal communication skills
  • Active Top Secret clearance

Nice to have

  • Experience supporting and/or operating compute-enabled communications systems, including electronic warfare domain experience, as a DOD employee, contractor, or end-user
  • Experience supporting and/or performing incident-driven workflows requiring analysis, triage, prioritization, and reporting
  • Experience in on-call support operations and working in environments with limited risk tolerance
  • Experience executing sustainment and reliability processes for a defense-focused service or product
  • Experience contributing to data analytics to inform business decisions
  • Experience with observability tooling such as Grafana and exposure to software development tooling such as Git and Jira
  • Military, DOD, or other Government agency experience preferred
  • Willingness to work non-standard hours and weekends as needed
  • STEM degree, preferably in computer science, software engineering, electrical engineering, information technology, or similar
  • 2+ years of experience in technical support, IT, or software

What the JD emphasized

  • root cause
  • diagnose
  • diagnostics
  • failure data
  • technical system