Senior Site Reliability Engineer, Space

Anduril · Defense · Costa Mesa, CA · AFS : Space Engineering

Senior Site Reliability Engineer for Anduril's Space team, focusing on expanding AI-powered capabilities into space. The role involves re-imagining the infrastructure pipeline for a scalable deployment solution, from software testing to hardware rollout across a fleet of Kubernetes nodes. Responsibilities include developing CI/CD and developer capabilities, collaborating with software developers to identify bottlenecks, and working with cloud services, Linux systems, and networking fundamentals.

What you'd actually do

  1. re-imagine the infrastructure pipeline required to convert a nascent rollout process into a hardened test and release pipeline
  2. develop a through line, from pure software testing to hardware, that is pushed from unclassified to classified spaces and rolled out to a fleet of Kubernetes nodes
  3. wear the hat of a product manager: speak with software developers to discern bottlenecks in the development process and accelerate their velocity
  4. develop modular CI/CD and developer capabilities to support newly formed programs in the Space organization

Skills

Required

  • cloud services like AWS, Azure, or Google Cloud Platform
  • configuring Linux-based systems
  • networking fundamentals
  • communication skills
  • collaboration across cross-functional teams
  • intuition for finding solutions to complex problems that involve multiple first- and third-party technologies (related to simulation, data management, compute infrastructure, and networking)
  • Strong engineering background from industry or school, ideally in areas/fields such as Robotics, Computer Science, Software Engineering, Mechatronics, Electrical Engineering, Mathematics, or Physics
  • Eligible to obtain and maintain an active U.S. security clearance

Nice to have

  • deploying AWS environments using EC2, VPC, S3, ECS/EKS, CloudWatch, AWS Config, IAM, Load Balancers, and Cost Management
  • Go, Rust, C++, Python
  • hardware integration testing
  • Track record of working with customers to deliver novel software capabilities
  • network topology design, IT network engineering, and embedded device security
  • Nix, NixOS, nixpkgs

What the JD emphasized

  • Eligible to obtain and maintain an active U.S. Secret security clearance