Senior Site Reliability Engineer

Apptronik Apptronik · Robotics · HQ · Software Engineering

Seeking an experienced Site Reliability Engineer to own and maintain the deployment of cloud-based infrastructure to customer sites, ensuring smooth deployment of solutions that collect training data and deploy models to real-time robotic systems. This role will accelerate progress integrating AI models to humanoid robot hardware.

What you'd actually do

  1. Partnering with customers and Applications Engineers to remove roadblocks to deployment success
  2. Writing and fixing Infrastructure as Code (Terraform, Helm, Ansible)
  3. Developing and maintaining code in Python / Typescript
  4. Troubleshooting networking, performance, and security challenges
  5. Collaborating with engineering and product teams to shape improvements

Skills

Required

  • Infrastructure as Code tools (Terraform, Helm, Ansible)
  • Kubernetes
  • Python
  • Typescript
  • Linux systems engineering
  • networking experience
  • customer empathy
  • flexibility to adapt

Nice to have

  • C++
  • Grafana
  • PagerDuty
  • travel

What the JD emphasized

  • own and maintain the deployment
  • real-time robotic systems
  • humanoid robot hardware