Site Reliability Engineer

Wayve · Robotics · Tokyo, Japan · Vehicle SW Engineering

Wayve, an Embodied AI company developing autonomous driving systems, is hiring a Site Reliability Engineer. The role focuses on the operational excellence, robustness, and reliability of AI-driven autonomous vehicles on public roads. Responsibilities include automating operations, developing metrics and monitoring systems, and providing out-of-hours support.

What you'd actually do

  1. Guarantee the seamless operation of our autonomous vehicles on public roads, enhancing our ability to transform urban mobility.
  2. Drive the development of cutting-edge tools and automation to streamline operations, from first-line support enhancements to fleet management.
  3. Develop key metrics and monitoring/logging systems that preempt problems and promote rapid resolution, ensuring the highest levels of performance and reliability.

Skills

Required

  • Site Reliability Engineering
  • Python
  • C++
  • Rust
  • cloud computing platforms (AWS, GCP, or Azure)
  • monitoring and troubleshooting utilities (DataDog, Prometheus, Grafana, OpenTelemetry, Splunk, Humio, etc.)
  • Linux
  • systems foundations

Nice to have

  • Azure
  • AV companies
  • embedded systems
  • ML/AI

What the JD emphasized

  • on-call rotation
  • out-of-hours support