Sr Software Engineer - AV Data Quality, AV Labs

Uber · Consumer · Sunnyvale, CA · Engineering

This role focuses on building the data integrity layer for an L4 autonomous driving data platform, validating and safeguarding the quality of multi-modal sensor streams. It bridges raw robotics telemetry and downstream machine learning, ensuring systems learn from ground truth.

What you'd actually do

  1. Design systems architecture, including management of upstream and downstream dependencies.
  2. Lead the systems architecture for end-to-end data validation, managing complex dependencies between raw sensor streams (LiDAR, Camera, IMU, etc.) and downstream ML training environments.
  3. Architect scalable solutions that detect data issues, so the platform holds up under petabyte-scale availability demands.
  4. Partner with Perception, Hardware, Middleware, and Infra Engineering teams to define data quality standards and integrate the latest AI techniques for automated data remediation.
  5. Participate in periodic on-call rotations and be available for critical issues.

Skills

Required

  • Python/C++
  • Linux
  • batch cloud computing technologies
  • Systems architecture design
  • end-to-end data validation
  • data quality standards

Nice to have

  • Ray
  • Spark
  • handling multi-modal autonomous driving sensor data
  • Data/AI/ML platform

What the JD emphasized

  • state-of-the-art AV data platform
  • petabyte-scale availability
  • multi-modal autonomous driving sensor data

Other signals

  • data quality
  • ML training data
  • autonomous driving
  • multi-modal sensor streams