Graduate 2026 Phd Software Engineer II (av Labs), United States

Uber Uber · Consumer · San Francisco, CA +1 · University

This role focuses on developing algorithms and foundation models for Physical AI in autonomous vehicles. The primary goal is to extract high-fidelity semantic meaning from complex urban edge cases to enrich L4 data lakes, optimize dataset quality for ML acceleration through advanced sensor data processing and auto-labeling, and implement scalable ML systems. The role involves collaboration with platform, product, and security engineering teams.

What you'd actually do

  1. Develop algorithms and foundation models that extract high-fidelity semantic meaning from complex urban edge cases to enrich our L4 data lake
  2. Implement scalable ML systems, including management of upstream sensor dependencies
  3. Deliver high-quality datasets to accelerate ML technologies through advanced sensor data collection, processing, and auto-labeling
  4. Partner with platform, product, and security engineering teams to enable the successful deployment of the latest machine learning techniques into production

Skills

Required

  • Completing or recently completed a PhD in Computer Science, Robotics, Machine Learning, Computer Vision, Electrical Engineering, or a related technical field
  • Python
  • PyTorch

Nice to have

  • Strong publication record in top-tier AI, ML, robotics, or computer vision conferences
  • Deep knowledge of machine learning for robotics, computer vision, or autonomous systems
  • Experience working with large-scale sensor data (e.g., camera, LiDAR) and building data pipelines for ML applications
  • Experience developing or deploying ML models in real-world or safety-critical systems
  • Familiarity with C++ and high-performance or real-time systems
  • Proven ability to translate research into production-grade systems
  • Excellent communication skills, with the ability to explain complex technical concepts to cross-functional stakeholders

What the JD emphasized

  • Physical AI
  • high-quality data
  • long-tail driving data
  • data race
  • advanced autonomy algorithms and models
  • rich semantics
  • machine learning techniques
  • complex edge cases
  • robust algorithmic solutions
  • high technical excellence bar
  • high-fidelity semantic meaning
  • complex urban edge cases
  • scalable ML systems
  • advanced sensor data collection, processing, and auto-labeling
  • latest machine learning techniques
  • large-scale sensor data
  • data pipelines for ML applications
  • deploying ML models in real-world or safety-critical systems

Other signals

  • Physical AI
  • autonomous technology ecosystem
  • unlocking real-world, long-tail driving data
  • extract high-fidelity semantic meaning from complex urban edge cases
  • accelerate ML technologies through advanced sensor data collection, processing, and auto-labeling