Applied Scientist / Machine Learning Engineer

Wayve Wayve · Robotics · Sunnyvale, CA · AI Platform

This role is part of the AI Platform organization, focusing on the data flywheel that powers foundation models for embodied AI in autonomous driving. The primary responsibilities involve data curation, data enrichment, and foundation model evaluation to improve model generalization and safety. The role also touches upon building and fine-tuning large-scale pretrained models, including vision-language-action and vision-language models.

What you'd actually do

  1. Mine world-scale fleet data for rare, long-tail, and safety-critical events using active learning, smart sampling, and embedding-based retrieval and dedup.
  2. Figure out what makes a good training dataset: which data, mix, and balance actually move the model, and turn that into repeatable curation across cities, sensor rigs, and embodiments.
  3. Build high-quality enrichments that teams across the company depend on, through (semi-)automated enrichment and labeling pipelines and data quality at scale.
  4. Build and fine-tune large-scale pretrained models, and run smaller-scale experiments to test and derisk ideas before committing serious compute.
  5. Design rigorous offline and closed-loop evaluation: metrics and benchmarks that correlate with real on-road behaviour and safety, with deliberate coverage of rare and safety-critical scenarios.

Skills

Required

  • Python
  • PyTorch or similar deep-learning framework
  • ML fundamentals
  • software fundamentals
  • large-scale data wrangling
  • foundation model training
  • foundation model evaluation

Nice to have

  • Autonomous driving, robotics, or other embodied-AI domains
  • Foundation models, VLMs, world models, diffusion or autoregressive generative models, or reinforcement learning and reward modeling
  • Large-scale data infrastructure: embedding and vector search (e.g. turbopuffer, Milvus), distributed data processing (Ray Data, Daft, Spark), lakehouse formats (Lance, Iceberg), or annotation tooling
  • Closed-loop or simulation-based evaluation, and safety-critical ML
  • Publications at top ML, CV, or robotics venues (NeurIPS, ICML, ICLR, CVPR, CoRL, RSS)

What the JD emphasized

  • track record of taking ML from research into production systems that run at scale
  • Hands-on strength in one or more of: data curation, foundation model training, large-scale data wrangling, and foundation-model evaluation
  • large-scale data and/or large neural networks
  • large, messy, real-world datasets

Other signals

  • data flywheel
  • foundation models
  • data engine
  • deployment scales