Technical Lead Manager, Autonomy Evaluation and Intelligence

Nuro Nuro · Robotics · CA · Autonomy

Lead the technical roadmap for systems that validate and verify the intelligence of Nuro's self-driving car technology. This involves developing evaluation tooling, designing intelligent agents and metrics to measure ML model performance, and closing the learning loop by identifying and refining behavioral weaknesses. The role requires defining the technical architecture for unified, automated, data-driven validation.

What you'd actually do

  1. Design and deploy "smart" evaluation agents that simulate complex interactions,
  2. Design and deploy advanced algorithms that measure the cognitive performance of the ML models powering our self-driving car’s behavior, utilizing physics-based modeling and machine learning.
  3. Leverage these evaluation frameworks to identify specific behavioral weaknesses in the NuroDriver, driving technical execution to refine the agent's decision-making logic.
  4. Work cross-functionally with Autonomy and Infrastructure engineers to set a roadmap that unifies evaluation frameworks, moving us toward automated, data-driven validation.

Skills

Required

  • AI/ML
  • robotics
  • large scale evaluation systems
  • evaluate complex trade-offs
  • exceptional technical judgment
  • set a clear vision
  • earn the trust of a talented organization
  • aligning the team behind a unified technical strategy
  • inherent sense of urgency
  • fast, high-quality technical execution
  • hiring
  • organizing
  • fostering a world-class engineering culture
  • inspiring team members to grow their skills
  • work closely with cross functional teams
  • manage dependencies
  • ensure the Autonomy team has the compute and infrastructure needed to succeed

Nice to have

  • BS/MS/PhD in Computer Science, Robotics, Statistics, Physics, Math, or a related quantitative field
  • 7+ years of industry expertise
  • proven track record of driving team success through proactive mentorship and cross-functional initiative
  • passion for inspiring team members to grow their skills

What the JD emphasized

  • drive the technical roadmap
  • validate and verify the intelligence
  • ensures our technology scales safely
  • real-world robotics challenges
  • Intelligent Agents
  • Intelligent Metrics
  • Close the Learning Loop
  • technical strategy
  • high-quality technical execution

Other signals

  • leading development of evaluation tooling
  • ensures technology scales safely
  • design and deploy smart evaluation agents
  • design and deploy advanced algorithms that measure cognitive performance of ML models
  • identify specific behavioral weaknesses
  • driving technical execution to refine agent's decision-making logic
  • set a roadmap that unifies evaluation frameworks
  • automated, data-driven validation