Senior Mlops Manager

NVIDIA NVIDIA · Semiconductors · Shanghai, China

Senior MLOps Manager at NVIDIA focusing on leading data operations and ground truth (GT) production for the Automotive industry's autonomous driving stack. The role involves owning the end-to-end GT operations lifecycle, defining requirements, managing internal and external teams, monitoring operational metrics, and improving data pipelines, with potential to leverage AI agents for efficiency.

What you'd actually do

  1. Owning the GT production and delivery pipeline, ensuring datasets are delivered on time, at the right quality, and at the right scale for model training and validation.
  2. Defining GT requirements with Data Analyst and MLE teams, and translating them into clear labeling specs, QA specs, volumes, SLAs, and quality targets.
  3. Managing day-to-day GT operations across internal teams and external vendors, including task allocation, throughput tracking, and issue triage.
  4. Establishing and monitoring key operational metrics (throughput, quality, rework rate, audit pass rate, labeling cost), driving continuous improvement using data‑driven insights.
  5. Partnering with tooling and data infrastructure teams to improve labeling tools, QA workflows, and GT data pipelines, including opportunities to leverage AI agents to boost efficiency.

Skills

Required

  • BS degree in Computer Science, Engineering, Data Science, or related field (or equivalent experience)
  • 10+ years of overall professional experience in data operations, labeling operations, ML data programs, or related fields, working with large-scale data or ML-driven products.
  • 5+ years of direct leadership/management experience leading data operations, labeling teams, or cross-functional GT programs (people management and/or end-to-end program ownership with defined KPIs and SLAs).
  • Strong communication and stakeholder management skills, with experience working across engineering, ML, product, and vendor teams.
  • Solid analytical skills and comfort working with metrics and dashboards to drive decisions.

Nice to have

  • Experience leading GT or labeling operations for autonomous driving, robotics, or other complex ML domains.
  • Hands-on familiarity with annotation tools, dataset management platforms, and data quality frameworks.
  • Experience applying AI/agentic tooling to improve operations (e.g., auto-labeling, active learning, agent-assisted QA, or workflow automation).
  • Prior experience managing external labeling vendors, including setting up SLAs, quality targets, and feedback loops.

What the JD emphasized

  • 10+ years of overall professional experience
  • 5+ years of direct leadership/management experience
  • Proven track record owning end-to-end data or GT operations, with clear examples of how you improved quality, efficiency, and reliability.

Other signals

  • managing data operations for ML
  • ensuring datasets are delivered on time, at the right quality, and at the right scale for model training and validation
  • defining GT requirements with Data Analyst and MLE teams
  • managing day-to-day GT operations across internal teams and external vendors
  • establishing and monitoring key operational metrics
  • partnering with tooling and data infrastructure teams to improve labeling tools, QA workflows, and GT data pipelines
  • leading cross-functional reviews and post-mortems