Principal Product Manager - AI Data Quality

F5 · Enterprise · Seattle, WA

F5 seeks a Principal Product Manager to architect and scale an AI-Ready Data Quality Platform built on Databricks and Unity Catalog. The role defines and ships platform capabilities for an AI Data Fabric, including real-time anomaly detection, schema enforcement, and automated data contract validation. It treats data quality the way SRE treats uptime, drives data ownership as a product discipline, and aligns catalog metadata with AI feature stores and inference pipelines.

What you'd actually do

  1. Define and ship native data quality capabilities inside Databricks Lakehouse
  2. Design real-time anomaly detection systems (statistical + ML-driven)
  3. Establish a data product ownership model across service teams
  4. Define how governed datasets become AI-ready assets

Skills

Required

  • 5+ years in Product Management for Data Platforms, Analytics, or AI Infrastructure
  • Databricks Lakehouse architecture
  • Unity Catalog governance constructs
  • dbt transformation workflows
  • CI/CD patterns for data pipelines
  • Data observability and monitoring patterns
  • Strong SQL fluency
  • Comfort reading Python/Scala data pipeline code
  • Experience defining data contracts and schema evolution strategies
  • Understanding of streaming frameworks (Kafka, Spark Structured Streaming, etc.)
  • Experience supporting AI/ML workloads in production environments

Nice to have

  • Experience with modern data observability platforms (Monte Carlo, Bigeye, etc.)
  • Familiarity with feature stores and model lifecycle tooling
  • Knowledge of domain-oriented data mesh architectures

What the JD emphasized

  • AI-native enterprise
  • AI-Ready Data Quality Platform
  • AI Data Fabric
  • production-grade data for AI use cases
  • data quality like SRE treats uptime
  • data product ownership model
  • governed datasets become AI-ready assets
  • AI systems are only as good as the data feeding them

Other signals

  • support model reproducibility and dataset versioning