Latin America (alac) Aiml Data Quality & Governance Scientist

Apple Apple · Big Tech · Sao Paulo, Sao Paulo, Brazil · Sales and Business Development

This role focuses on building and evolving enterprise knowledge graphs to serve as a foundation for AI agents and analytical tools for sales teams. It involves capturing business knowledge, designing graph schemas, implementing data quality and governance policies, and integrating unstructured data. The role also includes designing retrieval architectures and evaluation pipelines for AI agent consumption of this knowledge.

What you'd actually do

  1. Engage deeply with business stakeholders across sales, channel operations, and analytics —in close collaboration with the AI Business Champion— to extract domain expertise and bridge how the business thinks about its data with how AI systems need that knowledge structured
  2. Design, build, and continuously evolve enterprise knowledge graphs that represent ALAC's business entities, data assets, metrics, channels, and processes in a machine-readable, AI-consumable form — maintaining backward compatibility and communicating schema changes to downstream consumers
  3. Build and operationalize data quality monitoring pipelines — defining rules, anomaly detection, and drift analysis to surface issues before they reach AI outputs or sales-facing tools
  4. Propose and design ingestion pipelines for unstructured content — market reports, channel briefings, operational documents — making them first-class citizens in the knowledge graph
  5. Design retrieval architectures — including vector search and graph traversal — aligned to the CoE Lead's solution architecture, and build evaluation pipelines to measure how accurately AI agents consume ALAC-specific knowledge

Skills

Required

  • Data Science
  • Knowledge Engineering
  • Extracting complex business concepts
  • Translating concepts into formal ontologies, schemas, or knowledge graph structures
  • Designing and building knowledge graphs in a production environment
  • Graph query languages (Cypher, SPARQL, Gremlin)
  • Semantic standards (RDF, OWL, SHACL)
  • Metadata management tools (e.g., Alation, Collibra, DataHub)
  • Data quality
  • Designing and building data pipelines
  • Designing ingestion architectures
  • Embedding models
  • Entity extraction
  • Classification techniques
  • Designing retrieval architectures
  • Vector search
  • Graph traversal
  • Building evaluation pipelines

Nice to have

  • Discussing sales channel dynamics
  • Discussing business process logic
  • Enterprise Knowledge Graphs
  • AI agents
  • Analytical tools
  • Decision-support systems
  • Sales teams
  • Interconnected enterprise knowledge ecosystem
  • Partner/channel/third-party data sources
  • Unstructured data
  • Structured data
  • AI Business Process Champion
  • Global AI platform and engineering teams
  • Sales user experience
  • Cross-functional and global collaboration
  • Analytics
  • AI platforms
  • Technology infrastructure
  • Sales operations
  • Channel management
  • Regional and global stakeholders

What the JD emphasized

  • core work will be to listen deeply to business experts, extract and formalize what they know, and encode it into Enterprise Knowledge Graphs
  • knowledge graphs you build must fit coherently into a broader, interconnected enterprise knowledge ecosystem
  • definitions, ontologies, and data models must be consistent with and linkable to graphs owned by partner teams
  • lead data quality and governance efforts
  • design ingestion architectures for unstructured, structured, and partner/channel/third-party data sources
  • formal ontologies and knowledge graph schemas
  • enterprise knowledge graphs
  • data quality monitoring pipelines
  • data governance policies and procedures
  • ingestion pipelines for unstructured content
  • embedding models, entity extraction, and classification techniques
  • retrieval architectures — including vector search and graph traversal
  • build evaluation pipelines to measure how accurately AI agents consume ALAC-specific knowledge
  • 6+ years in Data Science, Knowledge Engineering, or a related field — with demonstrated experience extracting complex business concepts and translating them into formal ontologies, schemas, or knowledge graph structures
  • Hands-on experience designing and building knowledge graphs (e.g., Neo4j, GraphDB, Amazon Neptune) in a production environment
  • Proficiency in graph query languages (Cypher, SPARQL, Gremlin), semantic standards (RDF, OWL, SHACL), and metadata management tools (e.g., Alation, Collibra, DataHub)
  • Strong experience with data quality

Other signals

  • Enterprise Knowledge Graphs
  • AI agents
  • Data Quality
  • Governance
  • Ontologies
  • Schema design
  • Embedding models
  • Entity extraction
  • Classification
  • Vector search
  • Graph traversal
  • Evaluation pipelines