Visiting Researcher, Fair (university Grad)

Meta Meta · Big Tech · Menlo Park, CA

Research role focused on defining and establishing foundational data quality standards, task methodologies, and validation frameworks for Computer Use Agent (CUA) data creation, upstream of AI model training. The role involves primary research to set 'gold standard' protocols for data governance and integrity.

What you'd actually do

  1. Establish Foundational Standards: Lead primary and secondary research to design, build, and implement rigorous, scalable CUA data quality standards and validation frameworks with Meta Superintelligence Labs.
  2. Develop Task Methodologies: Architect scientifically sound task creation methodologies and annotation guidelines to ensure downstream datasets are highly accurate, reproducible, and representative.
  3. Mitigate Data Risk: Conduct deep dive data integrity research to identify systemic biases or quality gaps, proactively mitigating "garbage in, garbage out" risks for AI model training.
  4. Cross Functional Leadership: Partner closely with data science, software engineering, and operational teams to translate complex research methodologies into clear, executable data pipelines.
  5. Define Metrics & Baselines: Establish statistical baselines and key quality metrics to evaluate, audit, and continuously improve CUA data health across the product lifecycle.

Skills

Required

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • Master’s or PhD degree in a quantitative or research-heavy field
  • Demonstrated experience conducting empirical research, data curation & validation, or designing complex data collection/annotation methodologies
  • Deep expertise in establishing data quality control frameworks, statistical sampling, and data governance
  • Experience designing and scaling data quality frameworks or human-in-the-loop (HITL) annotation methodologies specifically for training Machine Learning, NLP, LLM foundation models
  • Familiarity with user behavior log data, or defining complex interaction taxonomies
  • Experience solving complex problems and evaluating alternative solutions, tradeoffs, and perspectives to determine a path forward
  • Proven track record of building data governance metrics and quality baselines from scratch in an ambiguous, fast-paced environment
  • Experience translating complex, abstract research methodologies into clear, executable operational pipelines for cross-functional engineering and product partners

What the JD emphasized

  • rigorous, scalable CUA data quality standards
  • scientifically sound task creation methodologies
  • data integrity research
  • data quality control frameworks
  • data governance metrics and quality baselines from scratch

Other signals

  • architecting scientific frameworks
  • task-creation methodologies
  • quality standards
  • primary and empirical research
  • gold standard protocols