Applied Scientist, Artificial General Intelligence

Amazon Amazon · Big Tech · Boston, MA · Applied Science

The Applied Scientist will develop and maintain LLM-as-a-Judge systems and auditing frameworks to ensure the quality of data used for training and evaluating Amazon Nova models, impacting LLM products and services.

What you'd actually do

  1. lead the development of comprehensive quality strategies and auditing frameworks that safeguard the integrity of data collection workflows.
  2. perform expert-level manual audits, conduct meta-audits to evaluate auditor performance, and provide targeted coaching to uplift overall quality capabilities.
  3. developing and maintaining LLM-as-a-Judge systems, including designing judge architectures, creating evaluation rubrics, and building machine learning models for automated quality assessment.
  4. set up the configuration of data collection workflows and communicate quality feedback to stakeholders.
  5. have a direct impact on enhancing customer experiences through high-quality training and evaluation data that powers state-of-the-art LLM products and services.

Skills

Required

  • machine learning
  • statistics
  • quality assurance
  • auditing methodologies
  • automated evaluation systems
  • programming in Java, C++, Python or related language
  • SQL and an RDBMS (e.g., Oracle) or Data Warehouse

Nice to have

  • implementing algorithms using both toolkits and self-developed code
  • publications at top-tier peer-reviewed conferences or journals

What the JD emphasized

  • ensure the highest standards of data quality
  • safeguard the integrity of data collection workflows
  • expert-level manual audits
  • evaluate auditor performance
  • automated quality assessment
  • high-quality training and evaluation data

Other signals

  • develops and maintains LLM-as-a-Judge systems
  • designs judge architectures
  • builds machine learning models for automated quality assessment
  • develops comprehensive quality strategies and auditing frameworks
  • designs auditing strategies with detailed SOPs, quality metrics, and sampling methodologies