Expert Consultant- Stem (contract), Agi - Data Services

Amazon Amazon · Big Tech · Bellevue, WA · Research Science

This role focuses on providing strategic oversight and mentorship for human-in-the-loop and model-in-the-loop data pipelines, ensuring data quality for LLM training and evaluations. The consultant will collaborate with AI teams, optimize data collection, and develop benchmarks for GenAI model performance.

What you'd actually do

  1. Serve as a trusted domain advisor to cross-functional teams, providing strategic direction and specialized problem-solving support
  2. Champion domain knowledge sharing across multiple channels and teams to maintain data quality excellence and standardization
  3. Drive collaborative efforts with science teams to optimize output of complex data collections in your domain expertise, ensuring data excellence through iterative feedback loops
  4. Foster team excellence through mentorship and motivation of peers and junior team members
  5. Make informed decisions on behalf of our customers, ensuring that selected code meets industry standards, best practices, and specific client needs

Skills

Required

  • Guiding and coaching researchers
  • Working with or evaluating AI systems
  • Creating mathematical textbooks, research papers, or educational content
  • STEM background (Master's degree or equivalent experience)
  • Domain expertise

Nice to have

  • Ph.D. in STEM
  • Machine learning concepts
  • Defining and creating benchmarks for GenAI model performance
  • Multi-team, cross-disciplinary projects
  • Analytical and decision-making skills
  • Excellent written and verbal communication skills

What the JD emphasized

  • 1+ years of guiding and coaching a group of researchers experience
  • 1+ years of working with or evaluating AI systems experience
  • 1+ years of creating or contributing to mathematical textbooks, research papers, or educational content experience
  • Master's degree in Science, Technology, Engineering, or Mathematics (STEM), or experience working in Science, Technology, Engineering, or Mathematics (STEM)
  • Knowledge of machine learning concepts and their application to reasoning and problem-solving
  • 1+ years of experience in defining and creating benchmarks for assessing GenAI model performance

Other signals

  • human-in-the-loop
  • model-in-the-loop
  • data pipelines
  • LLM training
  • evaluations