Research Scientist II - Amz9698004

Amazon · Big Tech · Seattle, WA · Corporate Operations

Research Scientist II role focused on psychometric aspects of exam development and operations, with a requirement to use ML, NLP, and GenAI for research and automation. The role involves developing and applying statistical and psychometric modeling to ensure exam validity and reliability, and automating psychometric workflows using R or Python. Experience with large-scale assessment programs and complex test designs is required.

What you'd actually do

  1. Perform and support the main psychometric aspects of exam development and operations, including but not limited to automated test assembly, item and test analyses, optimal item bank design, job task analysis, standard setting, quality assurance, and project planning.
  2. Conduct the main aspects of operational psychometric analysis: performing item analysis using psychometric methods, building optimal test forms and pools via optimization techniques, analyzing and monitoring item bank health, setting pass standards via standard setting studies, and supporting Job Task Analysis (JTA) to define and refresh test blueprints.
  3. Develop and apply statistical and psychometric models to evaluate and ensure the validity, reliability, applicability, efficiency, and accuracy of AWS certification exams.
  4. Participate in research projects to improve existing operational processes and quality using advanced techniques such as Machine Learning (ML), statistical modeling, Natural Language Processing (NLP), and Generative Artificial Intelligence (GenAI).
  5. Develop automation code in R or Python for the psychometric workflow pipeline and other tasks to improve operational efficiency.

Skills

Required

  • Statistics
  • Psychometrics
  • Educational Measurement
  • Quantitative Psychology
  • Data Science
  • Industrial-Organizational (I/O) Psychology
  • Large-scale education, licensure, or certification assessment programs
  • Operational psychometric tasks
  • Item analysis
  • Equating and scaling
  • Item response theory
  • Classical test theory
  • Form and pool assembly
  • Item bank health analysis
  • Standard setting
  • Job task analysis
  • Linear-on-the-fly testing (LOFT)
  • Computerized adaptive testing (CAT)
  • Machine Learning (ML)
  • Natural Language Processing (NLP)
  • R
  • Python

Nice to have

  • Generative Artificial Intelligence (GenAI)

What the JD emphasized

  • PhD or foreign equivalent degree in Statistics, Psychometrics, Educational Measurement, Quantitative Psychology, Data Science, Industrial-Organizational (I/O) Psychology, or a related field
  • one year of research or work experience in the job offered, or as a Research Scientist, Research Assistant, Software Engineer, or a related occupation
  • large-scale education, licensure, or certification assessment programs
  • operational psychometric tasks on large-scale education, licensure, or certification assessment programs including item analysis, equating and scaling, item response theory, classical test theory, form and pool assembly, item bank health analysis, standard setting, and job task analysis
  • at least one complex test design, such as linear-on-the-fly testing (LOFT) or computerized adaptive testing (CAT)
  • at least one of the following areas: machine learning (ML) or natural language processing (NLP)
  • Programming skills in at least one script-based programming language (R, Python)

Other signals

  • Participate in research projects to improve existing operational processes and quality using advanced techniques such as Machine Learning (ML), statistical modeling, Natural Language Processing (NLP), and Generative Artificial Intelligence (GenAI).
  • Develop automation code in R or Python for the psychometric workflow pipeline and other tasks to improve operational efficiency.