Data Scientist II, Long Term Planning and Forecasting

Amazon · Big Tech · Bellevue, WA · Data Science

This Data Scientist II role focuses on building scientific tooling for how business customers interact with Long-Term Planning and Forecasting (LTPF) forecasts and plans. The role involves developing causal inference models, automated explainability frameworks, and variance bridging methodologies. It also includes building GenAI-powered narrative generation capabilities and automated hypothesis ranking to synthesize quantitative variance outputs into human-readable performance summaries and identify drivers of forecast error. The position emphasizes leading cross-functional programs, defining multi-year strategy, and leveraging insights for strategic decision-making.

What you'd actually do

  1. You will develop causal inference models, automated explainability frameworks, and variance bridging methodologies that translate LTPF's forecasts and plans into actionable business intelligence.
  2. You will build automated Plan-vs-Actual and Actual-vs-Actual variance decomposition models that quantify the contribution of individual demand drivers to observed gaps across revenue, price, units, inventory, and capacity metrics. These models operate at multiple granularities to serve audiences ranging from working-level analysts to VP-level planning review cycles.
  3. You will build and maintain a causal model library with standardized hypothesis generation and validation pipelines, applying techniques from causal inference, time-series econometrics, and Bayesian methods.
  4. You will develop GenAI-powered narrative generation capabilities that synthesize quantitative variance outputs into human-readable performance summaries. You will also design automated hypothesis ranking to determine which demand drivers are most responsible for observed forecast error.
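
As a rough illustration of the Plan-vs-Actual variance bridging mentioned in item 2, the sketch below splits a revenue gap into price, volume, and interaction effects using the textbook price-volume decomposition. This is an assumption made for illustration only: the posting does not say which decomposition method the team uses, and the names `Bridge` and `revenue_bridge` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Bridge:
    """Components of a plan-vs-actual revenue gap (hypothetical structure)."""
    price_effect: float
    volume_effect: float
    interaction: float

    @property
    def total(self) -> float:
        # The three components sum exactly to actual revenue minus plan revenue.
        return self.price_effect + self.volume_effect + self.interaction


def revenue_bridge(plan_price: float, plan_units: float,
                   actual_price: float, actual_units: float) -> Bridge:
    """Textbook price-volume decomposition, assuming revenue = price * units."""
    d_price = actual_price - plan_price
    d_units = actual_units - plan_units
    return Bridge(
        price_effect=d_price * plan_units,   # price change at planned volume
        volume_effect=d_units * plan_price,  # volume change at planned price
        interaction=d_price * d_units,       # joint effect of both changes
    )


# Illustrative numbers only, not data from the posting.
bridge = revenue_bridge(plan_price=10.0, plan_units=100.0,
                        actual_price=11.0, actual_units=90.0)
print(bridge, bridge.total)
```

A production variance model would extend this to more drivers and granularities; the point here is only that the components are constructed to add up to the observed gap.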

Skills

Required

  • 2+ years of experience as a data scientist
  • 3+ years of experience with data querying languages (e.g. SQL), scripting languages (e.g. Python), or statistical/mathematical software (e.g. R, SAS, Matlab)
  • 3+ years of experience with machine learning/statistical modeling data analysis tools and techniques, and the parameters that affect their performance
  • 1+ years of experience guiding and coaching a group of researchers
  • 1+ years of experience working with or evaluating AI systems
  • 1+ years of experience creating or contributing to mathematical textbooks, research papers, or educational content
  • Master's degree in Science, Technology, Engineering, or Mathematics (STEM), or experience working in Science, Technology, Engineering, or Mathematics (STEM)
  • Experience applying theoretical models in an applied environment

Nice to have

  • Ph.D. in Science, Technology, Engineering, or Mathematics (STEM)
  • Knowledge of machine learning concepts and their application to reasoning and problem-solving
  • Experience in Python, Perl, or another scripting language
  • Experience in defining and creating benchmarks for assessing GenAI model performance
  • Experience effectively communicating complex concepts through written and verbal communication

What the JD emphasized

  • drive scientific tooling
  • customer engagement
  • seamlessly access, understand, and act upon our forecasting outputs
  • architect the customer interaction experience
  • viewing capabilities, auditing tools, what-if analysis frameworks, and forecast intervention workflows
  • causal inference models
  • automated explainability frameworks
  • variance bridging methodologies
  • Plan-vs-Actual and Actual-vs-Actual variance decomposition models
  • causal model library
  • standardized hypothesis generation and validation pipelines
  • GenAI-powered narrative generation capabilities
  • automated hypothesis ranking
  • forecast error

Other signals

  • develop causal inference models
  • automated explainability frameworks
  • variance bridging methodologies
  • GenAI-powered narrative generation capabilities
  • automated hypothesis ranking