Data Scientist Ii, Long Term Planning and Forecasting

Amazon Amazon · Big Tech · Bellevue, WA · Data Science

This role focuses on developing causal inference models, automated explainability frameworks, and GenAI-powered narrative generation to translate forecasting outputs into actionable business intelligence for Amazon's business customers. The data scientist will build automated variance decomposition models and a causal model library with standardized pipelines, applying techniques from causal inference and time-series econometrics.

What you'd actually do

  1. You will develop causal inference models, automated explainability frameworks, and variance bridging methodologies that translate LTPF's forecasts and plans into actionable business intelligence.
  2. You will build automated Plan-vs-Actual and Actual-vs-Actual variance decomposition models that quantify the contribution of individual demand drivers to observed gaps across revenue, price, units, inventory, and capacity metrics at multiple granularities to serve audiences from working-level analysts to VP-level planning reviews cycles.
  3. You will build and maintain a causal model library with standardized hypothesis generation and validation pipelines, applying techniques from causal inference, time-series econometrics, and Bayesian methods.
  4. You will develop GenAI-powered narrative generation capabilities that synthesize quantitative variance outputs into human-readable performance summaries and design automated hypothesis ranking to determine which demand drivers are most responsible for observed forecast error.

Skills

Required

  • 2+ years of data scientist experience
  • 3+ years of data querying languages (e.g. SQL), scripting languages (e.g. Python) or statistical/mathematical software (e.g. R, SAS, Matlab, etc.) experience
  • 3+ years of machine learning/statistical modeling data analysis tools and techniques, and parameters that affect their performance experience
  • 1+ years of guiding and coaching a group of researchers experience
  • 1+ years of working with or evaluating AI systems experience
  • 1+ years of creating or contributing to mathematical textbooks, research papers, or educational content experience
  • Master's degree in Science, Technology, Engineering, or Mathematics (STEM), or experience working in Science, Technology, Engineering, or Mathematics (STEM)
  • Experience applying theoretical models in an applied environment

Nice to have

  • Ph.D. in Science, Technology, Engineering, or Mathematics (STEM)
  • Knowledge of machine learning concepts and their application to reasoning and problem-solving
  • Experience in Python, Perl, or another scripting language
  • Experience in defining and creating benchmarks for assessing GenAI model performance
  • Experience effectively communicating complex concepts through written and verbal communication
  • Experience in forecasting analyses

What the JD emphasized

  • drive scientific tooling
  • multi-year roadmap
  • customer engagement
  • seamlessly access, understand, and act upon our forecasting outputs
  • manage the lifecycle of complex, cross-functional programs
  • architect the customer interaction experience
  • viewing capabilities, auditing tools, what-if analysis frameworks, and forecast intervention workflows
  • Leading large, cross-functional planning and strategy workstreams
  • Defining multi-year program vision and strategy
  • Prioritizing operational excellence work alongside feature delivery
  • Driving organizational alignment across multiple teams and stakeholders
  • develop causal inference models
  • automated explainability frameworks
  • variance bridging methodologies
  • build automated Plan-vs-Actual and Actual-vs-Actual variance decomposition models
  • build and maintain a causal model library
  • standardized hypothesis generation and validation pipelines
  • develop GenAI-powered narrative generation capabilities
  • automated hypothesis ranking
  • creating or contributing to mathematical textbooks, research papers, or educational content
  • Experience applying theoretical models in an applied environment
  • defining and creating benchmarks for assessing GenAI model performance

Other signals

  • develop causal inference models
  • automated explainability frameworks
  • variance bridging methodologies
  • GenAI-powered narrative generation
  • automated hypothesis ranking