Applied Scientist, Demand Forecasting

Amazon Amazon · Big Tech · Bellevue, WA · Data Science

Research scientist role focused on designing and building large-scale foundation models for time series demand forecasting. The role involves developing novel architectures, training strategies, and data generation techniques, with a strong emphasis on both scientific research (publications) and production deployment impacting millions of dollars in automated decisions. Experience with transfer learning, zero-shot forecasting, and synthetic data generation is key.

What you'd actually do

  1. Design and implement novel deep learning architectures (e.g., Transformers, SSMs, or Graph Neural Networks) for time-series foundation models that generalize across hundreds of millions of products and diverse global contexts.
  2. Drive the full development cycle - from whiteboarding new algorithmic approaches to overseeing production-scale deployments.
  3. Collaborate with SDEs to build high-performance, distributed training and inference pipelines; translate complex scientific concepts into scalable, production-grade code in Python and Scala.
  4. Leverage and develop agentic GenAI workflows to automate the end-to-end research cycle from synthesizing state-of-the-art literature and auto-generating experimental code to rapidly iterating on model architectures across millions of products.
  5. Maintain a high bar for scientific excellence by publishing novel research in top-tier venues (e.g., NeurIPS, ICLR, KDD) and contributing to Amazon’s internal patent and science community.

Skills

Required

  • PhD, or Master's degree and 3+ years of deep learning, computer vision, human robotic interaction, algorithms implementation experience
  • 3+ years of building models for business application experience
  • Experience programming in Java, C++, Python or related language

Nice to have

  • PhD in computer science, machine learning, engineering, or related fields
  • Experience building complex software systems, especially involving deep learning, machine learning and computer vision, that have been successfully delivered to customers
  • Experience operating highly available, distributed systems of data extraction, ingestion, and processing of large data sets, or experience with training and deploying machine learning systems to solve large-scale optimizations
  • Strong publication record in top-tier AI/ML conferences (e.g., NeurIPS, ICLR, ICML, KDD, CVPR) or a history of contributing novel algorithmic improvements to production-scale systems.
  • Fluency in Python.

What the JD emphasized

  • novel architectures
  • foundation models
  • time series research
  • large-scale
  • production deployment
  • publish your work at top-tier conferences
  • scientific excellence
  • novel research

Other signals

  • foundation models
  • time series
  • large-scale
  • novel architectures
  • transfer learning
  • zero-shot forecasting
  • synthetic data generation
  • production deployment
  • scientific publications