Applied Scientist

Amazon Amazon · Big Tech · Seattle, WA · Applied Science

Research scientist role focused on applying Generative AI, VLMs, and multimodal reasoning to product catalog understanding and agentic shopping experiences. The role involves formulating research problems, pushing boundaries of foundation models, advancing efficient model deployment, and ensuring reliability through interpretability and uncertainty calibration. It spans the full research lifecycle from problem formulation to production deployment, with a strong emphasis on publishing findings and mentoring.

What you'd actually do

  1. Formulate open research problems at the intersection of GenAI, multimodal reasoning, and large-scale information retrieval—defining the scientific questions that transform ambiguous, real-world catalog challenges into publishable, high-impact research
  2. Push the boundaries of VLMs, foundation models, and agentic architectures by designing novel approaches to product identity, relationship inference, and catalog understanding—where the problem complexity (billions of products, multimodal signals, inherent ambiguity) demands methods that don't yet exist
  3. Advance the science of efficient model deployment—developing distillation, compression, and LLM/VLM serving optimization strategies that preserve frontier-level multimodal reasoning in compact, production-grade architectures while dramatically reducing latency, cost, and infrastructure footprint at billion-product scale
  4. Make frontier models reliable—advancing uncertainty calibration, confidence estimation, and interpretability methods so that frontier-scale GenAI systems can be trusted for autonomous catalog decisions impacting millions of customers daily
  5. Own the full research lifecycle from problem formulation through production deployment—designing rigorous experiments over petabytes of multimodal data, iterating on ideas rapidly, and seeing your research directly improve the shopping experience for hundreds of millions of customers

Skills

Required

  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience programming in Java, C++, Python or related language
  • 2+ years of building machine learning models or developing algorithms for business application experience

Nice to have

  • Experience with LLMs, VLMs, foundation models, or large-scale deep learning systems—including multimodal pretraining, fine-tuning, RLHF, prompt engineering, or agentic architectures
  • Experience with LLM/VLM serving optimization, including model distillation, quantization, pruning, speculative decoding, or other model compression and efficient inference techniques
  • Experience with explainable AI, model interpretability, or uncertainty quantification
  • Strong experimental design skills and statistical analysis expertise
  • Track record of deploying ML models at scale in production environments processing billions of data points
  • Publications in top-tier venues such as NeurIPS, ICML, ICLR, CVPR, ICCV, EMNLP, ACL, NAACL, COLING, KDD, SIGMOD, WWW, AAAI, or similar
  • Excellent written, verbal communication & data presentation skills

What the JD emphasized

  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience programming in Java, C++, Python or related language
  • 2+ years of building machine learning models or developing algorithms for business application experience
  • Experience with LLMs, VLMs, foundation models, or large-scale deep learning systems—including multimodal pretraining, fine-tuning, RLHF, prompt engineering, or agentic architectures
  • Experience with LLM/VLM serving optimization, including model distillation, quantization, pruning, speculative decoding, or other model compression and efficient inference techniques
  • Experience with explainable AI, model interpretability, or uncertainty quantification
  • Track record of deploying ML models at scale in production environments processing billions of data points
  • Publications in top-tier venues such as NeurIPS, ICML, ICLR, CVPR, ICCV, EMNLP, ACL, NAACL, COLING, KDD, SIGMOD, WWW, AAAI, or similar

Other signals

  • Generative AI
  • VLMs
  • multimodal reasoning
  • agentic shopping experiences
  • large-scale information retrieval
  • efficient model deployment
  • uncertainty calibration
  • explainable AI
  • model interpretability