Senior Applied Scientist, Shopping Core Foundations - Buyforme

Amazon · Big Tech · Seattle, WA · Applied Science

This role focuses on building and researching autonomous AI agents for online shopping, operating on the open web. It involves LLMs, reinforcement learning, multimodal reasoning, and large-scale systems, with a focus on production-grade reliability, scalability, and safety. The scientist will design evaluation systems, develop agent planning and adaptation techniques, build multimodal reasoning systems, and lead scientific direction for agent reliability and customer trust.

What you'd actually do

  1. Design scalable evaluation and benchmarking systems for autonomous agents operating in dynamic web environments.
  2. Develop techniques for robust agent planning, error recovery, and adaptation under distribution shift.
  3. Build multimodal AI systems that reason over screenshots, DOM structures, user intent, and interaction trajectories.
  4. Lead scientific direction for agent reliability, task completion, and customer trust.
  5. Mentor scientists and engineers on advanced AI methodologies and experimentation.

Skills

Required

  • Experience building machine learning models for business applications
  • PhD, or Master's degree and 6+ years of applied research experience
  • Experience programming in Java, C++, Python or related language
  • Experience with deep neural network methods and machine learning

Nice to have

  • Experience with modeling tools such as R, scikit-learn, Spark MLlib, MXNet, TensorFlow, NumPy, SciPy, etc.
  • Experience with large-scale distributed systems such as Hadoop and Spark

What the JD emphasized

  • production-grade reliability, scalability, and safety
  • autonomous AI agents
  • open-world environments rather than static benchmarks
  • generalize across thousands of constantly evolving third-party websites
  • scalable evaluation and benchmarking systems
  • robust agent planning, error recovery, and adaptation under distribution shift
  • multimodal AI systems that reason over screenshots, DOM structures, user intent, and interaction trajectories
  • agent reliability, task completion, and customer trust

Other signals

  • autonomous AI agents
  • LLMs
  • reinforcement learning
  • multimodal reasoning
  • web agents
  • autonomous purchasing systems