Applied Scientist - Agentic AI, Amazon Fulfillment Technology

Amazon · Big Tech · Bellevue, WA · Machine Learning Science

This role focuses on developing and researching agentic AI systems for operational decision-making and orchestration within Amazon's fulfillment network. It involves building full agentic systems using multi-agent orchestration, tool use, memory, and action execution, training LLMs through various methods including RL, and conducting rigorous evaluations. The role also includes leading research projects, mentoring, and publishing academic papers.

What you'd actually do

  1. Generating training and preference data for specific use cases (reasoning trajectories, tool traces)
  2. Reward modeling and policy optimization for LLMs: DPO, IPO, RLHF/RLAIF with PPO/GRPO, rejection sampling
  3. Supervised fine-tuning on step-by-step trajectories and tool-use traces
  4. Verbal Reinforcement Learning and Continual Learning
  5. RL for LLMs, offline RL, and off-policy evaluation

Skills

Required

  • Experience building models for business applications
  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience programming in Java, C++, Python or related language
  • Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing

Nice to have

  • Experience using Unix/Linux
  • Experience in professional software development

What the JD emphasized

  • Experience building models for business applications
  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience programming in Java, C++, Python or related language
  • Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing

Other signals

  • agentic systems
  • multi-agent orchestration
  • tool use
  • LLM training
  • offline and online evaluations
  • agentic reasoning
  • coding and analytics
  • research projects
  • academic papers