Applied Scientist II, AWS Just Walk Out Science Team

Amazon · Big Tech · Seattle, WA · Applied Science

This role focuses on developing and implementing advanced visual reasoning systems and autonomous AI agents that understand complex spatial relationships, object interactions, and customer behavior patterns in real-time retail environments. It involves working at the intersection of computer vision and large language models to advance state-of-the-art visual AI.

What you'd actually do

  1. Help solve a variety of technical challenges and mentor other scientists
  2. Tackle challenging, novel situations every day
  3. Develop and implement advanced visual reasoning systems that can understand complex spatial relationships and object interactions in real time
  4. Design autonomous AI agents that can make intelligent decisions based on visual inputs, understand customer behavior patterns, and adapt to dynamic retail environments
  5. Develop systems that can perform complex scene understanding, reason about object permanence, and predict customer intentions through visual cues

Skills

Required

  • Experience building models for business applications
  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience in patents or publications at top-tier peer-reviewed conferences or journals
  • Experience programming in Java, C++, Python or related language
  • Experience in any of the following areas: algorithms and data structures, parsing, numerical optimization, data mining, parallel and distributed computing, high-performance computing

Nice to have

  • Experience using Unix/Linux
  • Experience in professional software development

What the JD emphasized

  • Experience building models for business applications
  • PhD, or Master's degree and 4+ years of CS, CE, ML or related field experience
  • Experience in patents or publications at top-tier peer-reviewed conferences or journals

Other signals

  • visual reasoning
  • generative AI
  • agentic frameworks
  • computer vision
  • large language models