Applied Scientist Iii, Aft Ai, Amazon Aft AI

Amazon Amazon · Big Tech · DE, Belgium +1 · Applied Science

Develop agentic AI and multi-modal deep learning models for Amazon's Fulfillment network, focusing on understanding warehouse operations and visual defect detection. This role involves working with large, diverse datasets and applying cutting-edge AI techniques to solve complex, real-world problems at scale, with a strong emphasis on production deployment and iterative improvement.

What you'd actually do

  1. build agentic AI solutions and multi-modal deep learning models that understand how products and packages flowing through Amazon’s fulfillment network
  2. build models that solve challenging problems like understanding warehouse operations systems, or visual defect detection on Amazon's entire retail catalog (billions of different items, thousands of new items every day)
  3. work with a diverse set of very large multi-modal real-world datasets, including imagery, natural language and structured data
  4. face a high level of research ambiguity and problems that require creative, ambitious, and inventive solutions
  5. adapt state-of-the-art agentic AI, deep learning, language understanding and computer vision techniques to develop solutions for business problems in the Amazon Fulfillment Network

Skills

Required

  • Python
  • C++
  • PyTorch
  • Pandas
  • NumPy
  • scikit-learn
  • Hugging Face Transformers
  • transformers
  • diffusion models
  • neural architecture search
  • self-supervised learning
  • distributed training
  • mixed precision
  • gradient accumulation
  • DeepSpeed
  • FSDP
  • Megatron-LM
  • quantization
  • pruning
  • distillation
  • large language models (GPT, LLaMA, Claude)
  • vision-language models (CLIP, LLaVA, Qwen)
  • agentic AI systems
  • LangChain
  • Strands
  • multi-agent workflows
  • tool-augmented reasoning systems
  • RAG systems
  • chain-of-thought
  • few-shot
  • RLHF
  • DPO
  • computer vision
  • object detection
  • segmentation
  • 3D vision
  • depth estimation
  • point cloud processing
  • natural language processing
  • text generation
  • information extraction
  • multimodal learning
  • model serving infrastructure
  • A/B testing frameworks
  • feature stores
  • MLOps
  • annotation pipeline design
  • active learning pipelines
  • AutoML
  • hyperparameter optimization

Nice to have

  • diffusion models for image/video synthesis
  • autoregressive models for multimodal generation
  • compositional generation systems
  • controllable generation
  • style transfer
  • neural rendering techniques
  • model interpretability and explainability methods
  • attention visualization
  • feature attribution
  • interpretable AI systems
  • few-shot learning
  • meta-learning
  • continual learning
  • domain adaptation
  • models that generalize across distribution shifts
  • long-tail scenarios
  • adapt to new tasks with minimal data

What the JD emphasized

  • production-quality code standards
  • production deployment
  • production experience
  • production environments
  • production experience
  • production experience

Other signals

  • develop agentic AI solutions
  • multi-modal deep learning models
  • apply state-of-the-art AI on real-world problems at scale
  • build and deploy models
  • pushing beyond the state of the art