Applied Scientist Ii, Amazon Travel & Events

Amazon Amazon · Big Tech · IN, KA, Bengaluru · Applied Science

Applied Scientist II role focused on building AI-driven solutions for Amazon Travel & Events, leveraging Generative AI, LLMs, NLU, conversational AI, and Applied ML. Responsibilities include designing, developing, and evaluating ML models using GenAI, multimodal reasoning, and information retrieval for catalog understanding, applying VLMs and LLM-based approaches with fine-tuning and RAG, implementing model optimization techniques for efficiency, driving experiments, building ML pipelines, contributing to model reliability through interpretability and calibration, and collaborating with teams to translate business requirements into ML solutions. The role also involves staying current with research and co-authoring publications.

What you'd actually do

  1. Design, develop, and evaluate ML models leveraging GenAI, multimodal reasoning, and large-scale information retrieval to solve well-defined catalog understanding challenges such as product identity and relationship inference
  2. Apply and adapt VLMs, foundation models, and LLM-based approaches to address product catalog problems—experimenting with fine-tuning, prompt engineering, and retrieval-augmented generation techniques
  3. Implement model optimization techniques—including distillation, quantization, and serving optimizations—to improve latency, cost, and efficiency of deployed models under guidance from senior scientists
  4. Drive the design and execution of rigorous experiments and ablation studies on large-scale datasets, delivering results with statistical rigor and clear recommendations to the team
  5. Build and iterate on ML pipelines from prototyping through production deployment, writing clean, well-tested, production-quality code

Skills

Required

  • building machine learning models or developing algorithms for business application experience
  • PhD, or Master's degree and 3+ years of CS, CE, ML or related field experience
  • developing and implementing deep learning algorithms, particularly with respect to computer vision algorithms
  • solving business problems through machine learning, data mining and statistical algorithms
  • algorithms and data structures
  • parsing
  • numerical optimization
  • data mining
  • parallel and distributed computing
  • high-performance computing

Nice to have

  • LLMs, VLMs, or foundation models—including fine-tuning, prompt engineering, or retrieval-augmented generation
  • travel and events domain—including corporate travel management, booking systems, supplier negotiations, travel expense and compliance analytics
  • model optimization techniques such as distillation, quantization, or efficient inference strategies
  • large-scale datasets and distributed computing frameworks (Spark, Ray, or equivalent)
  • multimodal learning (text + image) or computer vision techniques
  • explainable AI, model interpretability, or uncertainty quantification
  • Publications in ML/AI conferences or journals (NeurIPS, ICML, ICLR, ACL, CVPR, etc.) are a plus
  • Strong experimental design skills and statistical analysis expertise
  • Excellent written and verbal communication skills

What the JD emphasized

  • building machine learning models or developing algorithms for business application experience
  • deep learning algorithms
  • computer vision algorithms
  • solving business problems through machine learning, data mining and statistical algorithms
  • LLMs
  • VLMs
  • foundation models
  • fine-tuning
  • prompt engineering
  • retrieval-augmented generation
  • distillation
  • quantization
  • serving optimizations
  • multimodal learning (text + image)
  • computer vision techniques
  • explainable AI
  • model interpretability
  • uncertainty quantification

Other signals

  • Generative AI
  • LLMs
  • NLU
  • conversational AI
  • Applied Machine Learning
  • VLMs
  • foundation models
  • multimodal reasoning
  • information retrieval
  • fine-tuning
  • prompt engineering
  • retrieval-augmented generation
  • distillation
  • quantization
  • serving optimizations
  • uncertainty calibration
  • confidence estimation
  • interpretability