Data Scientist Ii, Rufusx Science UK

Amazon Amazon · Big Tech · London, United Kingdom · Data Science

This role focuses on developing and optimizing AI-driven conversational shopping experiences using ML, NLP, and multimodal technologies. The Data Scientist will work on agentic systems, information retrieval, recommender systems, and multimodal LLMs to improve customer journeys, analyze experiments, and collaborate on deploying production systems. The role involves handling large-scale data and contributing to both agent capabilities and the underlying inference infrastructure.

What you'd actually do

  1. Perform hands-on analysis and modeling of enormous multimodal datasets to develop insights into how to best help customers throughout their shopping journeys.
  2. Use statistical methods, machine learning, and data mining techniques to create scalable solutions for measuring and optimizing shopping assistant systems based on a rich set of structured and unstructured contextual signals.
  3. Design and analyze A/B tests and experiments to evaluate new features and model improvements, ensuring statistical rigor and actionable insights.
  4. Develop metrics, dashboards, and reporting frameworks to monitor system performance, customer engagement, and business impact.
  5. Build predictive models and conduct deep-dive analyses to identify opportunities for improving customer experience, conversion, and satisfaction.

Skills

Required

  • Experience with machine learning/statistical modeling data analysis tools and techniques, and parameters that affect their performance
  • Experience in a ML or data scientist role with a large technology company
  • Experience with data scripting languages (e.g. SQL, Python, R etc.) or statistical/mathematical software (e.g. R, SAS, or Matlab)
  • Experience effectively communicating complex concepts through written and verbal communication
  • Master's degree or above in Math, Statistics, Computer Science, or related science field

Nice to have

  • Experience with AWS services including S3, Redshift, Sagemaker, EMR, Kinesis, Lambda, and EC2
  • Experience in defining and creating benchmarks for assessing GenAI model performance
  • Experience working on multi-team, cross-disciplinary projects

What the JD emphasized

  • multimodal shopping experiences
  • conversational shopping
  • multimodal conversational systems
  • multimodal user queries
  • multimodal datasets

Other signals

  • developing conversation-based, multimodal shopping experiences
  • utilizing data analysis, statistical modeling, machine learning (ML) technologies, and experimentation
  • leveraging advanced analytics, Natural Language Processing (NLP), Machine Learning (ML), A/B testing, causal inference, and data-driven insights
  • measuring and improving multimodal conversational systems, in particular those based on large language models, information retrieval, recommender systems and knowledge graphs
  • making Rufus agentic, enabling customers to set price alerts or empower Rufus to act on their behalf and automatically purchase products when the price is right, to understanding multimodal user queries and generating answers that combine text, image, audio and video