Data Scientist I, Worldwide Product Compliance

Amazon Amazon · Big Tech · LU, Luxembourg · Data Science

Data Scientist I role focused on developing and delivering core data science capabilities for AI-enabled operations, leveraging LLMs, Generative AI, and predictive analytics to solve complex operational problems within Amazon's global footprint. The role involves assessing solution approaches, designing extensible solutions, owning end-to-end data science projects, collaborating with cross-functional teams, and contributing to AI strategy and investment priorities.

What you'd actually do

  1. Assess and select ideal solution approaches from a wide range of data science methodologies, including machine learning, statistical modeling, NLP, and LLM-based techniques, to solve complex, ambiguous operational problems with significant business impact.
  2. Apply deep expertise to problems involving complex interactions among software systems, data pipelines, and operational processes; design solutions that accurately model these interactions and are extensible, actionable, and easy for others to contribute to.
  3. Own and deliver end-to-end data science solutions for the business with minimal assistance, building a track record of successful launches that drive measurable operational improvements across Amazon's global footprint.
  4. Work closely with operations business teams to deeply understand their challenges, translate ambiguous needs into well-defined problem statements, and ensure data science solutions are grounded in real operational context.
  5. Stay current on data science developments and emerging research; raise awareness of new and well-established techniques across the team

Skills

Required

  • data scripting languages (e.g., SQL, Python, R, or equivalent) or statistical/mathematical software (e.g., R, SAS, Matlab, or equivalent)
  • data/research scientist, statistician or quantitative analyst in an internet-based company with complex and big data sources

Nice to have

  • Knowledge of statistical packages and business intelligence tools such as SPSS, SAS, S-PLUS, or R
  • Experience with clustered data processing (e.g., Hadoop, Spark, Map-reduce, and Hive)

What the JD emphasized

  • complex, ambiguous operational problems
  • complex interactions among software systems, data pipelines, and operational processes
  • end-to-end data science solutions
  • measurable operational improvements
  • deeply understand their challenges
  • translate ambiguous needs into well-defined problem statements
  • grounded in real operational context
  • AI-enabled operations
  • intelligent, data-driven operational solutions
  • data science strategy
  • model optimization
  • system architecture
  • AI ecosystem
  • decision intelligence platforms
  • causal engines

Other signals

  • LLMs
  • Generative AI
  • predictive analytics
  • intelligent, data-driven operational solutions
  • AI-enabled operations
  • data science strategy
  • model optimization
  • system architecture
  • AI ecosystem
  • decision intelligence platforms
  • causal engines