Zappos Data Scientist Iii, Zappos/shopbop Catalog Engineering

Amazon Amazon · Big Tech · Madison, WI · Software Development

Data Scientist III on the Shopbop/Zappos Catalog Tech team responsible for designing and implementing ML approaches to improve product catalog data quality, automate data capture and classification, and integrate ML models into production systems. The role involves working with computer vision and NLP, and influencing product decisions through data-driven insights.

What you'd actually do

  1. Design and implement machine learning approaches to improve catalog data quality.
  2. Develop and validate scientific methodologies for automated data capture and classification.
  3. Partner with engineering teams to integrate ML models into production systems.
  4. Create and present analysis that drives decision-making at the senior leadership level.

Skills

Required

  • 5+ years of data querying languages (e.g. SQL), scripting languages (e.g. Python) or statistical/mathematical software (e.g. R, SAS, Matlab, etc.) experience
  • 4+ years of data scientist experience
  • Experience with statistical models e.g. multinomial logistic regression
  • Bachelor's degree

Nice to have

  • 2+ years of data visualization using AWS QuickSight, Tableau, R Shiny, etc. experience
  • Experience managing data pipelines
  • Experience as a leader and mentor on a data science team

What the JD emphasized

  • revolutionize how we manage and enhance our product catalog data
  • solve complex data challenges through advanced analytics and machine learning
  • accelerate product data capture
  • develop state-of-the-art image classification systems for fashion features
  • push the boundaries of what's possible with applied machine learning
  • bring innovative solutions to bear for customers
  • transform our catalog operations
  • delivering robust, scalable solutions

Other signals

  • design and implement machine learning approaches to improve catalog data quality
  • develop and validate scientific methodologies for automated data capture and classification
  • integrate ML models into production systems
  • computer vision, NLP, and advanced ML models