Data Scientist

Redfin Redfin · Seattle · Detroit, MI

This role focuses on performing exploratory data analysis, feature engineering, and predictive modeling to provide insights and strategic direction. It involves developing and deploying machine learning models, identifying methods for continuous statistical testing, applying feature selection, and comparing models using performance metrics. The role also includes data visualization, presentation of results, and collaboration with engineering and product teams on the internal data platform.

What you'd actually do

  1. Analyze, manipulate and process large sets of data
  2. Develop predictive/machine learning models from large sets of structured and unstructured data
  3. Identify methods that allow continuous and automated statistical testing to enhance the predictability of deployed models
  4. Apply feature selection algorithms to models predicting outcomes of interest
  5. Apply sampling techniques to determine groups to be surveyed

Skills

Required

  • 3 years of experience working with large data sets in one or more of the following environments - Hadoop, AWS, Azure or GCP
  • 3 years of experience with statistical and data mining tools and methods
  • 3 years of experience with supervised models (e.g., regression, neural networks) and unsupervised techniques (clustering, PCA, SVMs)
  • 3 years of experience working with SQL
  • 3 years of experience in Python and at least one strongly typed language (e.g., C++, Java, etc.)
  • 3 years of experience with statistical packages
  • 3 years of experience providing predictive and prescriptive analytics in a business setting
  • Master’s degree in mathematics, statistics, computer science or a related field or equivalent experience
  • Advanced understanding of data visualization libraries
  • Advanced understanding of statistical methodologies for modeling and business analytics