Data Scientist

Reddit Reddit · Consumer · San Francisco, CA · Data Science

Data Scientist role at Reddit focused on designing, developing, and applying data science solutions to improve advertiser experience and ad platform performance. This involves analyzing large datasets, developing ML models for anomaly detection and prediction, and collaborating with product managers and engineers. Requires a Master's degree and experience in data science/ML, with specific skills in statistical modeling, ML algorithms, causal inference, experimentation, large-scale data processing (Spark, Hadoop, Hive), Python/R, ML libraries (scikit-learn, TensorFlow, PyTorch), and SQL.

What you'd actually do

  1. Design, develop, and apply DS solutions to inform improvements in advertiser experience as well as Reddit ad platform and marketplace performance.
  2. Analyse large-scale datasets to identify trends, patterns, and insights that can be used to improve the effectiveness of our advertising platform.
  3. Collaborate with product managers and engineers to define product requirements and translate them into data science solutions.
  4. Develop Machine Learning models and Data Science methods to improve anomaly detection, prediction and pattern recognition.
  5. Communicate findings and recommendations to stakeholders across the organization.

Skills

Required

  • Statistical modelling
  • Machine learning algorithms
  • Causal inference
  • Experimentation design
  • Large-scale data processing and analysis using tools such as Spark, Hadoop, or Hive
  • Python or R
  • Machine learning libraries such as scikit-learn, TensorFlow, or PyTorch
  • SQL and relational databases

What the JD emphasized

  • Machine Learning models
  • Data Science methods
  • Machine learning algorithms
  • Machine learning libraries

Other signals

  • Develop Machine Learning models and Data Science methods to improve anomaly detection, prediction and pattern recognition.
  • Analyse large-scale datasets to identify trends, patterns, and insights that can be used to improve the effectiveness of our advertising platform.