Senior Software Engineer, ML Training Platform

Reddit Reddit · Consumer · San Francisco, CA · Engineering

Senior Software Engineer, ML Training Platform at Reddit. This role focuses on building feature stores and infrastructure for ML model training and online inference, aiming to improve the ML feature software development lifecycle and advance the ML Platform. The role involves leading development, mentoring team members, and collaborating with other engineering teams.

What you'd actually do

  1. Build feature stores for online inference and model training, tools that enable machine learning engineers (MLEs) and data scientists (DSs) to improve the ML feature software development lifecycle.
  2. Lead the building, testing, and maintenance of our ML Feature Store and Feature Engineering infrastructure at Reddit that serves 100 million+ redditors.
  3. Work with other engineers across ML Platform and Reddit infrastructure to significantly advance the ML Platform.
  4. Work with management on team goal setting, planning, and de-risk project execution.
  5. Mentor other team members in adopting a rigorous DevOps approach to maintain and/or improve ML infra components and services health and quality.

Skills

Required

  • Master’s degree (or its foreign equivalent) in Computer Science, Engineering (any field), Data Science, or closely related quantitative discipline and two (2) years of experience in the job offered or in any occupation in related field.
  • Designing and building machine learning model serving infrastructure
  • Deploy models for serving in production
  • Enhancing machine learning (ML) observability of feature drift detection and model performance
  • Python
  • Git
  • SQL or MYSQL
  • Designing technology stacks
  • Debugging and monitoring for production services

What the JD emphasized

  • machine learning model serving infrastructure
  • Deploy models for serving in production
  • ML observability of feature drift detection and model performance

Other signals

  • feature stores
  • ML feature software development lifecycle
  • ML Platform