Data Scientist 5 - Availability

Netflix · Big Tech · United States · Remote · Data & Insights

Netflix is seeking a Data Scientist 5 for its Infrastructure Availability team. The role involves developing metrics, building tools, performing analyses, and making recommendations to improve the reliability and availability of Netflix's systems. The ideal candidate has experience in metric research, statistical modeling, causal inference, and large-scale distributed systems. Responsibilities include developing availability metrics, productionizing them with Data Engineering, conducting root cause analysis with engineering teams, and communicating findings. The role requires strong skills in Python, SQL, and workflow orchestration tools such as Airflow, as well as excellent communication and stakeholder management abilities.

What you'd actually do

  1. Develop metrics to measure the availability, reliability, and performance of Netflix’s infrastructure stack
  2. Partner with Data Engineering, Data Science and Analytics Engineers to productionize system level availability metrics and dashboards
  3. Partner with Software Engineers, Performance Engineers, Technical Product Managers and other product teams to conduct root cause and causal inference analysis of availability issues and recommend how to remediate them
  4. Connect with the larger analytics community at Netflix to bring more visibility to the team's work

Skills

Required

  • Python
  • SQL
  • Apache Airflow
  • metric development
  • statistical modeling
  • causal inference
  • system reliability
  • large distributed systems

Nice to have

  • communication with technical and non-technical audiences
  • storytelling with data
  • stakeholder management
  • ambiguity tolerance
  • product sense

What the JD emphasized

  • metric development and measurement in the domain of system reliability / availability for large, distributed systems, platforms, and/or infrastructure