Software Engineer 5 - Research Data Products

Netflix Netflix · Big Tech · United States · Remote · Engineering

Software Engineer 5 role focused on maturing and expanding Research platforms, data products, and frameworks at Netflix. The role involves leveraging software and data engineering skills to build scalable, performant, and maintainable systems that support quantitative and qualitative researchers. Key responsibilities include designing and building distributed infrastructure, developing integrations with data platforms and experimentation systems, and enabling data scientists by creating scalable methods and utilities. The role requires proficiency in SQL, Python, or Spark, experience with data pipelines and cloud platforms, and an understanding of privacy-preserving techniques. A strong emphasis is placed on observability and owning the solutions built.

What you'd actually do

  1. Solve business needs at scale by applying your software engineering and analytical problem-solving skills
  2. Design and building robust, scalable, and highly available distributed infrastructure
  3. Lead cross-functional initiatives and collaborating with engineers, product managers, and technical program managers across teams
  4. Sharing experiences with open source communities and contributing to Netflix OSS
  5. Develop integrations with data SaaS platforms and first party infrastructure.
  6. Support integrations with Research, A/B testing and experimentation systems for sampling and allocation protocols
  7. Develop libraries and SDKs to extend core functionality to new systems
  8. Enable Data Scientists by developing scalable methods, utilities and applications
  9. Partner with other Data Engineers & Software Engineers to make data available for self-service and wider integration.

Skills

Required

  • SQL
  • Python
  • Spark
  • data pipelines
  • ETL
  • cloud platforms (AWS, GCP, Azure)
  • understanding of privacy-preserving techniques
  • JVM stack (e.g., Java, Scala)
  • data intuition
  • analytical skills
  • data engineering fundamentals
  • craft scalable systems and solutions
  • observability
  • monitoring
  • logging
  • alerting

Nice to have

  • federated learning
  • synthetic data

What the JD emphasized

  • understanding of privacy-preserving techniques
  • understand how ML systems consume data
  • prioritize observability in your designs