Software Engineer (l5) - Experimentation Platform

Netflix Netflix · Big Tech · United States · Remote · Engineering

Senior Distributed Systems Engineer to design, build, and operate the next generation of experimentation and feature flag infrastructure at Netflix. This role focuses on building and evolving critical experimentation and feature flag services, owning reliability and performance, shaping data and integration surfaces, and partnering with product engineering teams.

What you'd actually do

  1. Design and implement high-scale, low-latency services for experiment allocation and feature flag evaluation
  2. Advance core distributed systems for decentralized allocation, rules evaluation, and real-time decisioning
  3. Participate in on-call, lead incident response, and drive long-term reliability improvements
  4. Instrument services with rich observability (metrics, logs, traces) and continuously tune for resilience, performance, and scalability
  5. Collaborate with teams using technologies like Flink, Spark, Elasticsearch, and Druid to ensure experimentation data is correct, timely, and usable

Skills

Required

  • Java or another JVM language
  • distributed systems
  • backend services
  • monitoring
  • on-call
  • debugging
  • performance tuning
  • reliability improvements

Nice to have

  • Flink
  • Spark
  • Elasticsearch
  • Druid

What the JD emphasized

  • high-scale
  • low-latency
  • reliability
  • observability
  • performance
  • scalability