Distributed Systems Engineer 5 - Decisioning & Optimization

Netflix Netflix · Big Tech · New York, NY +4 · Engineering

Netflix is seeking a Distributed Systems Engineer to build and scale the core infrastructure for their ad tech ecosystem, focusing on real-time ad decisioning, ML model serving, and optimization systems. The role involves working across the stack from model serving to auction execution and pacing, shipping production systems that directly impact revenue and advertiser outcomes.

What you'd actually do

  1. Build and evolve the real-time ad decisioning path: ranking, scoring, bidding, and pacing under strict latency and throughput constraints
  2. Develop and operate ML model serving infrastructure supporting dozens of concurrent hot-path models with sub-20ms P99 inference, including model routing, lifecycle management, fallback tiers, and calibration serving
  3. Partner with Science and Platform teams to productionize models and deploy algorithms into the serving stack
  4. Build simulation and testing frameworks to enable offline validation of marketplace changes before live rollout
  5. Implement and improve real-time pacing systems that drive budget delivery accuracy across campaign lifetimes

Skills

Required

  • 7+ years building distributed systems and backend services at scale
  • Ads domain experience (2+ years): worked on ad serving, delivery, or marketplace systems
  • Experience with ML model serving infrastructure: real-time inference, model deployment pipelines, feature hydration, fallback strategies
  • Built or worked on core ad tech systems: ad servers, bidders, pacers, or ranking and scoring components
  • Built APIs and backend services that integrate across a multi-team platform
  • Understanding of ad serving concepts: inventory management, frequency capping, member ad experience quality, and supply-demand dynamics
  • Comfortable working at the intersection of engineering and data science, productionizing ML models into low-latency serving paths

Nice to have

  • Experience with auction mechanics: first-price, second-price, reserve pricing, bid shading
  • Experience building multi-stage ranking systems (retrieval, scoring, reranking), podding and ad break planning
  • Built or improved budget pacing and delivery control systems
  • Familiar with CTV constraints: server-side ad insertion, live event ad serving at scale
  • Experience with experimentation infrastructure: A/B testing, holdout groups, marketplace experiments
  • Built simulation or counterfactual testing platforms for marketplace or auction systems

What the JD emphasized

  • strict latency and throughput constraints
  • sub-20ms P99 inference
  • productionize models
  • ML model serving infrastructure
  • real-time inference
  • productionizing ML models into low-latency serving paths

Other signals

  • ML model serving infrastructure
  • real-time inference
  • productionizing ML models
  • distributed systems at scale
  • ad tech ecosystem