Distributed Systems Engineer 6 - Decisioning & Optimization

Netflix Netflix · Big Tech · New York, NY +4 · Engineering

Netflix is seeking a Distributed Systems Engineer 6 to lead the technical direction of the Decisioning & Optimization team within their ad tech ecosystem. This role involves architecting and scaling real-time ad decisioning systems, including ML model serving infrastructure, ranking, scoring, and optimization under strict latency constraints. The engineer will also drive operational excellence and collaborate with Science and Platform teams to productionize ML algorithms.

What you'd actually do

  1. Own the technical direction of the Decisioning & Optimization team: architecture reviews, incident leadership, capacity planning, and scaling
  2. Architect and evolve the real-time ad decisioning optimization path: multi-stage auction, ranking, scoring, bidding, and pacing under strict latency and throughput constraints
  3. Scale our ads model serving infrastructure to support dozens of concurrent hot-path ML models with sub-20ms P99 inference, including config-driven model routing, multi-model lifecycle management, fallback tiers, and calibration serving
  4. Work closely with Science and Platform teams, ensuring seamless model productionization and algorithm deployment
  5. Build out various simulation and containerized testing frameworks to enable offline validation of marketplace changes before live rollout

Skills

Required

  • 10+ years building distributed systems and backend services at large scale
  • 3+ years in the ads domain
  • Deep experience with ML model serving infrastructure
  • scaling real-time inference on the hot path at high QPS with sub-20ms P99 latency
  • model deployment pipelines
  • feature hydration
  • fallback strategies
  • Built and operated core ad tech systems
  • ad servers
  • bidders
  • pacers
  • ranking and scoring components
  • Designed APIs
  • platform abstractions
  • data models
  • Strong understanding of ad serving concepts
  • inventory management
  • frequency and recency capping
  • member ad experience quality
  • supply-demand dynamics
  • Track record of technical leadership across multiple teams
  • setting architectural direction
  • influencing cross-functional roadmaps
  • Comfortable at the intersection of engineering, data science, and product
  • translating ML research and algorithms into production systems
  • Demonstrated ability to operate in the environment which is a mix of big-tech scale and startup speed
  • taking projects that normally take years and delivering production-ready results with tight timelines

Nice to have

  • Experience with auction mechanics
  • first-price
  • second-price
  • reserve pricing
  • bid shading
  • marketplace competition dynamics
  • Multi-stage ranking systems
  • retrieval
  • scoring
  • reranking
  • podding
  • ad break planning
  • Built or improved budget pacing and delivery control systems
  • Yield optimization
  • inventory forecasting
  • dynamic pricing
  • fill rate optimization
  • demand/supply allocation strategies
  • Familiar with CTV constraints
  • server-side ad insertion
  • live event ad serving at scale
  • Experience with experimentation infrastructure
  • A/B testing
  • holdout groups
  • interference-aware marketplace experiments
  • Built simulation or counterfactual testing platforms for marketplace or auction systems
  • Strong background in resiliency and reliability
  • ensuring system availability under extreme load
  • live events
  • traffic spikes

What the JD emphasized

  • ML model serving infrastructure
  • sub-20ms P99 inference
  • real-time ad decisioning optimization path
  • goal-based delivery optimization
  • technical leadership

Other signals

  • ML model serving infrastructure
  • real-time ad decisioning
  • ranking and scoring
  • sub-20ms P99 inference
  • goal-based delivery optimization