ML Software Engineer 6 - AI for Member Systems (aims)

Netflix Netflix · Big Tech · Los Gatos, CA +1 · Data & Insights

Netflix is seeking an ML Software Engineer to build and operate the platform integration layer for their AI for Member Systems (AIMS) organization. This role focuses on connecting AIMS' AI capabilities (recommendations, personalization, search, discovery) to Netflix's ML infrastructure and serving ecosystem, ensuring scalability and integration. The engineer will drive architectural decisions, own integration points, and design reusable infrastructure components to support ML models and GenAI capabilities across various surfaces.

What you'd actually do

  1. Design and build the platform integration layer between AIMS' intelligence systems and Netflix's ML infrastructure, serving layer, and product engineering teams
  2. Drive architectural decisions on how AIMS capabilities (ranking, retrieval, orchestration, foundation models) connect to shared platform services including model serving, inference infrastructure, feature stores, and experiment platforms
  3. Own the technical design of key integration points between the intelligence layer and the member experience serving stack, ensuring clean contracts, reliable handoffs, and operational excellence
  4. Architect “paved paths” for AIMS application teams to build and deploy ML models and GenAI capabilities through consistent patterns in data access, training, evaluation, deployment, and monitoring
  5. Design reusable, horizontal infrastructure components that multiple AIMS teams can adopt, avoiding duplication and ensuring improvements propagate across surfaces

Skills

Required

  • Python
  • JVM language (Scala or Java)
  • production ML systems
  • platform integration layers
  • ML capabilities to shared infrastructure
  • modern ML and GenAI patterns
  • recommendation/ranking systems
  • LLM serving
  • agentic architectures
  • design frameworks and abstractions
  • prototypes
  • technical direction across multiple teams
  • communication skills

Nice to have

  • orchestration runtimes
  • agentic architectures
  • member-facing serving infrastructure
  • distributed systems
  • large-scale real-time and batch processing
  • personalization domains
  • recommendation systems
  • search
  • discovery
  • TensorFlow
  • PyTorch
  • JAX
  • cost efficiency
  • capacity planning
  • compute optimization for ML workloads

What the JD emphasized

  • Significant experience designing, building, and operating production ML systems end-to-end
  • Experience building platform and integration layers that connect ML capabilities to shared infrastructure
  • Proven ability to identify common patterns and design frameworks and abstractions that are flexible, extensible, and easy for engineers to adopt
  • Hands-on ability to scope and validate architectures through prototypes, turning ambiguous problems into concrete proposals and reference implementations

Other signals

  • building a shared intelligence layer that unifies how personalization works
  • orchestrates foundation models, retrieval, ranking, and policy through a governed runtime
  • connects AIMS' AI capabilities to the broader Netflix ML and serving ecosystem
  • architect 'paved paths' for AIMS application teams to build and deploy ML models and GenAI capabilities