Distributed Systems Engineer 5 - Cloud Gateway

Netflix · Big Tech · United States · Remote · Engineering

Netflix is seeking an experienced Distributed Systems Engineer for its Cloud Gateway team, which manages the communication backbone for Netflix's Cloud. The role involves designing, building, and operating infrastructure that handles billions of requests daily for various business areas, including Gen AI. The engineer will focus on enhancing the security, observability, and resilience of traffic traversing the cloud, building software solutions, and providing technical mentorship. Experience with API Gateways, distributed systems, Kubernetes, and network protocols is essential.

What you'd actually do

  1. Design, develop, and integrate functionality within our data plane and control plane to strengthen the security, observability, and resilience posture for traffic traversing the cloud.
  2. Build software solutions and tools to improve developer experience and product experience in collaboration with team members and stakeholders.
  3. Drive alignment with stakeholders and set partner expectations.
  4. Provide technical mentorship for junior engineers and set best practices through detailed design reviews and high-quality code reviews.

Skills

Required

  • Experience designing and developing software for highly reliable and highly scalable distributed systems in a cloud environment.
  • Hands-on experience with API Gateway technologies, such as Netflix Zuul OSS, Envoy Gateway, Kubernetes Gateway API, Istio Gateway, or other Gateway solutions.
  • Deep expertise in L4 (TCP/UDP) and L7 (HTTP/S, gRPC, WebSockets) protocols.
  • Solid understanding of Kubernetes internals (networking, CNI, CRDs, and Operator patterns).
  • Exceptional analytical and problem-solving skills, with the ability to break down complex problems, identify root causes using data, and deliver clear actionable solutions.
  • Demonstrated high degree of ownership and accountability for execution, managing multiple priorities effectively in a fast-paced, dynamic environment.
  • Proficiency in programming languages such as Java, Go, or C++.

Nice to have

  • Experience leading large-scale migrations.

What the JD emphasized

  • highly reliable and highly scalable distributed systems
  • API Gateway technologies
  • Deep expertise in L4 (TCP/UDP) and L7 (HTTP/S, gRPC, WebSockets) protocols
  • Solid understanding of Kubernetes internals
  • Exceptional analytical and problem-solving skills
  • high degree of ownership and accountability