Software Engineer L5, Python Platform

Netflix Netflix · Big Tech · United States · Remote · Engineering

Netflix is seeking a Software Engineer L5 for their Python Platform team. This role involves designing and promoting internal Python libraries, improving the Python development experience by integrating best practices and new technologies, optimizing the tech stack for scalability and performance, and collaborating with developers to provide a comprehensive software development lifecycle. The role also includes support and on-call rotations. While AI fluency is mentioned, the core responsibilities focus on general Python platform engineering and optimization, not direct AI/ML model development.

What you'd actually do

  1. Design and promote internal Python libraries that address common challenges faced by Netflix's Python developers.
  2. Understand and improve Python development experience by bringing in best practices and the latest technologies into runtime management, dependency resolution/management, testing, delivery, monitoring, and operation.
  3. Optimize the tech stack for better scalability and performance.
  4. Work backward from Python developers to understand their needs and wants, and collaborate with partner teams to provide an opinionated, batteries-included software development lifecycle for Python developers.
  5. Participate in the team’s support and on-call rotations.

Skills

Required

  • Python web stack (FastAPI, Flask)
  • Performance optimization
  • NVIDIA GPU software stack (CUDA-aware Python, TensorRT, torch.compile, ONNX)
  • Low-latency/high-throughput distributed systems
  • Cross-functional collaboration
  • Navigating ambiguity
  • Project and product management

Nice to have

  • Inference-serving frameworks (Triton, LitServe, Ray Serve, vLLM, Dynamo)

What the JD emphasized

  • Extensive experience with the Python web stack (e.g. FastAPI, Flask) and its performance optimization.
  • Familiarity with integrating and working with the NVIDIA GPU software stack (e.g. CUDA-aware Python, TensorRT, torch.compile, ONNX).
  • Expertise in performance optimization for low-latency/high-throughput distributed systems.
  • AI fluency.