About the Team The Global Traffic Infrastructure (GTI) team leverages unified platform capabilities to manage edge infrastructure outside China (both self-built and third-party) providing standardized, compliant, scalable, and cost-effective traffic infrastructure capabilities for edge services. Our vision is to build a global edge traffic infrastructure platform and become the long-term cornerstone of ByteDance’s global edge business in terms of scale, performance, and cost.
Responsibilities
- Design, build and operate container components for edge Kubernetes clusters (e.g., CNI dataplane/control plane, L4/L7 LB, Ingress/Gateway), with a focus on performance, reliability and security.
- Drive end-to-end delivery of cloud-native features from requirements, architecture, implementation and testing to production rollout, troubleshooting and continuous optimization.
- Build observability and troubleshooting capabilities for cloud-native based components (metrics/logs/tracing/profiling), and improve incident response efficiency through automation.
- Collaborate with SRE, platform, and product teams to provide standardized features and reusable platform capabilities for traffic-intensive services.
Requirements
Minimum Qualifications
- BS/MS in Computer Science, Computer Engineering or equivalent practical experience.
- Strong software engineering fundamentals and solid backend/system development experience in one or more languages such as Go/C/C++/Rust.
- Hands-on production experience with containers and Kubernetes (cluster operations, networking, service discovery, Ingress/Gateway, rollout strategies).
- Solid understanding and practical experience with cloud-native stack, especially CNI and load balancing (L4/L7 concepts, NAT/contract/IPVS, health check, traffic steering).
- Strong problem-solving skills, ownership, and ability to work effectively in cross-functional, fast-paced environments.
Preferred Qualifications
- Experience with or contributions to cloud-native open-source projects (e.g., Kubernetes, Containerd, Cilium/eBPF).
- Strong Linux knowledge (kernel, scheduling ,TCP/IP, routing, iptables/nftables) and production troubleshooting experience.
- Experience designing multi-tenant isolation, network security controls, and reliability mechanisms for high-traffic systems.
- Familiar with CDN, Livestream, RTC or other edge traffic production architecture.