Currently tracking 106 active AI roles, with 26 new openings in the last 4 weeks. Primary focus: Serve · Engineering.
Big Tech · ByteDance core (Doubao / Seed / infra)
| Title | Stage | AI score |
|---|---|---|
| Software Engineer - AI Compute Infrastructure Software Engineer focused on building and maintaining large-scale, Kubernetes-native AI compute infrastructure for LLM inference, emphasizing performance, scalability, and cost-efficiency. The role involves architecting GPU-optimized systems and collaborating on inference solutions using various LLM engines. | Serve | 7 |
| Software Engineer - AI Compute Infrastructure Software Engineer focused on building and maintaining large-scale, Kubernetes-native LLM inference infrastructure (AIBrix) with a focus on performance, scalability, and cost-efficiency. The role involves architecting GPU-optimized systems, collaborating on inference solutions using various LLM engines, and contributing to open-source projects. | Serve | 7 |
| Cloud Acceleration Engineer – DPU & AI Infra This role focuses on designing and developing DPU network software and exploring AI/ML infrastructure acceleration, specifically for distributed training and inference. It involves software-hardware co-design and performance optimization of systems related to AI computing. | ServeData | 7 |
| Cloud Acceleration Engineer – DPU & AI Infra ByteDance is seeking a Cloud Acceleration Engineer to focus on DPU and AI infrastructure. The role involves designing and developing high-performance DPU network software, collaborating on software-hardware co-design, and exploring AI/ML infrastructure acceleration for distributed training and inference. The position requires strong C/C++ and Linux systems development skills, with a background in areas like software-hardware co-design, distributed systems, networking, or AI/ML systems. | ServeData | 7 |
| Senior Software Engineer, AI Infrastructure - Developer Tooling Senior Software Engineer to build AI-powered developer tools, focusing on retrieval infrastructure (RAG), a coding agent with multi-step generation and tool use, and evaluation frameworks for measuring effectiveness. Requires strong Python/TypeScript, systems-level language experience, and practical LLM integration. | AgentData | 7 |
| Tech Lead, AML Orchestration Tech Lead for an Applied Machine Learning (AML) team focused on building and advancing distributed orchestration platforms for recommendation systems, ads ranking, and search ranking. The role involves leading a team of ML Engineers, setting technical strategy for resource efficiency, distributed training, and online inference systems, and optimizing large-scale distributed orchestration and scheduling strategies. | ServeAgent | 7 |
| Machine Learning Engineer (User Growth & Intelligent Marketing) - Global e-Commerce Machine Learning Engineer focused on optimizing user growth and intelligent marketing algorithms for TikTok's e-commerce platform. This role involves developing and implementing solutions for personalized recommendations, user value modeling, uplift modeling, and marketing efficiency to drive e-commerce GMV growth. | Ship | 7 |
| Machine Learning Engineer, Search - Local Services Team Machine Learning Engineer for ByteDance's Local Services team, focusing on enhancing user discovery and ecosystem growth for hospitality, dining, and leisure experiences. The role involves leveraging large-scale ML for search and recommendation systems, aiming to improve personalized relevance, CTR/CVR prediction, and conversion efficiency for billions of users. Responsibilities include designing and implementing full-stack search algorithms, query analysis, ranking, and personalized behavior modeling. | Ship | 7 |
| Machine Learning Platform Engineer, Applied Machine Learning Team Machine Learning Platform Engineer to develop and maintain a platform supporting deep learning models for code development, testing, training, model deployment, and other core business functions. The role supports recommendation, advertising, and search systems, focusing on distributed training of large-scale deep learning models. | ServeData | 7 |
| Senior Software Engineer, Cross Platform Applications Senior Software Engineer to build AI-powered developer tools that integrate AI/ML into the toolchain to accelerate software development, improve code quality, and simplify engineering workflows. Focus on intelligent assistants, static/dynamic analyzers, and smart automation features. | Agent | 7 |
| Software Engineer - Compute Infrastructure (Orchestration & Scheduling) Software Engineer role focused on building and optimizing large-scale compute infrastructure (Kubernetes, Serverless) to support AI and LLM workloads, including training and inference. The role involves enhancing cluster management, developing intelligent scheduling systems leveraging AI models for resource optimization, and leading infrastructure for next-gen ML workloads. | ServeAgent | 7 |
| Senior Software Engineer - Compute Infrastructure (Orchestration & Scheduling) Senior Software Engineer focused on building and optimizing large-scale compute infrastructure (Kubernetes, Serverless) for AI and LLM workloads, including scheduling, resource management, and inference. The role involves developing intelligent scheduling systems using AI models and contributing to open-source projects. | ServeAgent | 7 |
| Senior Software Engineer - Compute Infrastructure (Orchestration & Scheduling) Senior Software Engineer focused on building and optimizing large-scale compute infrastructure (Kubernetes, Serverless) for AI and LLM workloads, including scheduling, resource management, and inference. The role involves enhancing performance, scalability, and cost-efficiency for training and inference, with a focus on heterogeneous resources (CPU, GPU) and open-sourcing key technologies. | ServeAgent | 7 |
| Software Engineer - Compute Infrastructure (Orchestration & Scheduling) Software Engineer role focused on building and optimizing large-scale compute infrastructure (Kubernetes, Serverless) for AI and LLM workloads, emphasizing resource efficiency, scheduling, and reliability. The role involves developing intelligent scheduling systems leveraging AI models and leading infrastructure for ML training/inference. | ServeAgent | 7 |
| Machine Learning Engineer - PICO Perception - San Jose Machine Learning Engineer focused on optimizing and deploying AI algorithms on Qualcomm chips for XR devices, emphasizing low-power consumption and performance improvement. This role involves close collaboration with hardware vendors and contributing to the AI toolchain and technical ecosystem. | Serve | 7 |
| Machine Learning Engineer, NLP - TikTok E-commerce Knowledge Graph Machine Learning Engineer focused on NLP and Knowledge Graphs for TikTok E-commerce. Responsibilities include constructing massive product knowledge graphs to enhance feed ranking, recommendations, and ads, and collaborating with cross-functional teams on product strategies. Requires a Bachelor's degree, 3+ years of ML/NLP/CV experience, and proficiency in C++/Python/Go/Java. | Data | 7 |
| Senior Site Reliability Engineer - Applied Machine Learning Site Reliability Engineer for an Applied Machine Learning team focused on next-generation recommendation algorithms and platforms. The role involves ensuring high availability and creating automated systems for large-scale AI/recommendation systems. | ServeShip | 7 |
| AI/LLM Network Software Development Engineer Develops and optimizes high-speed network infrastructure and communication frameworks specifically for AI/LLM applications, focusing on performance, scalability, and reliability in large-scale data centers. | Serve | 7 |
| Security Software Engineer (TDR) Security Software Engineer focused on threat detection and response (TDR) for large-scale platforms, with a specific emphasis on securing AI and LLM systems, including agent architectures. The role involves designing and building defensive frameworks, identifying attack surfaces, and ensuring the safe deployment of AI-powered products. | Agent | 5 |
| Senior Industrial Design/Computational Designer / AI Design Engineer - Pico Senior Industrial Designer/Computational Designer with AI Design Engineering focus, responsible for incubating new product concepts by rapidly prototyping end-to-end user experiences that leverage emerging technologies. The role involves designing physical products as contextual nodes in a distributed AI system, prototyping both hardware and software to validate sensing, inference, feedback, and user behavior change loops. Key responsibilities include architecting AI contextual ecosystems, iterative design prototyping from 'works-like' to high-fidelity, defining sensor architecture for AI context perception, and developing a physical language for AI semantics. The role requires strong scripting/programming skills in Python and familiarity with AI-assisted workflows. | Agent | 5 |
| Software Engineer - Compute Infrastructure (Cloud Native) Software Engineer role focused on building and optimizing large-scale compute infrastructure (Kubernetes, Serverless) that supports AI/LLM workloads. The role involves improving system performance, developing resource management and scheduling systems, and driving standardization for efficiency and reliability. While the infrastructure supports AI, the core craft is in infrastructure engineering, not direct AI/ML model development. | — | 5 |
| Senior Software Engineer - Compute Infrastructure (Cloud Native) Senior Software Engineer focused on building and optimizing large-scale Kubernetes-based compute infrastructure for diverse workloads, including AI and LLM applications. The role involves improving system performance, developing resource management and scheduling systems, and driving standardization for efficiency and reliability in cloud-native environments. | — | 5 |
| Senior Software Engineer - Compute Infrastructure (Cloud Native) Senior Software Engineer role focused on building and optimizing large-scale Kubernetes and Serverless compute infrastructure that powers AI/LLM workloads. The role involves designing, improving, and managing resource orchestration, scheduling, and observability for these systems, with a focus on cost efficiency and performance. The team also contributes to open-source projects. | Serve | 5 |
| Software Engineer - Compute Infrastructure (Cloud Native) This role focuses on building and optimizing large-scale compute infrastructure using Kubernetes and Serverless technologies, specifically for AI and LLM workloads. The engineer will improve system performance, develop resource management systems, and enhance observability for these demanding applications. | — | 5 |
| Software Engineer - Applied Machine Learning, Engine Software Engineer role focused on building and running distributed recommendation systems, with a focus on the efficiency tools and platform for training online models and managing hardware resources. Involves research, design, development, and maintenance of software and systems, with a preference for ML framework experience. | Serve | 5 |
| Senior/Tech Lead AI/LLM Network Software Development Engineer - San Jose This role focuses on designing, implementing, and deploying high-speed network technologies specifically to support AI/LLM applications. Responsibilities include developing platforms for monitoring and diagnosing large-scale AI networks, researching and optimizing AI communication frameworks, network protocols, and host-network-application co-design for scalability and performance, and building next-generation AI network infrastructure. | Serve | 5 |
| Video Algorithm Engineer - Multimedia Lab The role focuses on designing and implementing algorithms for video systems, including encoding, understanding, processing, enhancement, quality metrics, delivery, and streaming. It requires strong CS fundamentals, programming skills, and experience in video-related areas, with a preference for deep learning and neural-network-based approaches. | Serve | 5 |
| Camera System Architect - PICO Lab VST - San Jose This role is for a Camera System Architect at ByteDance's PICO Lab, focusing on AR/VR camera systems. The architect will define and own end-to-end camera solutions, driving hardware-software co-design and integrating computer vision and AI algorithms into production systems. The role requires experience in camera/imaging system development, full-cycle product development, and leveraging AI-assisted engineering tools. | — | 2 |
| Cloud Infrastructure Architect The Cloud Infrastructure Architect role at ByteDance focuses on optimizing AI server costs and automating processes within the cloud supply chain system. This involves resource planning, capacity management, and architecture evolution to ensure the availability of computing resources for the company's AI business. | — | 1 |
| Technical Project Manager - Resource Management ByteDance Cloud Team is seeking a Technical Project Manager to manage the full lifecycle of global core cloud resource delivery, including acquisition, cost management, and risk mitigation. The role involves collaborating with various teams and suppliers to ensure efficient delivery and utilization of cloud infrastructure, supporting the company's AI initiatives. | — | 1 |
| Site Reliability Engineer - System Service Global Site Reliability Engineer responsible for managing and maintaining large-scale host infrastructure and foundational services in ByteDance's global data centers, focusing on reliability, availability, and automation. | — | 0 |
| Supply Chain Sourcing Manager - CDN This role is for a Supply Chain Sourcing Manager focused on CDN infrastructure services, including servers, network equipment, and logistics. Responsibilities include executing sourcing strategies, managing RFX processes, contract lifecycle, tracking quotations, maintaining price validation, assisting with supplier onboarding, and supporting TCO analysis. The role requires a strong understanding of the IDC/CDN/POP datacenter hardware supply chain and related services. | — | 0 |
| U.S. International Tax Senior Manager (Los Angeles) Seeking a U.S. International Tax Senior Manager to join the Tax Department, supporting a broad range of tax matters. This role involves working with domestic and non-US tax country managers and cross-functional teams to assess tax implications of ByteDance’s and its affiliates’ overall tax positions, products, business developments, and operational initiatives. Responsibilities include participating in domestic and cross-border discussions, collecting and analyzing financial data, supporting tax calculations and modeling, and developing practical, business-oriented tax guidance. The ideal candidate will possess strong technical knowledge of U.S. domestic and international tax, sound judgment, hands-on modeling capabilities, and the ability to translate complex tax concepts into clear, actionable business recommendations. This individual should be intellectually curious, execution-oriented, and comfortable operating independently in a fast-paced global environment. | — | 0 |
| U.S. International Tax Senior Manager (San Jose) Seeking a U.S. International Tax Senior Manager to join the Tax Department, supporting a broad range of tax matters. This role involves working with domestic and non-US tax country managers and cross-functional teams to assess tax implications of ByteDance’s and its affiliates’ overall tax positions, products, business developments, and operational initiatives. Responsibilities include participating in domestic and cross-border discussions, collecting and analyzing financial data, supporting tax calculations and modeling, and developing practical tax guidance. The ideal candidate will have strong technical knowledge of U.S. domestic and international tax, sound judgment, hands-on modeling capabilities, and the ability to translate complex tax concepts into clear, actionable business recommendations. The individual should be intellectually curious, execution-oriented, and comfortable operating independently in a fast-paced global environment. | — | 0 |
| Senior Network Capacity Planning Engineer Senior Network Capacity Planning Engineer responsible for end-to-end network planning for large-scale data center and backbone networks, translating product growth and traffic patterns into multi-quarter and multi-year capacity plans, identifying risk hotspots, and driving investment and delivery priorities. | — | 0 |
| Senior Leader Security and Safety Operations - Data Centers Senior Leader for Security and Safety Operations in Data Centers, responsible for implementing and enforcing security and EHS policies, overseeing physical security and EHS operations, conducting audits, and coordinating with various stakeholders. The role involves developing security strategies, managing incidents, and ensuring compliance with global safety programs and local regulations. | — | 0 |
| Site Reliability Engineer - Security Engineering - San Jose ByteDance is seeking an experienced Site Reliability Engineer for their Security Engineering team in San Jose. This role involves designing and implementing security SRE frameworks, building cutting-edge SRE technologies for system deployment, upgrade, capacity planning, troubleshooting, and disaster recovery. The engineer will also focus on automation, intelligence, and monitoring of SRE infrastructure, and support cross-functional teams with security products and services. The role requires strong programming skills, experience with distributed systems, cloud-native frameworks like Kubernetes, and SRE tools. | — | 0 |
| Network Software Development Engineer Develop and test core functionalities of the Network Operating System (NOS) for hyperscale data center networks and AI infrastructure. Research and implement next-generation switch software for network monitoring, telemetry, load balancing, congestion control, and system reliability. Design and maintain CI/CD pipelines and automated testing frameworks. | — | 0 |
| Cloud Infrastructure Architect - Ashburn This role focuses on optimizing AI server costs and automating processes within ByteDance's cloud infrastructure supply chain system. The goal is to ensure the availability of computing resources for the company's AI business development by managing resource planning, platform module improvements, and architecture evolution. | — | 0 |
| Data Center Site Acquisition Manager - Infrastructure Power & Energy Seeking a Data Center Site Acquisition Manager to support global energy expertise for hyper-scale data centers. Responsibilities include leading utility and power due diligence, managing energy supply, supporting lifecycle power management, and conducting energy cost analysis. Requires 5+ years of experience in infrastructure development and acquisition with a focus on power and utility, and 3+ years managing utility agreements and energy projects. | — | 0 |
| Data Center Site Acquisition Manager - Infrastructure Power & Energy ByteDance is seeking a Data Center Site Acquisition Manager to manage power and energy functions, including availability analysis and reliability due diligence for site expansion. The role involves working with various internal teams and external stakeholders to manage construction and service projects, focusing on utility and power due diligence, energy supply management, lifecycle power management, and energy cost analysis. | — | 0 |
| Data Center Site Acquisition Manager - Infrastructure Power & Energy ByteDance is seeking a Data Center Site Acquisition Manager to oversee power and energy functions for their global data centers. This role involves leading utility due diligence, managing energy supply, supporting power lifecycle management, and conducting energy cost analysis. The ideal candidate will have extensive experience in real estate/tech infrastructure development, utility agreements, and energy markets, with a focus on power and reliability. | — | 0 |
| Network Engineer, Global BackBone ByteDance is seeking a Network Engineer for its Global BackBone team to design, build, and operate hyperscale data-center networking solutions. The role involves driving network design, buildout projects, and operations for high availability and performance, collaborating with cross-functional teams, and managing vendor engagements. Requires expertise in large-scale global backbone network design and operation, including TE, Segment Routing, BGP, ISIS, and experience with DataCenter, Ex-Network, and POP Networks. | — | 0 |
| Backend Software Engineer - Platforms Backend Software Engineer for ByteDance's Data Center Systems (DCS) platform team, focusing on building and managing tooling for data center operations, infrastructure, and resource management to provide stable, high-performance computing resources for all business lines. | — | 0 |
| Cloud Leader - RD & SRE This role focuses on building, scaling, and operating global infrastructure, including hyperscale data centers, cloud solutions, and foundational infrastructure services. Responsibilities include developing internal tools, automation, monitoring systems, managing cloud images, improving operational efficiency, and handling incidents. The role requires strong SRE/DevOps experience with Linux, networking, storage, and programming languages like Go or Python, along with familiarity with public cloud platforms and reliability engineering practices. | — | 0 |
| XR Module Engineer - Tunable Lens and Multiphysics simulation - PICO - San Jose Engineering role focused on the development and design of tunable lenses and multiphysics simulation for XR eyepiece systems, requiring a PhD in a related field and experience in optical design and FEA. | — | 0 |
| Backend Software Engineer - Platforms Backend Software Engineer for ByteDance's Data Center Systems (DCS) platform team. This role focuses on building and managing tooling for data center operations, including infrastructure, server delivery, asset management, and cloud/server system services. The goal is to provide stable, high-performance, and cost-effective computing resources for the company's business lines. Responsibilities include developing platforms and tools for internal/external teams, improving engineering productivity, and maintaining software. Requirements include a BS in CS or equivalent, experience with distributed systems, and specific experience in DICM, ITOM, or ITSM. GPU, big data (Hadoop/Kafka), or streaming experience is a plus. | — | 0 |
| Supply Chain Manager - Global Logistics Supply Chain Manager role focused on data center operations, including logistics, asset lifecycle management, RMA workflows, and process optimization. Requires experience in global supply chain, data center infrastructure, and project management. | — | 0 |
| Tech Lead Manager, Global Traffic Infrastructure Tech Lead Manager for Global Traffic Infrastructure (GTI) team at ByteDance, focusing on edge infrastructure management, platformization, and cost-efficiency. The role involves leading a senior engineering team, defining technical vision, and ensuring reliability and scalability of global traffic infrastructure. | — | 0 |
| Tech Lead Manager, Global Traffic Infrastructure Tech Lead Manager for Global Traffic Infrastructure, focusing on edge infrastructure management, platformization, and scaling for global edge business. Requires strong leadership in cloud-native and networking domains. | — | 0 |