What AI roles is ByteDance hiring for?

ByteDance currently has 115 active AI-related roles in our index. The most common open titles are: Cloud Acceleration Engineer – DPU & AI Infra (2), LLM AIOps Development Engineer - Data Center Networking (2), Multimodal Model Training and Inference Optimization Engineer (2), Research Engineer - LLM Training Infrastructure - Seed Infra (2), Research Engineer - LLM/VLM Inference Optimization (Seed Infra) (2). Most positions are in Engineering and Research.

What stage of AI development does ByteDance focus on?

ByteDance's active AI hiring is concentrated in: serving infrastructure (38%), agents (25%), post-training (12%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

Where is ByteDance hiring AI talent?

ByteDance is hiring AI talent in: United States (115 roles).

What skills does ByteDance look for in AI roles?

Job postings at ByteDance most frequently mention: Machine Learning, Production ML Systems, Algorithms & Data Structures, GPU Computing, Optimization Methods.

How many AI roles has ByteDance posted recently?

In the past 30 days, ByteDance has posted 2 new AI-related roles. That is a -78% change versus the prior 30 days (9 → 2).

ByteDance — AI hiring signals

ByteDance currently has 112 active AI-related job listings. The majority of these roles are focused on serving infrastructure, accounting for 39% of the total, followed by agents at 27%. Engineering is the most frequent function, with research also being a significant area. The company is hiring for these positions primarily in the United States. Frequent tech tags include model_serving, inference_infra, and multimodal. In the last 30 days, ByteDance added 14 new AI roles, representing a 27% increase compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 109 active AI roles, down 43% versus the prior 4 weeks. Primary focus: Serve · Engineering.

Hiring

109 / 115

Momentum (4w)

↓-10 -43%

13 opens last 4w · 23 prior 4w

Salary range

—

Tracked since

Feb '21

last role today

Hiring velocityscroll left for older weeks

1 new role

Nov 8

1 new role

Feb 21

1 new role

Apr 4

1 new role

May 16

2 new roles

Jun 27

1 new role

Jul 4

1 new role

Oct 31

1 new role

Dec 5

1 new role

Jan 2

2 new roles

1 new role

Feb 6

1 new role

Mar 20

1 new role

Apr 24

1 new role

May 15

1 new role

2 new roles

Jul 17

1 new role

Aug 7

1 new role

2 new roles

Sep 4

1 new role

Oct 23

1 new role

Nov 13

2 new roles

Dec 11

1 new role

Apr 8

1 new role

2 new roles

May 13

1 new role

Jun 24

1 new role

Jul 8

1 new role

2 new roles

Aug 26

1 new role

Sep 2

1 new role

3 new roles

Nov 4

1 new role

2 new roles

Dec 9

1 new role

3 new roles

1 new role

Jan 6

2 new roles

1 new role

Feb 3

1 new role

2 new roles

Mar 3

2 new roles

5 new roles

4 new roles

Apr 7

1 new role

9 new roles

1 new role

May 19

4 new roles

6 new roles

Jun 2

1 new role

3 new roles

1 new role

2 new roles

Jul 7

3 new roles

1 new role

Aug 4

7 new roles

3 new roles

4 new roles

3 new roles

Sep 1

2 new roles

5 new roles

17 new roles

1 new role

Oct 13

2 new roles

1 new role

2 new roles

Nov 3

2 new roles

1 new role

Dec 8

3 new roles

1 new role

3 new roles

2 new roles

Jan 5

1 new role

5 new roles

3 new roles

4 new roles

Feb 2

17 new roles

2 new roles

13 new roles

Mar 2

4 new roles

2 new roles

3 new roles

10 new roles

Apr 6

3 new roles

14 new roles

1 new role

5 new roles

May 4

9 new roles

6 new roles

3 new roles

5 new roles

Jun 1

3 new roles

4 new roles

1 new role

Jobs (115)

109 AI · 270 total active

Title	Stage	Function	Location	First seen	AI score
Software Engineer / Researcher, AI-Native database systems The role focuses on building AI-native database systems that act as reasoning engines, retrieval platforms, and memory for AI agents. Responsibilities include architecting and implementing databases for structured, unstructured, and vectorized data, optimizing storage for embeddings and multimodal retrieval, building scalable vector search systems, developing AI-augmented query processors using LLMs, and collaborating on RAG infrastructure and agent memory backends. The role also involves driving innovation in learned index structures and self-optimizing databases, with an emphasis on systems for AI workloads.	AgentServe	Engineering	Seattle, WA	May '25	8
Senior Software Engineer / Researcher, AI-Native database systems This role focuses on building next-generation AI-native database systems that act as reasoning engines, retrieval platforms, and memory for AI agents. The engineer/researcher will architect and implement systems integrating various data types, optimize storage for embeddings, build vector search, develop AI-augmented query processors, and contribute to RAG infrastructure and LLM agent memory backends. The role also involves driving innovation in learned index structures and AI-integrated transaction systems, with opportunities for publication.	AgentServe	Engineering	Seattle, WA	May '25	8
Software Engineer/Researcher, AI-Native Database Systems Software Engineer/Researcher to build and own AI-native database systems, acting as reasoning engines, retrieval platforms, and real-time memory for AI agents. The role involves architecting systems that integrate structured, unstructured, and vectorized data, optimizing storage for embeddings, building scalable vector search, developing AI-augmented query processors using LLMs, and collaborating on RAG infrastructure and LLM agent memory backends. Innovations in learned index structures and self-optimizing databases are also key.	AgentServe	Engineering	San Jose, CA	May '25	8
Senior Research Engineer / Scientist - Storage for LLM Senior Research Engineer/Scientist focused on designing and implementing a high-performance KV cache layer for LLM inference to improve latency, throughput, and cost-efficiency. This role involves optimizing caching for transformer-based models, collaborating with inference teams, and potentially extending open-source KV stores or building custom GPU-aware caching layers.	Serve	Engineering	Seattle, WA	May '25	8
Research Engineer / Scientist - Storage for LLM Research Engineer/Scientist focused on designing and implementing a high-performance KV cache layer for LLM inference to improve latency, throughput, and cost-efficiency in transformer-based model serving.	Serve	Research	San Jose, CA	May '25	8
Senior Research Engineer / Scientist -AI for Databases Research Engineer/Scientist focused on applying AI/ML to database management systems, including query optimization, indexing, workload forecasting, and developing self-managing databases. The role involves integrating AI models into production systems and publishing research findings.	ServeData	Research	Seattle, WA	May '25	8
Research Engineer / Scientist -AI for Databases Research Engineer/Scientist role focusing on applying AI/ML to database management systems, including query optimization, indexing, workload forecasting, and developing self-managing databases. The role involves research and development, integrating AI models into production systems, analyzing large datasets, and publishing findings. Requires a PhD and strong publication record in AI/databases/systems, with experience in database internals and ML frameworks.	ServeData	Research	Seattle, WA	May '25	8
Research Engineer / Scientist -AI for Databases Research Engineer/Scientist focused on applying AI/ML to database management systems, including query optimization, indexing, and workload forecasting, with a goal of building AI-native data infrastructure and intelligent optimization. The role involves research and development, integrating models into production, and publishing findings.	ServeData	Research	San Jose, CA	May '25	8
Algorithm Tech Lead Manager - Enterprise Solution RD - San Jose Algorithm Tech Lead Manager for ByteDance's enterprise solutions, focusing on implementing LLMs, VLLMs, and AI Agents in business scenarios like intelligent recommendations and AI Copilots. The role involves designing and implementing data pipelines and algorithm applications, leading a team of algorithm engineers, and collaborating with product managers and business developers to enhance enterprise service construction.	Agent	Engineering	San Jose, CA	Apr '25	8
Machine Learning Engineer, E-commerce Governance Algorithms Machine Learning Engineer focused on e-commerce governance, using GNNs, LLMs, and time series for fraud detection, quality control, and logistics optimization. The role involves building and deploying AI solutions to improve platform health, seller compliance, and user trust.	AgentData	Engineering	Seattle, WA	Apr '25	8
Machine Learning Engineer - Inference Machine Learning Engineer focused on designing, implementing, and optimizing distributed inference infrastructure for large-scale AI models in the consumer domain, specifically for ads, feeds, and search ranking.	Serve	Engineering	San Jose, CA	Mar '25	8
Senior Research Engineer, 3D vision Research Engineer focused on AI for 3D digital content creation, specifically human face and body, involving generative models and representations like NeRF. The role involves research, development, and transferring technology to products.	Data	Research	San Jose, CA	Nov '24	8
Tech Lead Manager, Large Language Models & Generative AI Tech Lead Manager for Large Language Models & Generative AI focusing on developing long-term memory capabilities and delivering personalized chat, search, and recommendation experiences. Responsibilities include developing advanced AI algorithms, improving natural language understanding, full-stack development of large-scale ML and recommendation systems, and applying LLM techniques for information finding. Requires strong coding, analytical skills, and experience with NLU, Recall, Sort, large-scale search, recommendation, and LLM systems.	AgentServe	Engineering	San Jose, CA	Dec '23	8
Research Scientist - AI Security Research Scientist focused on AI security, investigating threats like adversarial attacks and model tampering, and developing mitigation strategies for NLP and computer vision models. Requires experience in AI/ML security research and programming skills.	Post-train	Research	San Jose, CA	Dec '23	8
Machine Learning Engineer - Data Recommendation (CapCut) Machine Learning Engineer focused on recommendation algorithms for video creation tools like CapCut, aiming to optimize content distribution and drive user growth. Requires strong ML knowledge and coding skills.	Ship	Engineering	San Jose, CA	3w ago	7
Machine Learning Engineer - AI Compiler Optimization Machine Learning Engineer focused on AI compiler optimization for recommendation systems. Responsibilities include building and implementing compilation optimization systems, collaborating on hardware-software co-design, and adapting recommendation models from PyTorch to the engine to maximize hardware efficiency and simplify deployment.	Serve	Engineering	San Jose, CA	6w ago	7
Research Scientist, Operations Research (Infrastructure Lab) Research Scientist role focusing on operations research for AI-native data infrastructure. The role involves designing and optimizing vector indexing algorithms for vector databases, and exploring the integration of LLM, RL, and Agent technologies into operations research optimization pipelines. This includes developing AI for infrastructure optimization and LLM-based tooling like NL2SQL.	AgentData	Research	San Jose, CA	7w ago	7
Senior Research Scientist, Operations Research (Infrastructure Lab) Research Scientist role focused on designing and optimizing state-of-the-art vector indexing algorithms for next-generation vector database infrastructure, and exploring AI for Operations Research by integrating LLM, RL, and Agent technologies into optimization pipelines.	AgentData	Research	San Jose, CA	7w ago	7
Senior Research Scientist, Operations Research (Infrastructure Lab) Research Scientist role focused on operations research for AI-native data infrastructure, including next-generation databases, AI for infra optimization, and LLM-based tooling. The role involves designing and optimizing vector indexing algorithms and exploring AI integration into operations research pipelines.	DataAgent	Research	Seattle, WA	7w ago	7
Research Scientist, Operations Research (Infrastructure Lab) Research Scientist role focused on designing and optimizing state-of-the-art vector indexing algorithms and integrating AI (LLM, RL, Agent) into operations research optimization pipelines for AI data centers and cloud resource scheduling. The role involves building next-generation AI-native data infrastructure, including vector databases and intelligent algorithms for infrastructure optimization.	AgentData	Research	Seattle, WA	7w ago	7
Tech Lead - Machine Learning Platform Engineer Machine Learning Platform Engineer to develop and maintain a platform supporting deep learning models for code development, testing, training, model deployment, and other core business functions. The platform is foundational for recommendation, advertising, and search systems, involving recommended systems and distributed training of large-scale deep learning models.	ServeData	Engineering	San Jose, CA	Apr 24	7
Recommender System Engineer, AI-Driven (PICO-Lab) - San Jose Recommender System Engineer focused on building and productionizing recommendation models, designing low-latency serving pipelines, and running experiments for XR products.	Ship	Engineering	San Jose, CA	Apr 24	7
Machine Learning Engineer - Orchestration Machine Learning Engineer focused on optimizing resource efficiency in distributed orchestration and scheduling for training and inference systems, particularly for large-scale recommendation models. The role involves building and optimizing training system architectures and online inference architectures, integrating with MLops processes, and working within Kubernetes/Godel ecosystems.	ServePost-train	Engineering	San Jose, CA	Apr 6	7
Edge ML Software Engineer (Model Optimization-PICO) - San Jose Software Engineer focused on optimizing and deploying ML models for edge NPUs in VR/AR devices, involving quantization, performance profiling, and hardware-aware optimizations to meet latency, memory, and power constraints.	Serve	Engineering	San Jose, CA	Apr 1	7
Edge ML Software Engineer (Compiler-PICO) - San Jose Software Engineer specializing in ML compilers for edge NPU architectures, focusing on optimizing latency, memory, power, and thermal constraints for ML inference on target hardware. Requires strong compiler and deep learning model understanding, with preferred experience in quantization and ML compiler stacks.	Serve	Engineering	San Jose, CA	Apr 1	7
Edge ML Software Engineer (System Modeling-PICO) - San Jose Develop transaction-level models of edge NPU architectures for ML workloads (CNNs, Transformers) to simulate execution, analyze performance, and optimize for latency, memory, and power targets. Requires strong C/C++ and System C proficiency, computer architecture understanding, and experience with ML accelerator modeling.	Serve	Engineering	San Jose, CA	Apr 1	7
Vision Algorithm Evaluation Engineer - PICO Lab - San Jose ByteDance's PICO Lab is seeking a Vision Algorithm Evaluation Engineer to design and implement evaluation frameworks for computer vision and imaging algorithms in VR/MR/AR devices. This role involves creating test scenarios, defining metrics, analyzing algorithm performance, and providing data-driven recommendations to guide technology and product decisions.	Eval Gate	Engineering	San Jose, CA	Feb 24	7
LLM AIOps Development Engineer - Data Center Networking Develops an AIOps platform for data center networking, focusing on building an intelligent diagnostics system, exploring LLM/Agent applications for operations, and establishing capacity prediction. Integrates streaming telemetry and applies ML/DL for anomaly detection and root cause analysis.	AgentData	Engineering	Seattle, WA	Feb 10	7
LLM AIOps Development Engineer - Data Center Networking Develops and implements an AIOps platform for data center networking, leveraging LLMs and agents for intelligent diagnostics, automated remediation, and predictive capabilities. Focuses on building a panoramic network observability platform and applying ML/DL for anomaly detection and root cause analysis.	AgentData	Engineering	San Jose, CA	Feb 10	7
Software Development Engineer - Full Stack - PICO Lab - San Jose Software Development Engineer role focused on prototyping AI and XR product concepts, specifically agentic AI on mobile and smart devices. The role involves rapid software development and iteration across various platforms to validate product features and user experiences.	Agent	Engineering	San Jose, CA	Feb 9	7
Tech Lead Software Engineer - AI Compute Infrastructure The Tech Lead Software Engineer will design and build large-scale, container-based cluster management and orchestration systems with extreme performance, scalability, and resilience, focusing on GPU and AI accelerator infrastructure for LLM inference. This role involves architecting next-generation cloud-native systems, collaborating on inference solutions using various LLM engines, and contributing to open-source projects.	Serve	Engineering	San Jose, CA	Jan 9	7
Tech Lead Software Engineer - AI Compute Infrastructure Tech Lead Software Engineer focused on building and maintaining large-scale, Kubernetes-native LLM inference infrastructure (AIBrix). The role involves designing and architecting GPU-optimized orchestration systems for hyper-scale environments, collaborating on inference solutions using various LLM engines, and staying current with AI/ML infrastructure advancements.	Serve	Engineering	Seattle, WA	Jan 9	7
Research Scientist - DPU & AI Infra Research Scientist focused on DPU and AI infrastructure, aiming to accelerate distributed training and inference by co-designing software and hardware solutions. Explores AI/ML infrastructure acceleration leveraging DPUs, GPUs, and custom hardware.	ServeData	Research	San Jose, CA	Sep '25	7
Senior Research Scientist - DPU & AI Infra Research Scientist role focused on designing and developing DPU network software for AI/ML workloads, optimizing distributed training and inference, and exploring software-hardware co-design for cloud and AI computing infrastructure.	ServeData	Research	Seattle, WA	Sep '25	7
Research Scientist - DPU & AI Infra Research Scientist role focused on designing and developing DPU network software for AI/ML workloads, including distributed training and inference acceleration, and software-hardware co-design.	ServeData	Research	Seattle, WA	Sep '25	7
Tech Lead, Research Scientist - DPU & AI Infra Tech Lead, Research Scientist focused on DPU and AI infrastructure, optimizing distributed training and inference by leveraging DPUs, GPUs, and custom hardware. The role involves designing and developing high-performance network software, collaborating on software-hardware co-design, and driving end-to-end performance optimization.	ServeData	Engineering	Seattle, WA	Sep '25	7
Tech Lead, Research Scientist - DPU & AI Infra This role focuses on designing and developing DPU network software and exploring AI/ML infrastructure acceleration using DPUs, GPUs, and custom hardware to optimize distributed training and inference. It involves software-hardware co-design and end-to-end performance optimization for cloud-scale computing.	ServeData	Research	San Jose, CA	Sep '25	7
Senior Cloud Acceleration Engineer – DPU & AI Infra Senior Cloud Acceleration Engineer focused on DPU and AI infrastructure, involving software-hardware co-design to optimize distributed training and inference performance. Requires strong C/C++ and Linux systems development, with experience in networking, distributed systems, or AI/ML systems.	ServeAgent	Engineering	Seattle, WA	Sep '25	7
Senior Software Engineer - AI Compute Infrastructure Senior Software Engineer to design and build large-scale, container-based cluster management and orchestration systems for LLM inference, focusing on performance, scalability, and cost-efficiency. The role involves architecting GPU and AI accelerator infrastructure, collaborating on inference solutions using various LLM engines, and staying current with AI/ML infrastructure advancements.	Serve	Engineering	Seattle, WA	Sep '25	7
Software Engineer - AI Compute Infrastructure Software Engineer focused on building and maintaining large-scale, Kubernetes-native AI compute infrastructure for LLM inference, emphasizing performance, scalability, and cost-efficiency. The role involves architecting GPU-optimized systems and collaborating on inference solutions using various LLM engines.	Serve	Engineering	Seattle, WA	Sep '25	7
Software Engineer - AI Compute Infrastructure Software Engineer focused on building and maintaining large-scale, Kubernetes-native LLM inference infrastructure (AIBrix) with a focus on performance, scalability, and cost-efficiency. The role involves architecting GPU-optimized systems, collaborating on inference solutions using various LLM engines, and contributing to open-source projects.	Serve	Engineering	San Jose, CA	Sep '25	7
Cloud Acceleration Engineer – DPU & AI Infra This role focuses on designing and developing DPU network software and exploring AI/ML infrastructure acceleration, specifically for distributed training and inference. It involves software-hardware co-design and performance optimization of systems related to AI computing.	ServeData	Engineering	Seattle, WA	Sep '25	7
Cloud Acceleration Engineer – DPU & AI Infra ByteDance is seeking a Cloud Acceleration Engineer to focus on DPU and AI infrastructure. The role involves designing and developing high-performance DPU network software, collaborating on software-hardware co-design, and exploring AI/ML infrastructure acceleration for distributed training and inference. The position requires strong C/C++ and Linux systems development skills, with a background in areas like software-hardware co-design, distributed systems, networking, or AI/ML systems.	ServeData	Engineering	San Jose, CA	Sep '25	7
Senior Software Engineer, AI Infrastructure - Developer Tooling Senior Software Engineer to build AI-powered developer tools, focusing on retrieval infrastructure (RAG), a coding agent with multi-step generation and tool use, and evaluation frameworks for measuring effectiveness. Requires strong Python/TypeScript, systems-level language experience, and practical LLM integration.	AgentData	Engineering	San Jose, CA	Sep '25	7
Tech Lead, AML Orchestration Tech Lead for an Applied Machine Learning (AML) team focused on building and advancing distributed orchestration platforms for recommendation systems, ads ranking, and search ranking. The role involves leading a team of ML Engineers, setting technical strategy for resource efficiency, distributed training, and online inference systems, and optimizing large-scale distributed orchestration and scheduling strategies.	ServeAgent	Engineering	San Jose, CA	Sep '25	7
Video Codec Algorithm Engineer - Multimedia Lab Research role focused on designing and developing AI-powered video codec algorithms, optimizing performance, and pushing the boundaries of video coding technologies. The role involves foundational research into large models and next-generation standards for multimedia content.	Data	Research	San Diego, CA	Aug '25	7
Machine Learning Engineer (User Growth & Intelligent Marketing) - Global e-Commerce Machine Learning Engineer focused on optimizing user growth and intelligent marketing algorithms for TikTok's e-commerce platform. This role involves developing and implementing solutions for personalized recommendations, user value modeling, uplift modeling, and marketing efficiency to drive e-commerce GMV growth.	Ship	Engineering	Seattle, WA	Aug '25	7
Machine Learning Engineer, Search - Local Services Team Machine Learning Engineer for ByteDance's Local Services team, focusing on enhancing user discovery and ecosystem growth for hospitality, dining, and leisure experiences. The role involves leveraging large-scale ML for search and recommendation systems, aiming to improve personalized relevance, CTR/CVR prediction, and conversion efficiency for billions of users. Responsibilities include designing and implementing full-stack search algorithms, query analysis, ranking, and personalized behavior modeling.	Ship	Engineering	Seattle, WA	Jul '25	7
Machine Learning Platform Engineer, Applied Machine Learning Team Machine Learning Platform Engineer to develop and maintain a platform supporting deep learning models for code development, testing, training, model deployment, and other core business functions. The role supports recommendation, advertising, and search systems, focusing on distributed training of large-scale deep learning models.	ServeData	Engineering	San Jose, CA	May '25	7
Multimodal AI Algorithm Expert-EMG / Interaction Perception, PICO Research and develop deep learning models for multimodal data fusion using sEMG, computer vision, and IMU technologies, focusing on signal acquisition, processing, and handling sensor noise for enhanced human-virtual world interaction.	Data	Research	San Jose, CA	May '25	7

Frequently asked questions

What AI roles is ByteDance hiring for?
ByteDance currently has 115 active AI-related roles in our index. The most common open titles are: Cloud Acceleration Engineer – DPU & AI Infra (2), LLM AIOps Development Engineer - Data Center Networking (2), Multimodal Model Training and Inference Optimization Engineer (2), Research Engineer - LLM Training Infrastructure - Seed Infra (2), Research Engineer - LLM/VLM Inference Optimization (Seed Infra) (2). Most positions are in Engineering and Research.
What stage of AI development does ByteDance focus on?
ByteDance's active AI hiring is concentrated in: serving infrastructure (38%), agents (25%), post-training (12%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Where is ByteDance hiring AI talent?
ByteDance is hiring AI talent in: United States (115 roles).
What skills does ByteDance look for in AI roles?
Job postings at ByteDance most frequently mention: Machine Learning, Production ML Systems, Algorithms & Data Structures, GPU Computing, Optimization Methods.
How many AI roles has ByteDance posted recently?
In the past 30 days, ByteDance has posted 2 new AI-related roles. That is a -78% change versus the prior 30 days (9 → 2).

Title

Stage

Function

Location

First seen

AI score

Software Engineer / Researcher, AI-Native database systems

The role focuses on building AI-native database systems that act as reasoning engines, retrieval platforms, and memory for AI agents. Responsibilities include architecting and implementing databases for structured, unstructured, and vectorized data, optimizing storage for embeddings and multimodal retrieval, building scalable vector search systems, developing AI-augmented query processors using LLMs, and collaborating on RAG infrastructure and agent memory backends. The role also involves driving innovation in learned index structures and self-optimizing databases, with an emphasis on systems for AI workloads.

AgentServe

Engineering

Seattle, WA

May '25

Senior Software Engineer / Researcher, AI-Native database systems

This role focuses on building next-generation AI-native database systems that act as reasoning engines, retrieval platforms, and memory for AI agents. The engineer/researcher will architect and implement systems integrating various data types, optimize storage for embeddings, build vector search, develop AI-augmented query processors, and contribute to RAG infrastructure and LLM agent memory backends. The role also involves driving innovation in learned index structures and AI-integrated transaction systems, with opportunities for publication.

AgentServe

Engineering

Seattle, WA

May '25

Software Engineer/Researcher, AI-Native Database Systems

Software Engineer/Researcher to build and own AI-native database systems, acting as reasoning engines, retrieval platforms, and real-time memory for AI agents. The role involves architecting systems that integrate structured, unstructured, and vectorized data, optimizing storage for embeddings, building scalable vector search, developing AI-augmented query processors using LLMs, and collaborating on RAG infrastructure and LLM agent memory backends. Innovations in learned index structures and self-optimizing databases are also key.

AgentServe

Engineering

San Jose, CA

May '25

Senior Research Engineer / Scientist - Storage for LLM

Senior Research Engineer/Scientist focused on designing and implementing a high-performance KV cache layer for LLM inference to improve latency, throughput, and cost-efficiency. This role involves optimizing caching for transformer-based models, collaborating with inference teams, and potentially extending open-source KV stores or building custom GPU-aware caching layers.

Serve

Engineering

Seattle, WA

May '25

Research Engineer / Scientist - Storage for LLM

Research Engineer/Scientist focused on designing and implementing a high-performance KV cache layer for LLM inference to improve latency, throughput, and cost-efficiency in transformer-based model serving.

Serve

Research

San Jose, CA

May '25

Senior Research Engineer / Scientist -AI for Databases

Research Engineer/Scientist focused on applying AI/ML to database management systems, including query optimization, indexing, workload forecasting, and developing self-managing databases. The role involves integrating AI models into production systems and publishing research findings.

ServeData

Research

Seattle, WA

May '25

Research Engineer / Scientist -AI for Databases

Research Engineer/Scientist role focusing on applying AI/ML to database management systems, including query optimization, indexing, workload forecasting, and developing self-managing databases. The role involves research and development, integrating AI models into production systems, analyzing large datasets, and publishing findings. Requires a PhD and strong publication record in AI/databases/systems, with experience in database internals and ML frameworks.

ServeData

Research

Seattle, WA

May '25

Research Engineer / Scientist -AI for Databases

Research Engineer/Scientist focused on applying AI/ML to database management systems, including query optimization, indexing, and workload forecasting, with a goal of building AI-native data infrastructure and intelligent optimization. The role involves research and development, integrating models into production, and publishing findings.

ServeData

Research

San Jose, CA

May '25

Algorithm Tech Lead Manager - Enterprise Solution RD - San Jose

Algorithm Tech Lead Manager for ByteDance's enterprise solutions, focusing on implementing LLMs, VLLMs, and AI Agents in business scenarios like intelligent recommendations and AI Copilots. The role involves designing and implementing data pipelines and algorithm applications, leading a team of algorithm engineers, and collaborating with product managers and business developers to enhance enterprise service construction.

Agent

Engineering

San Jose, CA

Apr '25

Machine Learning Engineer, E-commerce Governance Algorithms

Machine Learning Engineer focused on e-commerce governance, using GNNs, LLMs, and time series for fraud detection, quality control, and logistics optimization. The role involves building and deploying AI solutions to improve platform health, seller compliance, and user trust.

AgentData

Engineering

Seattle, WA

Apr '25

Machine Learning Engineer - Inference

Machine Learning Engineer focused on designing, implementing, and optimizing distributed inference infrastructure for large-scale AI models in the consumer domain, specifically for ads, feeds, and search ranking.

Serve

Engineering

San Jose, CA

Mar '25

Senior Research Engineer, 3D vision

Research Engineer focused on AI for 3D digital content creation, specifically human face and body, involving generative models and representations like NeRF. The role involves research, development, and transferring technology to products.

Data

Research

San Jose, CA

Nov '24

Tech Lead Manager, Large Language Models & Generative AI

Tech Lead Manager for Large Language Models & Generative AI focusing on developing long-term memory capabilities and delivering personalized chat, search, and recommendation experiences. Responsibilities include developing advanced AI algorithms, improving natural language understanding, full-stack development of large-scale ML and recommendation systems, and applying LLM techniques for information finding. Requires strong coding, analytical skills, and experience with NLU, Recall, Sort, large-scale search, recommendation, and LLM systems.

AgentServe

Engineering

San Jose, CA

Dec '23

Research Scientist - AI Security

Research Scientist focused on AI security, investigating threats like adversarial attacks and model tampering, and developing mitigation strategies for NLP and computer vision models. Requires experience in AI/ML security research and programming skills.

Post-train

Research

San Jose, CA

Dec '23

Machine Learning Engineer - Data Recommendation (CapCut)

Machine Learning Engineer focused on recommendation algorithms for video creation tools like CapCut, aiming to optimize content distribution and drive user growth. Requires strong ML knowledge and coding skills.

Ship

Engineering

San Jose, CA

3w ago

Machine Learning Engineer - AI Compiler Optimization

Machine Learning Engineer focused on AI compiler optimization for recommendation systems. Responsibilities include building and implementing compilation optimization systems, collaborating on hardware-software co-design, and adapting recommendation models from PyTorch to the engine to maximize hardware efficiency and simplify deployment.

Serve

Engineering

San Jose, CA

6w ago

Research Scientist, Operations Research (Infrastructure Lab)

Research Scientist role focusing on operations research for AI-native data infrastructure. The role involves designing and optimizing vector indexing algorithms for vector databases, and exploring the integration of LLM, RL, and Agent technologies into operations research optimization pipelines. This includes developing AI for infrastructure optimization and LLM-based tooling like NL2SQL.

AgentData

Research

San Jose, CA

7w ago

Senior Research Scientist, Operations Research (Infrastructure Lab)

Research Scientist role focused on designing and optimizing state-of-the-art vector indexing algorithms for next-generation vector database infrastructure, and exploring AI for Operations Research by integrating LLM, RL, and Agent technologies into optimization pipelines.

AgentData

Research

San Jose, CA

7w ago

Senior Research Scientist, Operations Research (Infrastructure Lab)

Research Scientist role focused on operations research for AI-native data infrastructure, including next-generation databases, AI for infra optimization, and LLM-based tooling. The role involves designing and optimizing vector indexing algorithms and exploring AI integration into operations research pipelines.

DataAgent

Research

Seattle, WA

7w ago

Research Scientist, Operations Research (Infrastructure Lab)

Research Scientist role focused on designing and optimizing state-of-the-art vector indexing algorithms and integrating AI (LLM, RL, Agent) into operations research optimization pipelines for AI data centers and cloud resource scheduling. The role involves building next-generation AI-native data infrastructure, including vector databases and intelligent algorithms for infrastructure optimization.

AgentData

Research

Seattle, WA

7w ago

Tech Lead - Machine Learning Platform Engineer

Machine Learning Platform Engineer to develop and maintain a platform supporting deep learning models for code development, testing, training, model deployment, and other core business functions. The platform is foundational for recommendation, advertising, and search systems, involving recommended systems and distributed training of large-scale deep learning models.

ServeData

Engineering

San Jose, CA

Apr 24

Recommender System Engineer, AI-Driven (PICO-Lab) - San Jose

Recommender System Engineer focused on building and productionizing recommendation models, designing low-latency serving pipelines, and running experiments for XR products.

Ship

Engineering

San Jose, CA

Apr 24

Machine Learning Engineer - Orchestration

Machine Learning Engineer focused on optimizing resource efficiency in distributed orchestration and scheduling for training and inference systems, particularly for large-scale recommendation models. The role involves building and optimizing training system architectures and online inference architectures, integrating with MLops processes, and working within Kubernetes/Godel ecosystems.

ServePost-train

Engineering

San Jose, CA

Apr 6

Edge ML Software Engineer (Model Optimization-PICO) - San Jose

Software Engineer focused on optimizing and deploying ML models for edge NPUs in VR/AR devices, involving quantization, performance profiling, and hardware-aware optimizations to meet latency, memory, and power constraints.

Serve

Engineering

San Jose, CA

Apr 1

Edge ML Software Engineer (Compiler-PICO) - San Jose

Software Engineer specializing in ML compilers for edge NPU architectures, focusing on optimizing latency, memory, power, and thermal constraints for ML inference on target hardware. Requires strong compiler and deep learning model understanding, with preferred experience in quantization and ML compiler stacks.

Serve

Engineering

San Jose, CA

Apr 1

Edge ML Software Engineer (System Modeling-PICO) - San Jose

Develop transaction-level models of edge NPU architectures for ML workloads (CNNs, Transformers) to simulate execution, analyze performance, and optimize for latency, memory, and power targets. Requires strong C/C++ and System C proficiency, computer architecture understanding, and experience with ML accelerator modeling.

Serve

Engineering

San Jose, CA

Apr 1

Vision Algorithm Evaluation Engineer - PICO Lab - San Jose

ByteDance's PICO Lab is seeking a Vision Algorithm Evaluation Engineer to design and implement evaluation frameworks for computer vision and imaging algorithms in VR/MR/AR devices. This role involves creating test scenarios, defining metrics, analyzing algorithm performance, and providing data-driven recommendations to guide technology and product decisions.

Eval Gate

Engineering

San Jose, CA

Feb 24

LLM AIOps Development Engineer - Data Center Networking

Develops an AIOps platform for data center networking, focusing on building an intelligent diagnostics system, exploring LLM/Agent applications for operations, and establishing capacity prediction. Integrates streaming telemetry and applies ML/DL for anomaly detection and root cause analysis.

AgentData

Engineering

Seattle, WA

Feb 10

LLM AIOps Development Engineer - Data Center Networking

Develops and implements an AIOps platform for data center networking, leveraging LLMs and agents for intelligent diagnostics, automated remediation, and predictive capabilities. Focuses on building a panoramic network observability platform and applying ML/DL for anomaly detection and root cause analysis.

AgentData

Engineering

San Jose, CA

Feb 10

Software Development Engineer - Full Stack - PICO Lab - San Jose

Software Development Engineer role focused on prototyping AI and XR product concepts, specifically agentic AI on mobile and smart devices. The role involves rapid software development and iteration across various platforms to validate product features and user experiences.

Agent

Engineering

San Jose, CA

Feb 9

Tech Lead Software Engineer - AI Compute Infrastructure

The Tech Lead Software Engineer will design and build large-scale, container-based cluster management and orchestration systems with extreme performance, scalability, and resilience, focusing on GPU and AI accelerator infrastructure for LLM inference. This role involves architecting next-generation cloud-native systems, collaborating on inference solutions using various LLM engines, and contributing to open-source projects.

Serve

Engineering

San Jose, CA

Jan 9

Tech Lead Software Engineer - AI Compute Infrastructure

Tech Lead Software Engineer focused on building and maintaining large-scale, Kubernetes-native LLM inference infrastructure (AIBrix). The role involves designing and architecting GPU-optimized orchestration systems for hyper-scale environments, collaborating on inference solutions using various LLM engines, and staying current with AI/ML infrastructure advancements.

Serve

Engineering

Seattle, WA

Jan 9

Research Scientist - DPU & AI Infra

Research Scientist focused on DPU and AI infrastructure, aiming to accelerate distributed training and inference by co-designing software and hardware solutions. Explores AI/ML infrastructure acceleration leveraging DPUs, GPUs, and custom hardware.

ServeData

Research

San Jose, CA

Sep '25

Senior Research Scientist - DPU & AI Infra

Research Scientist role focused on designing and developing DPU network software for AI/ML workloads, optimizing distributed training and inference, and exploring software-hardware co-design for cloud and AI computing infrastructure.

ServeData

Research

Seattle, WA

Sep '25

Research Scientist - DPU & AI Infra

Research Scientist role focused on designing and developing DPU network software for AI/ML workloads, including distributed training and inference acceleration, and software-hardware co-design.

ServeData

Research

Seattle, WA

Sep '25

Tech Lead, Research Scientist - DPU & AI Infra

Tech Lead, Research Scientist focused on DPU and AI infrastructure, optimizing distributed training and inference by leveraging DPUs, GPUs, and custom hardware. The role involves designing and developing high-performance network software, collaborating on software-hardware co-design, and driving end-to-end performance optimization.

ServeData

Engineering

Seattle, WA

Sep '25

Tech Lead, Research Scientist - DPU & AI Infra

This role focuses on designing and developing DPU network software and exploring AI/ML infrastructure acceleration using DPUs, GPUs, and custom hardware to optimize distributed training and inference. It involves software-hardware co-design and end-to-end performance optimization for cloud-scale computing.

ServeData

Research

San Jose, CA

Sep '25

Senior Cloud Acceleration Engineer – DPU & AI Infra

Senior Cloud Acceleration Engineer focused on DPU and AI infrastructure, involving software-hardware co-design to optimize distributed training and inference performance. Requires strong C/C++ and Linux systems development, with experience in networking, distributed systems, or AI/ML systems.

ServeAgent

Engineering

Seattle, WA

Sep '25

Senior Software Engineer - AI Compute Infrastructure

Senior Software Engineer to design and build large-scale, container-based cluster management and orchestration systems for LLM inference, focusing on performance, scalability, and cost-efficiency. The role involves architecting GPU and AI accelerator infrastructure, collaborating on inference solutions using various LLM engines, and staying current with AI/ML infrastructure advancements.

Serve

Engineering

Seattle, WA

Sep '25

Software Engineer - AI Compute Infrastructure

Software Engineer focused on building and maintaining large-scale, Kubernetes-native AI compute infrastructure for LLM inference, emphasizing performance, scalability, and cost-efficiency. The role involves architecting GPU-optimized systems and collaborating on inference solutions using various LLM engines.

Serve

Engineering

Seattle, WA

Sep '25

Software Engineer - AI Compute Infrastructure

Software Engineer focused on building and maintaining large-scale, Kubernetes-native LLM inference infrastructure (AIBrix) with a focus on performance, scalability, and cost-efficiency. The role involves architecting GPU-optimized systems, collaborating on inference solutions using various LLM engines, and contributing to open-source projects.

Serve

Engineering

San Jose, CA

Sep '25

Cloud Acceleration Engineer – DPU & AI Infra

This role focuses on designing and developing DPU network software and exploring AI/ML infrastructure acceleration, specifically for distributed training and inference. It involves software-hardware co-design and performance optimization of systems related to AI computing.

ServeData

Engineering

Seattle, WA

Sep '25

Cloud Acceleration Engineer – DPU & AI Infra

ByteDance is seeking a Cloud Acceleration Engineer to focus on DPU and AI infrastructure. The role involves designing and developing high-performance DPU network software, collaborating on software-hardware co-design, and exploring AI/ML infrastructure acceleration for distributed training and inference. The position requires strong C/C++ and Linux systems development skills, with a background in areas like software-hardware co-design, distributed systems, networking, or AI/ML systems.

ServeData

Engineering

San Jose, CA

Sep '25

Senior Software Engineer, AI Infrastructure - Developer Tooling

Senior Software Engineer to build AI-powered developer tools, focusing on retrieval infrastructure (RAG), a coding agent with multi-step generation and tool use, and evaluation frameworks for measuring effectiveness. Requires strong Python/TypeScript, systems-level language experience, and practical LLM integration.

AgentData

Engineering

San Jose, CA

Sep '25

Tech Lead, AML Orchestration

Tech Lead for an Applied Machine Learning (AML) team focused on building and advancing distributed orchestration platforms for recommendation systems, ads ranking, and search ranking. The role involves leading a team of ML Engineers, setting technical strategy for resource efficiency, distributed training, and online inference systems, and optimizing large-scale distributed orchestration and scheduling strategies.

ServeAgent

Engineering

San Jose, CA

Sep '25

Video Codec Algorithm Engineer - Multimedia Lab

Research role focused on designing and developing AI-powered video codec algorithms, optimizing performance, and pushing the boundaries of video coding technologies. The role involves foundational research into large models and next-generation standards for multimedia content.

Data

Research

San Diego, CA

Aug '25

Machine Learning Engineer (User Growth & Intelligent Marketing) - Global e-Commerce

Machine Learning Engineer focused on optimizing user growth and intelligent marketing algorithms for TikTok's e-commerce platform. This role involves developing and implementing solutions for personalized recommendations, user value modeling, uplift modeling, and marketing efficiency to drive e-commerce GMV growth.

Ship

Engineering

Seattle, WA

Aug '25

Machine Learning Engineer, Search - Local Services Team

Machine Learning Engineer for ByteDance's Local Services team, focusing on enhancing user discovery and ecosystem growth for hospitality, dining, and leisure experiences. The role involves leveraging large-scale ML for search and recommendation systems, aiming to improve personalized relevance, CTR/CVR prediction, and conversion efficiency for billions of users. Responsibilities include designing and implementing full-stack search algorithms, query analysis, ranking, and personalized behavior modeling.

Ship

Engineering

Seattle, WA

Jul '25

Machine Learning Platform Engineer, Applied Machine Learning Team

Machine Learning Platform Engineer to develop and maintain a platform supporting deep learning models for code development, testing, training, model deployment, and other core business functions. The role supports recommendation, advertising, and search systems, focusing on distributed training of large-scale deep learning models.

ServeData

Engineering

San Jose, CA

May '25

Multimodal AI Algorithm Expert-EMG / Interaction Perception, PICO

Research and develop deep learning models for multimodal data fusion using sEMG, computer vision, and IMU technologies, focusing on signal acquisition, processing, and handling sensor noise for enhanced human-virtual world interaction.

Data

Research

San Jose, CA

May '25