Tech Lead, Software Engineer - AI Agent Memory Infrastructure

ByteDance · Big Tech · San Jose, CA · R&D

This role focuses on building and scaling the core memory infrastructure for AI agents, enabling personalized and context-aware AI experiences. You would design and operate large-scale, low-latency systems for memory storage, retrieval, and optimization, working at the intersection of LLMs, data systems, and context engineering, with a particular focus on multimodal data fusion.

What you'd actually do

  1. Design, build, and evolve the next-generation memory infrastructure for AI agents, developing a unified platform that supports long-term memory, conversational memory, and task-oriented memory.
  2. Architect and optimize memory system pipelines for large-scale, low-latency, and high-availability environments, including data ingestion, storage, indexing, retrieval, updating, compression, and forgetting mechanisms to support real-time inference and personalized interactions.
  3. Explore key challenges at the intersection of large language models, context engineering, and data management, including memory representation, retrieval and ranking, conflict resolution, summarization and fusion, and memory lifecycle management.
  4. Design unified memory models and processing workflows for multimodal data (text, image, audio, behavioral signals), enhancing agents’ long-term consistency, personalization, and task completion in complex scenarios.
  5. Collaborate closely with model, application, and platform teams to productionize memory capabilities, and continuously optimize system performance across quality, latency, cost, reliability, and safety.
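The pipeline sketched in the responsibilities above (ingest → index → retrieve/rank → forget) can be illustrated with a minimal, self-contained Python sketch. All names, the bag-of-words "embedding", and the recency-decay forgetting policy are illustrative assumptions for the sake of the example, not the actual ByteDance system; a production system would use learned embeddings, a vector/graph index, and far richer update and compression mechanisms.

```python
# Toy sketch of an agent memory store: ingest, index, recency-weighted
# retrieval, and capacity-based forgetting. Illustrative only.
import math
import time
from collections import Counter
from dataclasses import dataclass, field


def embed(text: str) -> Counter:
    """Toy embedding: a bag-of-words term-frequency vector."""
    return Counter(text.lower().split())


def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


@dataclass
class MemoryItem:
    text: str
    vector: Counter
    created_at: float = field(default_factory=time.time)


class MemoryStore:
    def __init__(self, capacity: int = 100, half_life_s: float = 3600.0):
        self.items: list[MemoryItem] = []
        self.capacity = capacity
        self.half_life_s = half_life_s  # recency-decay half-life (seconds)

    def ingest(self, text: str) -> None:
        """Ingest and index a memory; forget the oldest if over capacity."""
        self.items.append(MemoryItem(text, embed(text)))
        if len(self.items) > self.capacity:
            # Forgetting mechanism: drop the stalest item (LRU-style).
            self.items.sort(key=lambda m: m.created_at)
            self.items.pop(0)

    def retrieve(self, query: str, k: int = 3) -> list[str]:
        """Rank by similarity decayed by age; return the top-k texts."""
        q = embed(query)
        now = time.time()

        def score(m: MemoryItem) -> float:
            decay = 0.5 ** ((now - m.created_at) / self.half_life_s)
            return cosine(q, m.vector) * decay

        return [m.text for m in sorted(self.items, key=score, reverse=True)[:k]]


store = MemoryStore(capacity=4)
store.ingest("user prefers dark mode in the app")
store.ingest("user asked about flight prices to Tokyo")
store.ingest("user is allergic to peanuts")
results = store.retrieve("user dark mode preference", k=1)
```

The same store shape extends naturally to the long-term, conversational, and task-oriented memory types the role mentions, e.g. by tagging each `MemoryItem` with a memory type and filtering at retrieval time.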

Skills

Required

  • distributed systems
  • databases
  • information retrieval systems
  • AI infrastructure
  • system design
  • production engineering
  • Go
  • Python
  • C++
  • LLM applications
  • embeddings
  • retrieval-augmented generation (RAG)
  • context engineering
  • retrieval systems
  • long-term state management
  • memory extraction and representation
  • vector/graph indexing
  • retrieval and ranking
  • memory updating
  • compression and forgetting
  • multimodal memory fusion

Nice to have

  • agent memory systems
  • user profiling
  • recommendation/search feature platforms
  • knowledge base systems
  • mem0
  • memOS
  • memU
  • multimodal data processing
  • online inference systems
  • personalized agents
  • long-term user state modeling
  • system performance optimization
  • latency optimization
  • cost optimization
  • scalability optimization

What the JD emphasized

  • large-scale, low-latency, and highly reliable memory infrastructure
  • large-scale, low-latency, and high-availability environments
  • real-time inference
  • multimodal data

Other signals

  • AI Agent Memory Infrastructure
  • unified platform for long-term, conversational, and task-oriented memory
  • intersection of LLMs, data systems, and context engineering
  • memory representation, retrieval, and multimodal fusion