Principal Ai/ml Researcher / Engineer Reasoning, Planning, and Decision-making Systems

Airbnb Airbnb · Consumer · United States · Software Engineering

This role focuses on researching and engineering advanced AI systems for reasoning, planning, and decision-making within a consumer platform. It involves architecting post-training intelligence frameworks, integrating Large Reasoning Models (LRMs) with Knowledge Graphs, and applying Reinforcement Learning (RL) for adaptive planning and control. The core of the role is to build cognitive AI systems that combine foundational models, memory, and recursive planning strategies, with a significant emphasis on designing and deploying multi-agent systems for distributed intelligence and complex coordination. The ultimate goal is to transform the platform from prediction-based to one capable of deliberation, foresight, and agency, impacting various user and internal workflows.

What you'd actually do

  1. Drive foundational and applied research in reasoning engines, planning architectures, and decision-making frameworks at scale in order to incorporate genAI into the ranking / recommendation / personalization stack in both single model to multi-agent ( system ) level intelligence with objective to grow the business (new user growth, abandoned user, long tailed user) in existing and new business areas while supporting Multi-Modal NL → Conversational Interfaces.
  2. Advance techniques in LLM/LRM post-training, reinforcement learning–based decisioning, and knowledge-integrated agents.
  3. Design methods for plan induction, value estimation, and contingency modeling within intelligent agents.
  4. Explore and validate protocols for distributed reasoning and joint planning among cooperative agents in multi-agent systems.
  5. Architect RPD systems that integrate post-trained LLMs/LRMs, graph-structured memory (e.g., KGs), and RL-driven controllers.

Skills

Required

  • Deep experience in reasoning, planning, and decision-making systems
  • Architecting post-training intelligence frameworks
  • Integrating Large Reasoning Models (LRMs) with Knowledge Graphs
  • Applying Reinforcement Learning (RL)
  • Designing and deploying multi-agent systems
  • Foundational and applied research in reasoning engines, planning architectures, and decision-making frameworks
  • LLM/LRM post-training
  • Knowledge-integrated agents
  • Plan induction, value estimation, and contingency modeling
  • Distributed reasoning and joint planning
  • Architecting RPD systems integrating post-trained LLMs/LRMs, graph-structured memory (e.g., KGs), and RL-driven controllers

Nice to have

  • Symbolic and sub-symbolic methods
  • Recursive planning strategies
  • Multi-agent systems
  • Cooperation, negotiation, and alignment of objectives among agents
  • Multi-Modal NL → Conversational Interfaces
  • Graph-structured memory (e.g., KGs)
  • Recursive task planners
  • Search-based or policy-based reasoners
  • Belief-state trackers

What the JD emphasized

  • reasoning, planning, and decision-making systems
  • architected post-training intelligence frameworks
  • integrated Large Reasoning Models (LRMs) with Knowledge Graphs
  • applied Reinforcement Learning (RL)
  • design and deployment of multi-agent systems
  • distributed reasoning and joint planning among cooperative agents
  • complex multi-stage planning and adaptive coordination

Other signals

  • multi-agent systems
  • reasoning, planning, and decision-making systems
  • Large Reasoning Models (LRMs)
  • Reinforcement Learning (RL)
  • cognitive AI systems
  • goal-directed reasoning systems
  • complex multi-stage planning
  • distributed intelligence architectures