Member of Technical Staff - Reasoning

xAI · AI Frontier · London, United Kingdom · Model

This role focuses on building frameworks that enhance AI reasoning capabilities, developing distributed reinforcement learning (RL) systems, optimizing inference-time compute techniques such as tree search and planning, and creating environments for agents. It requires experience with large-scale RL and distributed systems, and the ability to stay current with state-of-the-art algorithms.

What you'd actually do

  1. Build robust and scalable distributed RL systems.
  2. Optimize frameworks to enable complex inference-time reasoning.
  3. Develop environments and harnesses for agents.

Skills

Required

  • Reinforcement Learning
  • Distributed Systems
  • Inference Optimization
  • Agent Development

Nice to have

  • Tree Search
  • Planning Algorithms

What the JD emphasized

  • large-scale reinforcement learning systems
  • distributed systems
  • state-of-the-art RL and inference-time compute algorithms

Other signals

  • building frameworks for reasoning
  • distributed reinforcement learning systems
  • inference-time compute
  • environments for agents