Member of Technical Staff - RL Infrastructure

xAI xAI · AI Frontier · Palo Alto, CA · Model

This role focuses on building infrastructure and frameworks to support AI agents, data processing, and model evaluations, with a strong emphasis on RL training and automation. The primary output is the agent environment and associated tooling, with secondary contributions to data pipelines for RL training.

What you'd actually do

  1. Creating and maintaining frameworks for agent, data, and model evaluation tasks.
  2. Building environments for AI agents.
  3. Tools for automating common workflows.
  4. Improving alerts, metrics and error handling on large scale RL jobs.
  5. Refactoring existing agent, data, eval, training frameworks for better modularity.

Skills

Required

  • Experience building and maintaining frameworks that are used by many engineers.
  • Experience in building high-performance sandboxes, virtual machines, and simulations.
  • Experience building full-stack apps for automating workflows and data visualization.
  • Experience in rapid iteration of research to production cycles.
  • Experience in test automation, CI/CD.

Nice to have

  • RL infrastructure
  • agent environments
  • evaluation frameworks
  • data pipelines
  • automation frameworks
  • large scale RL training

What the JD emphasized

  • frameworks that are used by many engineers
  • high-performance sandboxes, virtual machines, and simulations
  • rapid iteration of research to production cycles

Other signals

  • building environments for AI agents
  • creating and maintaining frameworks for agent, data, and model evaluation tasks
  • automating common workflows
  • improving alerts, metrics and error handling on large scale RL jobs