What you'd actually do

Creating and maintaining frameworks for agent, data, and model evaluation tasks.

Building environments for AI agents.

Building tools for automating common workflows.

Improving alerts, metrics, and error handling on large-scale RL jobs.

Refactoring existing agent, data, eval, and training frameworks for better modularity.

Skills

Required

Experience building and maintaining frameworks that are used by many engineers.
Experience in building high-performance sandboxes, virtual machines, and simulations.
Experience building full-stack apps for automating workflows and data visualization.
Experience iterating rapidly through research-to-production cycles.
Experience in test automation and CI/CD.

Nice to have

RL infrastructure
agent environments
evaluation frameworks
data pipelines
automation frameworks
large-scale RL training

ABOUT xAI

xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineering excellence. This organization is for individuals who appreciate challenging themselves and thrive on curiosity. We operate with a flat organizational structure. All employees are expected to be hands-on and to contribute directly to the company’s mission. Leadership is given to those who show initiative and consistently deliver excellence. Work ethic and strong prioritization skills are important. All employees are expected to have strong communication skills. They should be able to concisely and accurately share knowledge with their teammates.

ABOUT THE ROLE:

xAI is seeking experienced software engineers to create robust data pipelines, comprehensive evaluations for benchmarking LLMs, and automation frameworks to increase the productivity of researchers and engineers.

Typical problems you will deal with include the following:

We have a new agentic model capability that we’d like to improve. How do we design an efficient and robust environment for the agent to perform actions in?
Evaluations and observability are a core part of knowing what we need to improve in our models. What new features can we add into our evaluation framework to ease the workflow of researchers & engineers and increase observability?
A new open-source evaluation dataset has been released and researchers would like to track our models' performance on it. How should we onboard it into our internal evaluation framework?
Datasets have been collected that require complex preprocessing to prepare them for large-scale RL training. How do we standardize our preprocessing pipelines to minimize dataset onboarding time?
A researcher on the team has an idea for how to augment a dataset to produce additional training data. How should we go about creating the data augmentation pipeline?

RESPONSIBILITIES:

Creating and maintaining frameworks for agent, data, and model evaluation tasks.
Building environments for AI agents.
Building tools for automating common workflows.
Improving alerts, metrics, and error handling on large-scale RL jobs.
Refactoring existing agent, data, eval, and training frameworks for better modularity.
Designing operating procedures and coding standards to streamline the transition from small-scale experimentation to large-scale RL training.
Writing unit tests and CI/CD frameworks to support rapid development cycles.

BASIC QUALIFICATIONS:

Experience building and maintaining frameworks that are used by many engineers.
Experience in building high-performance sandboxes, virtual machines, and simulations.
Experience building full-stack apps for automating workflows and data visualization.
Experience iterating rapidly through research-to-production cycles.
Experience in test automation and CI/CD.

COMPENSATION AND BENEFITS:

$180,000 - $440,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view our Recruitment Privacy Notice.

Member of Technical Staff - RL Infrastructure
