Manager, Safety Operations

xAI xAI · AI Frontier · Bastrop, TX · Safety

Manager for Safety Operations at xAI, responsible for leading a team that trains and refines Grok (an LLM) to enforce terms of service, minimize risks, and prevent harmful content. The role involves managing analysts, overseeing data labeling, ensuring quality curated data for ethical alignment, identifying abuse vectors, and improving AI defenses. Requires leadership experience in AI-driven operations and expertise in LLM improvement for safety and efficiency.

What you'd actually do

  1. Lead, mentor, and manage the team that monitors and takes action on content and behavior that goes against our terms of service, escalating as needed.
  2. Oversee the processing of appeals and ensuring proper labeling of use cases in the system.
  3. Guide the team’s use of proprietary software to provide labels, annotations, and inputs on projects involving safety protocols, risk scenarios, and policy compliance.
  4. Ensure the delivery of high-quality curated data that reinforces xAI’s rules and ethical alignment.
  5. Mentor team members, conduct performance management and calibration, drive feedback on tasks that improve AI's defenses to detect illegal and unethical behavior, identify emerging abuse vectors, and implement process improvements and automations.

Skills

Required

  • Leadership and people management
  • AI-driven operations
  • LLM improvement for safety and efficiency
  • Online safety and harm reduction
  • Policy interpretation and training
  • Data analysis
  • Ethical reasoning
  • Risk assessment
  • Team performance optimization
  • Communication skills
  • Interpersonal skills
  • Analytical skills
  • Ethical decision-making
  • Quality assurance
  • Continuous improvement
  • Automation design

Nice to have

  • Trust and Safety management in social media
  • AI/automation tools in Trust and Safety
  • Red-teaming and adversarial testing of LLMs
  • Translating findings into concrete improvements

What the JD emphasized

  • Proven leadership and people management experience in AI-driven operations
  • Expertise in improving Large Language Models (LLMs)
  • Proven experience in online safety and reducing harm
  • Ability to interpret, apply, and train teams on xAI safety policies effectively
  • Expertise in leading red-teaming and adversarial testing of Large Language Models

Other signals

  • training and refining Grok
  • enforce our terms of service
  • minimizing existential risks
  • enforcing xAI’s rules
  • promoting responsible development
  • prevent illegal and harmful content
  • monitoring and takes action on content and behavior
  • providing labels, annotations, and inputs on projects involving safety protocols
  • high-quality curated data that reinforces xAI’s rules and ethical alignment
  • improve AI's defenses to detect illegal and unethical behavior
  • identify emerging abuse vectors
  • align Grok with our rules enforcement
  • strengthen overall safety operations
  • improving Large Language Models (LLMs) to maximize efficiencies in enforcement and support
  • increase security and safety of our platform
  • online safety and reducing harm
  • interpret, apply, and train teams on xAI safety policies
  • ethical reasoning, risk assessment
  • safety-focused actions
  • continuous improvement of processes, people, and operations to prioritize safety and risk mitigation
  • data analysis to identify emerging abuse vectors
  • design automations that strengthen enforcement effectiveness and platform safety
  • leading red-teaming and adversarial testing of Large Language Models
  • proactively identify novel abuse vectors, jailbreaks, and safety failure modes
  • translate findings into concrete improvements for enforcement systems, team processes, and platform robustness