Member of Technical Staff - Internal Tools (front-end)

xAI xAI · AI Frontier · Palo Alto, CA · Model

This role is for a Senior Frontend Engineer with full-stack capabilities to build and maintain internal tools for AI research. The platforms include a human data collection system (Starfleet) and a research platform for model training, evaluations, and experiment management (Toolbox). The role requires strong frontend (React/TypeScript) and backend (Node.js/Python) skills, experience with large-scale web applications, and product ownership. Familiarity with AI/ML workflows is a bonus.

What you'd actually do

  1. Design and build delightful, performant, and scalable user interfaces that researchers and engineers love to use every day.
  2. Own and elevate two mission-critical platforms that directly accelerate xAI’s frontier research: Starfleet — Our Human Data Collection Platform (large-scale annotation, labeling, quality control, and human feedback systems) and Toolbox — Our unified Research Platform for model training orchestration, evaluations, benchmarking, and experiment management.
  3. Write production-grade backend code.
  4. Write clean, production backend code (Node.js/TypeScript or Python preferred).
  5. Build large-scale, data-heavy web applications with excellent performance.

Skills

Required

  • React (TypeScript preferred)
  • modern frontend ecosystem
  • Node.js/TypeScript or Python
  • production backend code
  • large-scale, data-heavy web applications
  • excellent performance
  • product intuition
  • ownership mindset
  • design taste
  • pixel-level details

Nice to have

  • UX design
  • product management
  • founded a startup
  • joined a fast-growing early-stage startup
  • public or internal component libraries / design systems
  • AI/ML workflows
  • annotation tools
  • research platforms

What the JD emphasized

  • mission-critical platforms
  • frontier research
  • large-scale annotation
  • labeling
  • quality control
  • human feedback systems
  • model training orchestration
  • evaluations
  • benchmarking
  • experiment management
  • large-scale, data-heavy web applications