Member of Technical Staff - Real-time Storage

xAI · AI Frontier · Palo Alto, CA · Infrastructure

Seeking exceptional storage & database engineers to integrate xAI's advanced AI infrastructure into a platform used by 600 million users monthly. The role involves building xAI's new storage tier for training, inference, recommendations, and real-time data extraction, including an exabyte-scale object store, a high-throughput key/value store, a massive caching tier, and a scalable vector database for recommendation systems.

What you'd actually do

  1. Design, build, and launch to production new features and improvements aimed at unifying common components across the storage systems.
  2. Dive into performance issues, work with customers, and deliver solutions that meet their latency, availability, and data durability requirements.
  3. Lead and drive incident response and recovery with your peers. Review and contribute to incident postmortems, and hold them to a high bar.
  4. Work in a collaborative environment and uplevel your peers through mentoring and code and design reviews.
  5. Be open to developing new skills and learning on the job as we navigate new technology spaces.

Skills

Required

  • software development
  • building storage systems or databases
  • reliability
  • performance
  • quality
  • high performance C++, Rust, or JVM-based languages
  • building, running, and operating scalable and resilient distributed systems

Nice to have

  • AI infrastructure
  • vector database
  • recommendation systems

What the JD emphasized

  • building storage systems or databases
  • Obsessed with reliability, performance, and quality
  • Expertise in building, running, and operating scalable and resilient distributed systems