Senior Software Engineer - Infrastructure

Roblox · Consumer · San Mateo, CA · Software Engineering

This Senior Software Engineer role sits on the Reliability team at Roblox. It focuses on building and maintaining high-availability, scalable infrastructure, along with the frameworks and automated tooling behind performance benchmarking, chaos engineering, and self-healing systems. The work spans architecting software, automating repetitive tasks, and developing tooling to identify and understand infrastructure issues. Experience with LLM-based agents or RAG systems in production is a plus.

What you'd actually do

  1. Architect and ship high-availability software and libraries that programmatically enforce fault-tolerance and system resilience.
  2. Engineer scalable frameworks and automated tooling for performance benchmarking, chaos engineering, and self-healing infrastructure.
  3. Build and maintain automation to streamline repetitive tasks and improve system reliability.
  4. Develop and implement tooling to proactively identify and understand infrastructure issues and platform degradations.

Skills

Required

  • Full-cycle software development
  • Site reliability
  • Clean, maintainable code
  • Building software and tools
  • Go
  • C#
  • Java

Nice to have

  • LLM-based agents
  • RAG systems