Principal Software Engineer

Roblox Roblox · Consumer · San Mateo, CA · Software Engineering

Principal Software Engineer on the Reliability team at Roblox, focusing on building and scaling systems that leverage AI/ML for service efficiency and reliability. The role involves leading engineers, optimizing distributed systems, and creating performance monitoring and observability tools.

What you'd actually do

  1. Lead a team of engineers (4+) to drive multi quarter projects to scale Roblox and make systems more efficient.
  2. Build systems that leverage AI and ML to scale services and workloads across multiple geographical regions.
  3. Build, automate and standardize process automation to create a “golden path” of tooling and platform support that powers the fundamental Roblox ecosystem.
  4. Create performance monitoring services and observability towards understanding capacity issues and platform degradations.
  5. Work with product teams across organizations to understand their most critical problems and solve them with software systems.

Skills

Required

  • BS degree (or equivalent professional experience) in Computer Science or related engineering field
  • 8+ years of experience
  • 5+ years in Software Engineering or SRE
  • building distributed systems
  • optimizing end-to-end performance of distributed systems
  • distributed systems at scale
  • infrastructure reliability
  • scalability
  • continuous improvement
  • Python
  • Go
  • C#
  • Rust
  • Java
  • leading engineering teams
  • mentoring junior engineers
  • project lifecycles
  • sprints
  • breaking down complex tasks
  • reporting status

Nice to have

  • AI coding agents

What the JD emphasized

  • at least 8 years of experience including at least 5+ years in Software Engineering or SRE
  • track record of building distributed systems and getting them adopted
  • Ability to lead, coach, and mentor junior engineers
  • optimizing end-to-end performance of distributed systems
  • Deep expertise in distributed systems at scale
  • design and build systems and products
  • proficient in at least one programming language (e.g., Python, Go, C#, Rust, Java)
  • Tech Lead a group of engineers to deliver on multi quarter projects

Other signals

  • Build systems that leverage AI and ML to scale services and workloads
  • Create performance monitoring services and observability
  • Experience optimizing end-to-end performance of distributed systems
  • Deep expertise in distributed systems at scale