Senior Site Reliability Engineer, Currents

Braze Braze · Enterprise · New York, NY · Engineering

Senior Site Reliability Engineer for Braze's Currents team, responsible for building, maintaining, and evolving a large-scale Kafka-based event pipeline that handles tens of billions of messages daily. Focuses on observability, scalability, and reliability strategy for a critical data streaming system.

What you'd actually do

  1. Solve live performance and reliability issues and prevent their recurrence
  2. Write and review code, educating engineers and building a culture of reliability
  3. Practice sustainable incident response and blameless postmortems
  4. Define and enable standards for monitoring, reliability, and performance
  5. Bridge the gap between infrastructure and platform engineering teams
  6. Support and improve services by planning for scale and reliability
  7. Guide junior engineers in SRE best practices, software engineering, and agile project leadership

Skills

Required

  • software engineering
  • site reliability engineering
  • Kafka
  • observability
  • scalability
  • reliability

Nice to have

  • SRE best practices
  • agile project leadership