Senior Site Reliability Engineer I

Booking Booking · Hospitality · Bangalore, India · Engineering

Senior Site Reliability Engineer I focused on treating operations as a software problem, ensuring system reliability, availability, performance, scalability, latency, observability, and efficiency. Responsibilities include designing and implementing complex technical solutions, leading incident response, mentoring junior engineers, and reducing operational toil through automation. The role involves building software applications, designing software systems, end-to-end system ownership, and technical incident management.

What you'd actually do

  1. Is responsible to build software applications by using relevant development languages and applying knowledge of systems, services and tools appropriate for the business area and guide more junior members of the team in this topic.
  2. Is responsible to evaluate possible architecture solutions by taking into account cost, business requirements, technology requirements and emerging technologies
  3. Is responsible to reduce business continuity risks and bus factor by applying state-of-the-art practices and tools, and writing the appropriate documentation such as runbooks and OpDocs
  4. Is responsible to address and resolve live production issues by mitigating the customer impact within SLA
  5. Is responsible to ensure that infrastructure stays current by reducing technical debt, searching for bottlenecks and prepari

Skills

Required

  • site reliability engineering
  • system reliability
  • availability
  • performance
  • scalability
  • latency
  • observability
  • efficiency
  • automation
  • incident response
  • software development
  • system design
  • testing techniques
  • root cause analysis
  • technical debt management

Nice to have

  • mentoring junior engineers
  • thought leadership