Senior Site Reliability Engineer

PayPal PayPal · Fintech · San Jose, CA +1 · Site Reliability Engineering

Senior Site Reliability Engineer focused on the reliability, performance, and availability of PayPal's global mobile and backend systems. This role involves implementing reliability standards, building automation, and enhancing observability across the stack, bridging the gap between clients and backend services.

What you'd actually do

  1. Take ownership of system performance monitoring, identify inefficiencies, and lead initiatives to improve the overall availability and reliability of digital platforms and applications.
  2. Lead and manage the response to complex, high-priority incidents, ensuring prompt resolution and a thorough root cause analysis to prevent future occurrences.
  3. Design and implement advanced automation frameworks to improve operational efficiency, streamline processes, and reduce human error.
  4. Lead reliability-focused initiatives, ensuring systems are highly available, resilient, and scalable, and promote best practices across engineering teams.
  5. Enhance the monitoring infrastructure by identifying key metrics, optimizing alerting, and improving system observability to ensure the reliability of large-scale systems.

Skills

Required

  • Site Reliability Engineering
  • software development
  • systems engineering
  • backend architectures
  • APIs
  • data flows
  • cross-system dependencies
  • monitoring
  • observability
  • alerting
  • automation
  • tooling development
  • Python
  • Go
  • SLIs/SLOs
  • distributed systems
  • cloud infrastructure
  • AWS
  • GCP
  • Azure
  • CI/CD pipelines
  • debugging
  • problem-solving
  • collaboration
  • communication
  • mentorship