Senior Computer Scientist - Sre

Adobe Adobe · Enterprise · Bangalore, India

Senior Computer Scientist - SRE role at Adobe, focusing on the reliability, scalability, and automation of the Adobe Pass platform. The role involves architecting distributed systems, implementing AI/ML for predictive monitoring, leading reliability initiatives, and managing incidents. Requires extensive experience in SRE, cloud-native environments, Kubernetes, IaC, and observability stacks.

What you'd actually do

  1. Define and drive the long-term reliability and scalability strategy for the Adobe Pass platform, aligning with product and business goals.
  2. Architect large-scale, distributed, and multi-region systems designed for resiliency, observability, and self-healing.
  3. Build and champion advanced automation frameworks that enable zero-touch operations across deployment, recovery, and scaling workflows.
  4. Introduce AI/ML-based predictive monitoring and anomaly detection systems to anticipate failures before they impact users.
  5. Serve as a technical authority during high-impact incidents, guiding cross-functional teams through real-time mitigation and long-term prevention.

Skills

Required

  • site reliability
  • production engineering
  • large-scale distributed system operations
  • cloud-native environments (AWS, Azure, GCP)
  • Python
  • Go
  • Java
  • Bash
  • Kubernetes
  • microservices
  • service mesh architectures
  • Infrastructure as Code (Terraform, CloudFormation)
  • CI/CD automation frameworks
  • observability and monitoring stacks (Prometheus, Grafana, Datadog, OpenTelemetry)
  • networking
  • storage
  • distributed databases (SQL and NoSQL)
  • architectural decisions
  • reliability strategy
  • communication
  • leadership
  • stakeholder management

Nice to have

  • reliability frameworks
  • SRE platforms
  • error budgets
  • chaos engineering
  • reliability reviews
  • high-traffic or latency-sensitive systems
  • media streaming
  • advertising
  • real-time platforms
  • big data ecosystems (Kafka, Spark, Hadoop)
  • large-scale data ingestion pipelines
  • security
  • compliance
  • governance in production environments (SOC2, GDPR, ISO27001)
  • Cloud or Kubernetes certifications (AWS Solutions Architect Professional, CKA/CKAD, GCP Professional Cloud Architect)
  • Published contributions or conference talks on reliability, automation, or distributed systems

What the JD emphasized

  • 12+ years of experience
  • highly available, globally distributed systems
  • Kubernetes, microservices, and service mesh architectures
  • Infrastructure as Code
  • CI/CD automation frameworks
  • security, compliance, and governance in production environments