Observability Developer

Adobe Adobe · Enterprise · Lehi, UT +1

This role focuses on building and finding tools for Adobe's Observability services, integrating AI agent development and workflows into large-scale deployments. The responsibilities include architecting and implementing observability platforms, managing logging systems, and ensuring the uptime and quality of service for Adobe's customers. The role requires experience with distributed applications, cloud platforms, and container orchestration, with a strong emphasis on AI agent development and integration.

What you'd actually do

  1. AI agent development and experience integrating AI workflows into large-scale deployments
  2. Experience architecting and implementing large-scale Observability platforms
  3. Experience with internally hosted logging systems like Splunk, Clickhouse, Loki, Elastic, assisting clients and improving environment performance and stability
  4. Ensure the highest level of up-time and Quality of Service (QoS) to Adobe’s customers through operational excellence
  5. Impacts the organization through contribution to technical direction and strategic decisions

Skills

Required

  • 3-5+ years production level experience with distributed applications at scale in public and/or private cloud
  • Experience architecting and implementing large-scale Observability platforms
  • B.S. degree in Computer Science or related technical field
  • Experience with internally hosted logging systems like Splunk, Clickhouse, Loki, Elastic, assisting clients and improving environment performance and stability
  • AI agent development and experience integrating AI workflows into large-scale deployments
  • Programming experience with languages like Go, Python; Experience building integrations and applications to large-scale Observability environments
  • Experience designing and implementing systems for fault tolerance, scalability and stability
  • Experience developing, deploying and running distributed applications on cloud platforms. Experience with container and orchestration technologies (Docker, Kubernetes)
  • Ensure the highest level of up-time and Quality of Service (QoS) to Adobe’s customers through operational excellence
  • Knowledge in defining service level objectives (SLOs) and service level indicators (SLIs) to represent and measure service quality
  • Knowledge of (public and/or private) cloud deploym ents – AWS, Azure, Data Center
  • Collaborate with SRE and Engineering/Product teams in driving critical initiatives
  • Experience in designing and maintaining production monitoring systems
  • Experience in solving performance and stability issues using a wide variety of tools
  • Excellent communicator in and across teams, driving projects to completion
  • Impacts the organization through contribution to technical direction and strategic decisions

Nice to have

  • Experience with other Observability tooling like Grafana, Cortex, Tempo, OTEL
  • Experience with Open-Source products/community like Open telemetry
  • Familiar with a variety of cloud security and automation concepts, practices and procedures.
  • Promote the DevOps/SRE approach

What the JD emphasized

  • AI agent development
  • integrating AI workflows into large-scale deployments
  • Observability platforms
  • logging systems
  • distributed applications at scale

Other signals

  • AI agent development
  • integrating AI workflows into large-scale deployments
  • Observability platforms
  • logging systems
  • distributed applications at scale