What you'd actually do

Ability to own and drive ingestion cost optimization end-to-end: analyzing pipeline data, designing guardrails, and engaging directly with customer engineering teams to identify and reduce unnecessary log volume

Experience integrating AI workflows into large-scale deployments; ability to design and implement AI-assisted tooling that automates user interactions and surfaces actionable insights from high-volume log datasets

Deep hands-on experience with internally hosted logging systems such as Splunk, ClickHouse, Loki, or Elastic; track record of improving environment performance, stability, and cost efficiency at scale

Experience with OpenTelemetry — including collector configuration, pipelines, and instrumentation — as a core requirement given Adobe’s OTel-native observability strategy

Proven ability to design systems for fault tolerance, scalability, and stability, and to lead resolution of high-complexity performance and reliability issues

Skills

Required

5-8+ years of production-level experience with distributed applications at scale in public and/or private cloud
Proven experience designing and contributing to the architecture of large-scale Observability platforms
Deep hands-on experience with internally hosted logging systems such as Splunk, ClickHouse, Loki, or Elastic
Experience with OpenTelemetry — including collector configuration, pipelines, and instrumentation
Ability to own and drive ingestion cost optimization end-to-end
Experience integrating AI workflows into large-scale deployments
Strong programming skills in Go and/or Python
Experience building production-grade integrations and applications for large-scale Observability environments
Experience developing, deploying, and operating distributed applications on cloud platforms
Strong command of container and orchestration technologies (Docker, Kubernetes)
Proven ability to design systems for fault tolerance, scalability, and stability
Experience defining service level objectives (SLOs) and service level indicators (SLIs)
Knowledge of public and/or private cloud deployments — AWS, Azure, Data Center
Comfortable owning on-call coverage across a multi-tool observability stack, including leading incident response for high-severity issues

Nice to have

Experience evaluating or prototyping alternative storage/processing backends (e.g., ClickHouse, Loki)
Experience with other Observability tooling such as Grafana, Cortex, and Tempo

Other signals

integrating AI workflows into large-scale deployments

design and implement AI-assisted tooling

automates user interactions

surfaces actionable insights from high-volume log datasets

OpenTelemetry — including collector configuration, pipelines, and instrumentation — as a core requirement given Adobe’s OTel-native observability strategy

Join a globally diverse team that both builds and finds best-of-breed tools to bring critical Observability services to all of Adobe. Our team embodies DevOps, as our responsibilities range from crafting new tools and UIs to maintaining and supporting one of the largest logging deployments in the industry, in partnership with other observability tools.

We’re a close-knit team dedicated to providing a robust platform, supporting both Adobe’s engineering teams and each other. We need a new Developer to help shape and implement Adobe’s observability strategy.

If you enjoy owning complex, high-impact problems where your work directly moves the needle for Adobe’s engineering community, come talk to us.

Job Requirements

5-8+ years of production-level experience with distributed applications at scale in public and/or private cloud
Proven experience designing and contributing to the architecture of large-scale Observability platforms

Must Have

Deep hands-on experience with internally hosted logging systems such as Splunk, ClickHouse, Loki, or Elastic; track record of improving environment performance, stability, and cost efficiency at scale
Experience with OpenTelemetry — including collector configuration, pipelines, and instrumentation — as a core requirement given Adobe’s OTel-native observability strategy
Ability to own and drive ingestion cost optimization end-to-end: analyzing pipeline data, designing guardrails, and engaging directly with customer engineering teams to identify and reduce unnecessary log volume
Experience integrating AI workflows into large-scale deployments; ability to design and implement AI-assisted tooling that automates user interactions and surfaces actionable insights from high-volume log datasets
Strong programming skills in Go and/or Python; experience building production-grade integrations and applications for large-scale Observability environments
Experience developing, deploying, and operating distributed applications on cloud platforms; strong command of container and orchestration technologies (Docker, Kubernetes)
Proven ability to design systems for fault tolerance, scalability, and stability, and to lead resolution of high-complexity performance and reliability issues
Experience defining service level objectives (SLOs) and service level indicators (SLIs); able to translate platform health into meaningful, measurable quality indicators
Knowledge of public and/or private cloud deployments — AWS, Azure, Data Center
Comfortable owning on-call coverage across a multi-tool observability stack, including leading incident response for high-severity issues

Good to Have

Experience evaluating or prototyping alternative storage/processing backends (e.g., ClickHouse, Loki) as part of platform cost reduction and scalability strategy; ability to contribute to a phased migration plan from Splunk
Experience with other Observability tooling such as Grafana, Cortex, and Tempo

About Adobe

Adobe empowers everyone to create through innovative platforms and tools that unleash creativity, productivity and personalized customer experiences. Adobe’s industry-leading offerings including Adobe Acrobat Studio, Adobe Express, Adobe Firefly, Creative Cloud, Adobe Experience Platform, Adobe Experience Manager, and GenStudio enable people and businesses to turn ideas into impact, powered by AI and driven by human ingenuity.

Our 30,000+ employees worldwide are creating the future and raising the bar as we drive the next decade of growth. We’re on a mission to hire the very best and believe in creating a company culture where all employees are empowered to make an impact. At Adobe, we believe that great ideas can come from anywhere in the organization. The next big idea could be yours.

** Let’s Adobe together**

At Adobe, we believe in creating a company culture where all employees are empowered to make an impact. Learn more about Adobe life, including our values and culture, focus on people, purpose and community, Adobe for All, comprehensive benefits programs, the stories we tell, the customers we serve, and how you can help us advance our mission of empowering everyone to create.

Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other protected characteristic. Learn more.

Adobe aims to make our Careers website and recruiting process accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email accommodations@adobe.com or call +1 408-536-3015.

AI Use Guidelines for Interviews: Our interviews are designed to reflect your own skills and thinking. The use of AI or recording tools during live interviews is not permitted unless explicitly invited by the interviewer or approved in advance as part of a reasonable accommodation. If these tools are used inappropriately or in a way that misrepresents your work, your application may not move forward in the process.

At Adobe, we empower employees to innovate with AI — and we look for candidates eager to do the same. As part of the hiring experience, we provide clear guidance on where AI is encouraged during the process and where it’s restricted during live interviews. See how we think about AI in the hiring experience.

Must Have

Experience with OpenTelemetry — including collector configuration, pipelines, and instrumentation — as a core requirement given Adobe’s OTel-native observability strategy

Strong programming skills in Go and/or Python; experience building production-grade integrations and applications for large-scale Observability environments

Experience developing, deploying, and operating distributed applications on cloud platforms; strong command of container and orchestration technologies (Docker, Kubernetes)

Proven ability to design systems for fault tolerance, scalability, and stability, and to lead resolution of high-complexity performance and reliability issues

Experience defining service level objectives (SLOs) and service level indicators (SLIs); able to translate platform health into meaningful, measurable quality indicators

Knowledge of public and/or private cloud deployments — AWS, Azure, Data Center

Comfortable owning on-call coverage across a multi-tool observability stack, including leading incident response for high-severity issues

Good to Have

Experience evaluating or prototyping alternative storage/processing backends (e.g., ClickHouse, Loki) as part of platform cost reduction and scalability strategy; ability to contribute to a phased migration plan from Splunk

Experience with other Observability tooling such as Grafana, Cortex, and Tempo

About Adobe

** Let’s Adobe together**

Senior Observability Platform Engineer

What you'd actually do

Skills

Required

Nice to have

What the JD emphasized

Other signals

Job Requirements

Must Have

Good to Have

Job Requirements

Must Have

Good to Have