What you'd actually do

Build AI-powered systems that improve the quality, reliability, and customer experience of AJO — by automating issue detection and resolution with human-in-the-loop approval, learning from operational patterns to prevent recurring failures, and providing real-time visibility into customer health and platform stability

Develop intelligent knowledge systems that compound expertise over time — using vector embeddings, similarity retrieval, and pattern clustering to ensure every incident investigation builds on past learnings, making the platform progressively smarter and more self-healing

Design and implement LLM-based workflows using prompt engineering, structured outputs, tool calling, and agentic reasoning patterns to create autonomous capabilities that operate safely at production scale

Build evaluation frameworks to measure AI system performance: quality improvement rates, automation success rates, mean time to resolution (MTTR) reduction, and customer impact metrics

Integrate AI capabilities with production infrastructure: Kubernetes, Prometheus, Splunk, GitHub, and 30+ operational data sources — creating closed-loop systems that detect, learn, and act autonomously

Skills

Required

Python
ML frameworks (scikit-learn, PyTorch, TensorFlow, HuggingFace, or LangChain)
LLM APIs (OpenAI, Anthropic Claude, Azure OpenAI)
prompt engineering
vector databases and similarity search (FAISS, Pinecone, ChromaDB, MongoDB Atlas Vector Search, or similar)
ML concepts (embeddings, clustering, classification, evaluation metrics)
building APIs
integrating ML models into backend services (FastAPI, Flask, or similar)
model monitoring
A/B testing
continuous evaluation
safety guardrails for AI systems
problem-solving skills
attention to detail
communication and collaboration

Nice to have

Kubernetes
observability tools (Prometheus, Grafana, Datadog)
incident management systems
building AI agents for operational use cases

The Opportunity

Adobe Journey Optimizer (AJO) powers personalized, real-time customer experiences at massive scale for global brands. Our Reliability Engineering & Operational Intelligence (REOI) team is building AJO's autonomous operating system — an AI-native platform that proactively improves product quality, accelerates issue resolution, and enhances customer experience through intelligent automation and continuous learning.

We are seeking a** Machine Learning Engineer** who is eager to apply ML and AI to solve real challenges in reliability, quality, and operational intelligence at scale. In this role, you will build AI systems that make AJO progressively more reliable and self-healing — learning from every incident, preventing recurring failures, and ensuring exceptional customer experiences while enabling the platform to scale 4x without scaling operational overhead. This is a unique opportunity to work at the intersection of production systems, AI/ML, and product quality — where your work directly impacts how millions of customer journeys are delivered reliably every day.

What You'll Do

Build AI-powered systems that improve the quality, reliability, and customer experience of AJO — by automating issue detection and resolution with human-in-the-loop approval, learning from operational patterns to prevent recurring failures, and providing real-time visibility into customer health and platform stability
Develop intelligent knowledge systems that compound expertise over time — using vector embeddings, similarity retrieval, and pattern clustering to ensure every incident investigation builds on past learnings, making the platform progressively smarter and more self-healing
Design and implement LLM-based workflows using prompt engineering, structured outputs, tool calling, and agentic reasoning patterns to create autonomous capabilities that operate safely at production scale
Build evaluation frameworks to measure AI system performance: quality improvement rates, automation success rates, mean time to resolution (MTTR) reduction, and customer impact metrics
Integrate AI capabilities with production infrastructure: Kubernetes, Prometheus, Splunk, GitHub, and 30+ operational data sources — creating closed-loop systems that detect, learn, and act autonomously
Apply ML techniques to operational data: anomaly detection for early issue detection, time-series forecasting for capacity planning, pattern clustering for recurring failure identification, and predictive analysis for proactive prevention
Collaborate with SREs, software engineers, and product teams to understand quality and reliability challenges, then design and deploy AI solutions that address them systematically
Contribute to code reviews, testing, documentation, and CI/CD pipelines — building production-grade ML systems with the same rigor as mission-critical infrastructure

What You Need to Succeed

BS/MS in Computer Science, Machine Learning, Data Science, or related field, with 2-4 years of professional experience (or strong academic/internship experience in ML/AI applied to real-world problems)
Hands-on experience with Python and ML frameworks: scikit-learn, PyTorch, TensorFlow, HuggingFace, or LangChain
Practical knowledge of LLM APIs (OpenAI, Anthropic Claude, Azure OpenAI) and prompt engineering techniques for building agentic workflows
Understanding of vector databases and similarity search (FAISS, Pinecone, ChromaDB, MongoDB Atlas Vector Search, or similar)
Foundational knowledge of ML concepts: embeddings, clustering, classification, evaluation metrics (precision/recall/F1), and model deployment best practices
Comfortable building APIs and integrating ML models into backend services using FastAPI, Flask, or similar frameworks
Eagerness to learn production ML operations: model monitoring, A/B testing, continuous evaluation, and safety guardrails for AI systems
Strong problem-solving skills, attention to detail, and the ability to iterate quickly based on data and feedback
Excellent communication and collaboration — able to explain ML concepts to non-ML engineers and translate business requirements into technical solutions
Bonus: Experience with Kubernetes, observability tools (Prometheus, Grafana, Datadog), incident management systems, or building AI agents for operational use cases

About Adobe

Adobe empowers everyone to create through innovative platforms and tools that unleash creativity, productivity and personalized customer experiences. Adobe’s industry-leading offerings including Adobe Acrobat Studio, Adobe Express, Adobe Firefly, Creative Cloud, Adobe Experience Platform, Adobe Experience Manager, and GenStudio enable people and businesses to turn ideas into impact, powered by AI and driven by human ingenuity.

Our 30,000+ employees worldwide are creating the future and raising the bar as we drive the next decade of growth. We’re on a mission to hire the very best and believe in creating a company culture where all employees are empowered to make an impact. At Adobe, we believe that great ideas can come from anywhere in the organization. The next big idea could be yours.

** Let’s Adobe together**

At Adobe, we believe in creating a company culture where all employees are empowered to make an impact. Learn more about Adobe life, including our values and culture, focus on people, purpose and community, Adobe for All, comprehensive benefits programs, the stories we tell, the customers we serve, and how you can help us advance our mission of empowering everyone to create.

Adobe is proud to be an Equal Employment Opportunity employer. We do not discriminate based on gender, race or color, ethnicity or national origin, age, disability, religion, sexual orientation, gender identity or expression, veteran status, or any other protected characteristic. Learn more.

Adobe aims to make our Careers website and recruiting process accessible to any and all users. If you have a disability or special need that requires accommodation to navigate our website or complete the application process, email accommodations@adobe.com or call +1 408-536-3015.

AI Use Guidelines for Interviews: Our interviews are designed to reflect your own skills and thinking. The use of AI or recording tools during live interviews is not permitted unless explicitly invited by the interviewer or approved in advance as part of a reasonable accommodation. If these tools are used inappropriately or in a way that misrepresents your work, your application may not move forward in the process.

At Adobe, we empower employees to innovate with AI — and we look for candidates eager to do the same. As part of the hiring experience, we provide clear guidance on where AI is encouraged during the process and where it’s restricted during live interviews. See how we think about AI in the hiring experience.

Expected Pay Range:

Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. The U.S. pay range for this position is $102,400 -- $202,250 annually. Pay within this range varies by work location and may also depend on job-related knowledge, skills, and experience. Your recruiter can share more about the specific salary range for the job location during the hiring process.

In California, the pay range for this position is $139,700 - $202,250

At Adobe, for sales roles starting salaries are expressed as total target compensation (TTC = base + commission), and short-term incentives are in the form of sales commission plans. Non-sales roles starting salaries are expressed as base salary and short-term incentives are in the form of the Annual Incentive Plan (AIP).

In addition, certain roles may be eligible for long-term incentives in the form of a new hire equity award.

State-Specific Notices:

California:

Fair Chance Ordinances

Adobe will consider qualified applicants with arrest or conviction records for employment in accordance with state and local laws and “fair chance” ordinances.

Colorado:

Application Window Notice

If this role is open to hiring in Colorado (as listed on the job posting), the application window will remain open until at least the date and time stated above in Pacific Time, in compliance with Colorado pay transparency regulations. If this role does not have Colorado listed as a hiring location, no specific application window applies, and the posting may close at any time based on hiring needs.

Massachusetts:

Massachusetts Legal Notice

It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.