Principal Development Engineer

CVS Health · Healthcare · Work at Home, TX +49 · Innovation and Technology

Principal Software Development Engineer to own the AI Platform within HCD, focusing on LLM gateway strategy, Model Context Protocol (MCP) design, reference architectures, and the Agent Development Lifecycle (ADLC). The role involves leading architectural decisions for agents, setting engineering standards, solving the hardest cross-cutting technical problems, and owning quality gates, eval framework design, and production readiness. Responsibilities include owning observability, building CI/CD pipelines with LLM evaluations, defining AI SDLC standards, and managing AI spend and metrics. The role requires deep expertise in Python, cloud infrastructure (preferably Azure), API gateway patterns, and LLM routing, with a focus on sensitive data handling and compliance in a regulated environment.

What you'd actually do

Own the LLM gateway strategy end-to-end, including model access governance and latency benchmarking across routing layers
Own and continuously evolve the ADLC framework — the team's standard for taking agents from use case discovery through infrastructure planning, evaluation design, development, testing, and production deployment
Provide principal-level technical leadership across the active agent portfolio.
Own the observability mandate — driving adoption across all production agents and defining eval standards that reflect real business outcomes, not just infrastructure uptime
Maintain the AI spend tracking system — cost per team, cost per agent, cost per model — with automated reporting for senior leadership

Skills

Required

9+ years of professional software engineering experience
Deep expertise in Python
Strong cloud infrastructure experience
Hands-on familiarity with API gateway patterns and LLM routing
Demonstrated ability to influence architecture and engineering standards across multiple teams and work streams without direct management authority

Nice to have

3+ years in a principal, staff, or equivalent senior individual contributor role
Hands-on experience with AI observability tooling such as Arize, LangSmith, or Phoenix
Ability to design eval pipelines that measure what actually matters for business outcomes
Proven track record designing and operating production AI systems — LLM APIs, agent frameworks, RAG architectures, or RPA automation — at meaningful scale
Strong working knowledge of security and compliance requirements for AI systems handling sensitive data, including PHI and PII handling in a regulated environment
Azure experience

What the JD emphasized

AI Platform Architecture
Agent Development Lifecycle (ADLC) Ownership
Production Agent Portfolio Technical Leadership
Observability, Evals & Quality
FinOps, Metrics & Governance
LLM gateway strategy
Model Context Protocol (MCP)
reference architectures
agent delivery
Skills and Plugin Marketplace strategy
ADLC framework
engineering quality gates
eval framework design
hallucination rate targets
adversarial test suites
business outcome metrics
structured rollback procedures
canary deployment patterns
production readiness standards
cross-cutting technical challenges
sensitive data handling
integration patterns
BrowserUse automation reliability
RAG retrieval quality at scale
latency under real practical scenarios
deployment automation strategy
observability mandate
eval standards
CI and CD pipelines
LLM evaluations
prompt versioning
hallucination and failure rate tracking
regression test suites
AI SDLC
guardrails infrastructure
policy enforcement
AI quality pipeline
AI spend tracking system
AI metrics dashboard
compliance and security design leadership
PHI and PII handling
regulated environment
Deep expertise in Python
strong cloud infrastructure experience
API gateway patterns
LLM routing
Demonstrated ability to influence architecture and engineering standards
Hands-on experience with AI observability tooling
design eval pipelines that measure what actually matters for business outcomes
Proven track record designing and operating production AI systems
LLM APIs
agent frameworks
RAG architectures
RPA automation
meaningful scale
Strong working knowledge of security and compliance requirements for AI systems handling sensitive data

Closed

Posted 7w ago · 10 days open

AI score: 8/10
Stage: Agent Serve
Compensation: $144k–$288k
Location: Work at Home, United States: TX, AL, AR, AZ, CA, CO, CT, DC, DE, FL, GA, IA, ID, IL, IN, KS, KY, LA, MA, MD, ME, MI, MN, MO, MS, MT, NC, ND, NE, NH, NJ, NM, NV, NY, OH, OK, OR, PA, RI, SC, SD, TN, UT, VA, VT, WA, WI, WV, WY
Role: Principal · Platform
Function: Engineering
Domain: healthcare
Team: AI Platform within HCD
Maturity: Scaling
Compliance: regulated

We’re building a world of health around every individual — shaping a more connected, convenient and compassionate health experience. At CVS Health®, you’ll be surrounded by passionate colleagues who care deeply, innovate with purpose, hold ourselves accountable and prioritize safety and quality in everything we do. Join us and be part of something bigger – helping to simplify health care one person, one family and one community at a time.

Position Summary

As a Principal Software Development Engineer, you will be the senior technical anchor across the AI Platform within HCD. You will own the platform decisions every agent depends on, set the engineering standards the whole portfolio is built against, drive the hardest architectural problems across the Agent Development Lifecycle, and be a force multiplier for a team of talented engineers who are shipping real work at high velocity.

Role and Responsibilities

AI Platform Architecture

Own the LLM gateway strategy end-to-end, including model access governance and latency benchmarking across routing layers
Lead Model Context Protocol (MCP) farm design for authenticated, scalable tool integrations across all agentic applications
Define and maintain reference architectures, template repositories, SDKs, and the AI QuickStart that set the paved road for agent delivery across HCD and OGAI
Drive the Skills and Plugin Marketplace strategy, ensuring a platform-agnostic reusable component model that works across teams and technology stacks

Agent Development Lifecycle (ADLC) Ownership

Own and continuously evolve the ADLC framework — the team's standard for taking agents from use case discovery through infrastructure planning, evaluation design, development, testing, and production deployment
Enforce engineering quality gates across all agent types: RAG knowledge assistants, BrowserUse and RPA automation, conversational chatbots, voice agents, and multi-step workflow agents
Ensure eval framework design happens before development begins — with defined test datasets, hallucination rate targets, adversarial test suites, and business outcome metrics for every production agent
Own structured rollback procedures, canary deployment patterns, and production readiness standards that prevent instability from propagating through the portfolio

Production Agent Portfolio Technical Leadership

Provide principal-level technical leadership across the active agent portfolio.
Own the hardest cross-cutting technical challenges in the portfolio — sensitive data handling at the agent layer, integration patterns, BrowserUse automation reliability, RAG retrieval quality at scale, and latency under real practical scenarios.
Unblock delivery across multiple simultaneous work streams by resolving architecture dependencies, provisioning blockers, and cross-team coordination issues before they stall sprints
Drive the deployment automation strategy so agents consistently move from dev to production through a standardized, auditable pipeline

Observability, Evals & Quality

Own the observability mandate — driving adoption across all production agents and defining eval standards that reflect real business outcomes, not just infrastructure uptime
Build and maintain CI and CD pipelines with integrated LLM evaluations, prompt versioning, hallucination and failure rate tracking, and regression test suites tied to production datasets
Define and enforce the AI SDLC: style guides, working agreements, code review standards, engineering handbook practices, and the agent quality run-book the full team works against
Own guardrails infrastructure including policy enforcement, adversarial test patterns, and the AI quality pipeline that catches low-quality outputs before they reach production

FinOps, Metrics & Governance

Maintain the AI spend tracking system — cost per team, cost per agent, cost per model — with automated reporting for senior leadership
Own the AI metrics dashboard delivering adoption trends, deployment frequency, token consumption, and per-agent business KPIs to executive stakeholders
Provide compliance and security design leadership for agents operating on sensitive data, including agent identity and managed identity strategy at the platform level

Required Qualifications

9+ years of professional software engineering experience
Deep expertise in Python with strong cloud infrastructure experience, preferably Azure, and hands-on familiarity with API gateway patterns and LLM routing
Demonstrated ability to influence architecture and engineering standards across multiple teams and work streams without direct management authority

Preferred Qualifications

3+ years in a principal, staff, or equivalent senior individual contributor role
Hands-on experience with AI observability tooling such as Arize, LangSmith, or Phoenix and the ability to design eval pipelines that measure what actually matters for business outcomes
Proven track record designing and operating production AI systems — LLM APIs, agent frameworks, RAG architectures, or RPA automation — at meaningful scale
Strong working knowledge of security and compliance requirements for AI systems handling sensitive data, including PHI and PII handling in a regulated environment

Education:

Bachelor's degree preferred, or specialized training or a relevant professional qualification.

Pay Range

The typical pay range for this role is:

$144,200.00 - $288,400.00

This pay range represents the base hourly rate or base annual full-time salary for all positions in the job grade within which this position falls. The actual base salary offer will depend on a variety of factors including experience, education, geography and other relevant factors. This position is eligible for a CVS Health bonus, commission or short-term incentive program in addition to the base pay range listed above. This position also includes an award target in the company’s equity award program.

Our people fuel our future. Our teams reflect the customers, patients, members and communities we serve and we are committed to fostering a workplace where every colleague feels valued and that they belong.

Great benefits for great people

We take pride in offering a comprehensive and competitive mix of pay and benefits that reflects our commitment to our colleagues and their families.

This full‑time position is eligible for a comprehensive benefits package designed to support the physical, emotional, and financial well‑being of colleagues and their families. The benefits for this position include medical, dental, and vision coverage, paid time off, retirement savings options, wellness programs, and other resources, based on eligibility.

Additional details about available benefits are provided during the application process and on Benefits Moments.

We anticipate the application window for this opening will close on: 05/14/2026

Qualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state and local laws.