Senior Staff Machine Learning Engineer, AI Agent Platform

GEICO · Insurance · New York, NY +3

Senior Staff ML Engineer to lead the technical vision and architecture for GEICO's AI agent platform. This role involves designing and building multi-tenant services for creating, testing, deploying, and hosting LLM-based AI agents, including orchestration, interoperability, skill ecosystems, harness engineering, context management, and safety guardrails. The role requires extensive experience in AI/ML platforms, LLM systems, agentic AI, and SDLC management.

What you'd actually do

Define the long-term technical strategy for GEICO's AI agent platform — including multi-agent orchestration, AI agent lifecycle management, evaluation frameworks, skill registries and marketplace, and workflow orchestration.
Architect an enterprise skill ecosystem — reusable capability packages that encode domain expertise and workflows into portable, discoverable modules. Build and govern an internal skill marketplace with versioning, security vetting, approval workflows, progressive disclosure loading, and usage analytics.
Lead design of production-grade AI agent harnesses (tool dispatch, context management, error recovery, session state, fine-grained Authn/AuthZ) that makes AI agents reliable for long-running workflows. Apply feedforward guides (linters, architecture constraints, spec-driven validation) and feedback sensors (test execution, LLM-as-judge) mixing computational and inferential controls. Design context engineering systems that treat the LLM context window as a managed resource — memory hierarchies, RAG pipelines, context compaction, scratchpads, and dynamic skill/tool loading.
Own high-performance platform components powering end-to-end agentic workflows: MCP server/registry management, A2A communication infrastructure, prompt management, workflow orchestration, guardrail enforcement, and observability pipelines.
Establish AI agent governance frameworks including bounded autonomy, human-in-the-loop escalation, audit trails, prompt guardrails, and RBAC/ABAC access controls. Extend governance to skill-level security — vetting published skills for hidden payloads, injection vectors, and data exfiltration risks.

Skills

Required

8+ years of professional software development experience with at least two languages (Java, C++, Python, Go, or C#).
6+ years designing and building AI/ML platforms using open-source/cloud-agnostic components (Elasticsearch, Qdrant, Kafka, PostgreSQL, MongoDB, Spark, Ray, Temporal, Redis, Neo4j, etc.).
5+ years managing end-to-end SDLCs (CI/CD, Kubernetes, testing, monitoring, production support).
4+ years building training, fine-tuning, and inferencing systems for LLMs, especially on GPU infrastructure.
3+ years designing and operating multi-agent or agentic AI systems in production.
Strong understanding of context engineering — memory architectures, RAG, context compaction, and dynamic information management for LLMs.
Demonstrated track record leading technical initiatives, setting architectural direction, and mentoring across teams.
Bachelor's degree in CS, Engineering, or related field

Nice to have

6+ years with cloud providers (Azure, AWS), including container orchestration and GPU compute.
3+ years building agentic workflows with open-source and proprietary LLMs (Llama, Qwen, Claude, Gpt, etc.).
Hands-on experience with MCP and A2A protocols — MCP server development, AI agent card discovery, task delegation patterns.
Experience with harness engineering. (tool dispatch, error recovery, session state, sub-agent coordination, planning & reasoning)
Experience designing AI agent skill systems: building and governing reusable skill packages, skill marketplaces with discovery, versioning, security vetting, and progressive disclosure.
Experience with context engineering at scale: memory hierarchies, RAG optimization, compaction/summarization, state isolation, etc.
Experience with multi-agent orchestration frameworks (LangGraph, AutoGen, CrewAI).
Experience with LLM observability & evaluation platforms (LangSmith, Arize Phoenix, Langfuse).
Experience building guardrail systems (prompt injection defense, PII detection, skill-level security auditing).
Understanding of AI safety, model governance, and regulatory compliance in regulated industries.
advanced degree highly desirable

What the JD emphasized

multi-agent orchestration
AI agent skill ecosystem
production-grade AI agent harnesses
context engineering systems
AI agent governance frameworks
multi-agent or agentic AI systems in production
context engineering — memory architectures, RAG, context compaction, and dynamic information management for LLMs
agentic workflows
AI safety, model governance, and regulatory compliance in regulated industries

Other signals

AI Agent Platform
multi-agent orchestration
enterprise scale
LLM-based AI agents

Read full job description

**At GEICO, we offer a rewarding career where your ambitions are met with endless possibilities. **

**Every day we honor our iconic brand by offering quality coverage to millions of customers and being there when they need us most. We thrive through relentless innovation to exceed our customers’ expectations while making a real impact for our company through our shared purpose. **

When you join our company, we want you to feel valued, supported and proud to work here. That’s why we offer The GEICO Pledge: Great Company, Great Culture, Great Rewards and Great Careers.

Sr. Staff Machine Learning Engineer – AI Agent Platform

Position Description

GEICO is seeking an exceptional Sr. Staff ML Engineer to join our AI organization. You will serve as a technical leader and key architect for GEICO's virtual assistant platform that elevates productivity for 30K+ internal associates and the customer experience for millions of policyholders.

Sr. Staff AI Agent Platform Engineers set the technical vision and drive the architecture of multi-tenant services that power the building, testing, deployment, and hosting of LLM-based AI agents. This includes multi-agent orchestration, standardized interoperability protocols (MCP, A2A), AI agent skill ecosystems with marketplace and governance capabilities, production-grade harness & context engineering, and guardrail frameworks for safe autonomous operation at enterprise scale.

Responsibilities

Technical Vision & Architecture: Define the long-term technical strategy for GEICO's AI agent platform — including multi-agent orchestration, AI agent lifecycle management, evaluation frameworks, skill registries and marketplace, and workflow orchestration.
AI Agent Skills & Marketplace: Architect an enterprise skill ecosystem — reusable capability packages that encode domain expertise and workflows into portable, discoverable modules. Build and govern an internal skill marketplace with versioning, security vetting, approval workflows, progressive disclosure loading, and usage analytics.
Harness & Context Engineering: Lead design of production-grade AI agent harnesses (tool dispatch, context management, error recovery, session state, fine-grained Authn/AuthZ) that makes AI agents reliable for long-running workflows. Apply feedforward guides (linters, architecture constraints, spec-driven validation) and feedback sensors (test execution, LLM-as-judge) mixing computational and inferential controls. Design context engineering systems that treat the LLM context window as a managed resource — memory hierarchies, RAG pipelines, context compaction, scratchpads, and dynamic skill/tool loading.
Platform & Interoperability: Own high-performance platform components powering end-to-end agentic workflows: MCP server/registry management, A2A communication infrastructure, prompt management, workflow orchestration, guardrail enforcement, and observability pipelines.
AI Safety & Governance: Establish AI agent governance frameworks including bounded autonomy, human-in-the-loop escalation, audit trails, prompt guardrails, and RBAC/ABAC access controls. Extend governance to skill-level security — vetting published skills for hidden payloads, injection vectors, and data exfiltration risks.
Leadership: Collaborate cross-functionally with data scientists, engineers, product managers, and designers. Mentor engineers at all levels. Elevate AI engineering best practices — including harness engineering patterns and agentic coding tools — across the company.

Basic Qualifications

8+ years of professional software development experience with at least two languages (Java, C++, Python, Go, or C#).
6+ years designing and building AI/ML platforms using open-source/cloud-agnostic components (Elasticsearch, Qdrant, Kafka, PostgreSQL, MongoDB, Spark, Ray, Temporal, Redis, Neo4j, etc.).
5+ years managing end-to-end SDLCs (CI/CD, Kubernetes, testing, monitoring, production support).
4+ years building training, fine-tuning, and inferencing systems for LLMs, especially on GPU infrastructure.
3+ years designing and operating multi-agent or agentic AI systems in production.
Strong understanding of context engineering — memory architectures, RAG, context compaction, and dynamic information management for LLMs.
Demonstrated track record leading technical initiatives, setting architectural direction, and mentoring across teams.
Bachelor's degree in CS, Engineering, or related field; advanced degree highly desirable.

Preferred Qualifications

6+ years with cloud providers (Azure, AWS), including container orchestration and GPU compute.
3+ years building agentic workflows with open-source and proprietary LLMs (Llama, Qwen, Claude, Gpt, etc.).
Hands-on experience with MCP and A2A protocols — MCP server development, AI agent card discovery, task delegation patterns.
Experience with harness engineering. (tool dispatch, error recovery, session state, sub-agent coordination, planning & reasoning)
Experience designing AI agent skill systems: building and governing reusable skill packages, skill marketplaces with discovery, versioning, security vetting, and progressive disclosure.
Experience with context engineering at scale: memory hierarchies, RAG optimization, compaction/summarization, state isolation, etc.
Experience with multi-agent orchestration frameworks (LangGraph, AutoGen, CrewAI).
Experience with LLM observability & evaluation platforms (LangSmith, Arize Phoenix, Langfuse).
Experience building guardrail systems (prompt injection defense, PII detection, skill-level security auditing).
Understanding of AI safety, model governance, and regulatory compliance in regulated industries.

If you are passionate about pushing the boundaries of generative AI platforms, thrive in a hands-on technical leadership role, and enjoy solving complex, large-scale problems, we encourage you to apply!

Annual Salary

$130,000.00 - $300,000.00

The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate/ annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role, the selected candidate’s work experience, education and training, the work location as well as market and business considerations.

GEICO will consider sponsoring a new qualified applicant for employment authorization for this position.

The GEICO Pledge:

Great Company: At GEICO, we help our customers through life’s twists and turns. Our mission is to protect people when they need it most and we’re constantly evolving to stay ahead of their needs.

We’re an iconic brand that thrives on innovation, exceeding our customers’ expectations and enabling our collective success. From day one, you’ll take on exciting challenges that help you grow and collaborate with dynamic teams who want to make a positive impact on people’s lives.

Great Careers: We offer a career where you can learn, grow, and thrive through personalized development programs, created with your career – and your potential – in mind. You’ll have access to industry leading training, certification assistance, career mentorship and coaching with supportive leaders at all levels.

Great Culture: We foster an inclusive culture of shared success, rooted in integrity, a bias for action and a winning mindset. Grounded by our core values, we have an an established culture of caring, inclusion, and belonging, that values different perspectives. Our teams are led by dynamic, multi-faceted teams led by supportive leaders, driven by performance excellence and unified under a shared purpose.

As part of our culture, we also offer employee engagement and recognition programs that reward the positive impact our work makes on the lives of our customers.

Great Rewards: We offer compensation and benefits built to enhance your physical well-being, mental and emotional health and financial future.

Comprehensive Total Rewards program that offers personalized coverage tailor-made for you and your family’s overall well-being.
Financial benefits including market-competitive compensation; a 401K savings plan vested from day one that offers a 6% match; performance and recognition-based incentives; and tuition assistance.
Access to additional benefits like mental healthcare as well as fertility and adoption assistance.
Supports flexibility- We provide workplace flexibility as well as our GEICO Flex program, which offers the ability to work from anywhere in the US for up to four weeks per year.

The equal employment opportunity policy of the GEICO Companies provides for a fair and equal employment opportunity for all associates and job applicants regardless of race, color, religious creed, national origin, ancestry, age, gender, pregnancy, sexual orientation, gender identity, marital status, familial status, disability or genetic information, in compliance with applicable federal, state and local law. GEICO hires and promotes individuals solely on the basis of their qualifications for the job to be filled.

GEICO reasonably accommodates qualified individuals with disabilities to enable them to receive equal employment opportunity and/or perform the essential functions of the job, unless the accommodation would impose an undue hardship to the Company. This applies to all applicants and associates. GEICO also provides a work environment in which each associate is able to be productive and work to the best of their ability. We do not condone or tolerate an atmosphere of intimidation or harassment. We expect and require the cooperation of all associates in maintaining an atmosphere free from discrimination and harassment with mutual respect by and for all associates and applicants.