Why Harvey

At Harvey, we’re transforming how legal and professional services operate. By combining frontier agentic AI, an enterprise-grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.

This is a rare chance to help build a generational company at a true inflection point. With 1500+ customers in 60+ countries, strong product-market fit, and world-class investor support, we’re scaling fast and defining a new category in real time. The work is ambitious, the bar is high, and the opportunity for growth — personal, professional, and financial — is unmatched.

Our team moves fast, takes ownership, and is deeply committed to the mission — operating with intensity, staying close to our customers, and pushing each other for excellence. We live by three values: Decisiveness, Simplicity, and Job's Not Finished. We act quickly on clear judgment over perfect information, we believe simplicity is what scales, and we're never satisfied with where we are. If you want to do the best work of your career alongside people who share that drive, we'd love to build with you.

At Harvey, the future of professional services is being written today — and we’re just getting started.

Role Overview

We’re looking for a technical, systems-minded operator to build and scale the evaluation engine behind Harvey’s platform. As we expand globally, ensuring our models behave reliably, accurately, and correctly for each jurisdiction is mission-critical, and evaluation complexity is increasing 10x.

As a member of our Product Operations team, you’ll work closely with Applied Legal Researchers, Product, Engineering, AI Research, and human data providers to operationalize evaluation methodologies and embed them into our product development lifecycle. You’ll create the workflows, systems, and tooling that make evaluation a first-class product capability at Harvey.

This is a high-ownership role for someone who thrives in ambiguity, loves building structure, and wants to help scale the evaluation infrastructure of a global AI company.

What You'll Do

Build and scale the systems that power model and product evaluations across Harvey
Run intake, triage, and prioritization for the evaluation request queue, routing capacity to the highest-value coverage gaps
Embed evaluation workflows and readiness checkpoints into the product development lifecycle
Create the single source of truth for evaluation status, results, history, and launch readiness
Turn expert-designed evaluation methodologies into scalable, repeatable operational processes
Manage human data providers and stand up our internal contract-attorney pipeline, ensuring evaluation quality meets legal standards
Work with Engineering and Research to improve evaluation tooling, automation, and dashboards
Drive evaluation readiness for major product and model launches across geographies and jurisdictions
Document and operationalize evaluation governance as complexity increases
Help define how Harvey ensures model accuracy, reliability, and trust at global scale

What You Have

4–7+ years in technical program management, product operations, research operations, or evaluation/benchmarking roles
Experience working with ML/AI evaluations, benchmarking frameworks, or scientific workflows
Comfort with statistical methodologies and with SQL, Python, or similar tools for interpreting evaluation data (either natively or with AI tool support)
Strong business acumen with an ability to apply an ROI-focused mindset to scaling
Ability to work deeply with legal experts and operationalize complex evaluation methodologies
Strong cross-functional coordination skills across Product, Engineering, Research, and data providers/vendors
High attention to detail and a bias toward clarity, rigor, and reproducibility
Ability to navigate an evolving landscape and bring order to complex systems
Strong communication skills and comfort translating technical nuance for diverse stakeholders
Desire to do whatever it takes to make evaluation systems successful—from writing documentation to diagnosing pipeline issues

Depending on your location, an Applicant Privacy Notice may apply to you. You can find all of our Applicant Privacy Notices [here].

Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law.

We are committed to providing reasonable accommodations to applicants with disabilities. Requests can be made by emailing accommodations@harvey.ai.


Senior Product Operations Manager, Evaluation
