Data Scientist, API

OpenAI · AI Frontier · San Francisco, CA · Data Science

The Data Scientist on the API team is responsible for building measurement systems, defining KPIs, identifying developer friction, evaluating launches, and translating data into product decisions that improve reliability and developer outcomes at scale. The role partners with Product, Engineering, Research, and Finance to ensure trusted metrics and rigorous experimentation.

What you'd actually do

  1. Own the core KPI framework for the API platform, spanning developer adoption, engagement, retention, and platform health.
  2. Build end-to-end funnels that identify where developers succeed or get stuck, from first integration through scaling to production.
  3. Define and operationalize platform guardrails (e.g., reliability, latency, error rates, cost/efficiency) and connect them to user outcomes.
  4. Design and evaluate experiments and rollouts to quantify the impact of platform and product changes.
  5. Partner with product and engineering teams to improve instrumentation, data quality, and metric definitions so decisions are fast and correct.

Skills

Required

  • Statistics
  • Causal Inference
  • Experimentation Design
  • SQL
  • Python
  • Data Pipelines
  • Business Intelligence Tools
  • Communication Skills

Nice to have

  • Developer Platforms
  • APIs/SDKs
  • Usage-based Products
  • Platform Reliability Analytics
  • Incident Impact Measurement
  • Performance/Cost Optimization
  • AI Evaluation
  • Quality Measurement Systems
  • Online/Offline Evals
  • Human-in-the-loop
  • Safety/Quality Guardrails

What the JD emphasized

  • 10+ years of experience in data science roles within product or technology organizations
  • Expertise in statistics and causal inference
  • Expert-level SQL and proficiency in Python for analytics, modeling, and experimentation
  • Proven experience designing and interpreting experiments and making statistically sound recommendations
  • Experience building datasets, metrics, and data pipelines that power production decision-making
  • Strong product sense and an impact-driven mindset
  • Ability to operate effectively in a fast-moving, ambiguous environment with limited structure
  • Consistently among the first to adopt the latest AI tools, using them daily to increase your own throughput and proactively turning them into durable workflows that change how your team and org operate
  • Familiarity with AI evaluation and quality measurement systems (online/offline evals, human-in-the-loop, safety/quality guardrails)

Other signals

  • developer adoption
  • engagement
  • retention
  • platform health
  • developer friction
  • reliability
  • latency
  • cost/efficiency
  • experimentation
  • product decisions
  • data quality
  • metric definitions
  • AI platform performance
  • developer success
  • platform reliability analytics
  • incident impact measurement
  • performance/cost optimization
  • AI evaluation
  • quality measurement systems
  • online/offline evals
  • human-in-the-loop
  • safety/quality guardrails