Skills

Required

PhD or equivalent research depth in machine learning
First-author publications at top venues
Ability to move from theory through implementation to empirical results
Judgment about problem selection
Ability to distinguish research that advances a metric from research that changes how systems are built
Willingness to operate in a startup environment

Nice to have

Experience with production ML systems
Understanding of constraints causing academic solutions to fail in deployment
Background spanning multiple research areas (e.g., both interpretability and RL, or both systems and training methodology)
Track record of open-source contributions or community building in ML research

ABOUT BASETEN

Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, flexible infrastructure, and seamless developer tooling, we enable companies operating at the frontier of AI to bring cutting-edge models into production. We're growing quickly and recently raised our $300M Series E, backed by investors including BOND, IVP, Spark Capital, Greylock, and Conviction. Join us and help build the platform engineers turn to to ship AI products.

THE ROLE

This role sits at the frontier of our research agenda. You will pursue open problems at the intersection of post-training methodology and performant inference, then collaborate with research engineering to translate findings into production systems. Roughly a third of your time will be dedicated to pure research: questions that may not have immediate product application but deepen our understanding of models' ability to learn, alignment, or architectural efficiency. The remainder will be directed toward research that solves concrete training problems for Baseten's platform and customers, which are the fastest-growing AI companies in the world, such as Cursor, Lovable, and Notion.

We are looking for someone with sharp research taste and genuine creative instinct for problem selection: someone who can identify questions that matter, design clean experiments to answer them, and push the state of the art. The work here is not purely theoretical. It is research that can be validated with eager customers who are serving billions of tokens a second.

RECENT RESEARCH

Dense, on-policy or both?
Repeated kv cache for long-running agents
Distillation without the dark – replicating black-box on-policy distillation on Baseten

RESPONSIBILITIES

Define and pursue a research agenda spanning both pure and applied work, with the applied component connected to Baseten's platform and customer needs
Design and execute rigorous experiments, frequently at meaningful scale (multi-node, 1T+ parameter models)
Publish at top venues (NeurIPS, ICML, ICLR) and establish Baseten's research presence
Collaborate with model performance and training infrastructure teams to bridge research findings and production systems
Mentor junior researchers and shape the technical direction of the research organization as it grows

QUALIFICATIONS

PhD or equivalent research depth in machine learning, with first-author publications at top venues
Demonstrated ability to move from theory through implementation to empirical results — not exclusively theoretical or exclusively engineering work
Judgment about problem selection: the ability to distinguish research that advances a metric from research that changes how systems are built
Willingness to operate in a startup environment where the majority of research informs product decisions, with timelines measured in months rather than years

PREFERRED QUALIFICATIONS

Experience with production ML systems and an understanding of the constraints that cause academic solutions to fail in deployment
Background spanning multiple research areas (e.g., both interpretability and RL, or both systems and training methodology)
Track record of open-source contributions or community building in ML research

BENEFITS

Competitive compensation, including meaningful equity
100% coverage of medical, dental, and vision insurance for employees and dependents
Flexible PTO policy, including a company-wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
Paid parental leave
Fertility and family-building stipend through Carrot
Company-facilitated 401(k)
Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities

Apply now to embark on a rewarding journey in shaping the future of AI! If you are a motivated individual with a passion for machine learning and a desire to be part of a collaborative and forward-thinking team, we would love to hear from you.

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants without regard to race, color, religion, gender, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, or veteran status.

We are an Equal Opportunity Employer and will consider qualified applicants with criminal histories in a manner consistent with applicable law (for example, the requirements of the San Francisco Fair Chance Ordinance, where applicable).