Software Engineer, ML Research

at Cursor · Coding AI · San Francisco, CA · Engineering

Research Engineer at Cursor focused on building the training, inference, and data systems for frontier coding models, including scaling RL on user data and creating infrastructure for large-scale data jobs.

What you'd actually do

  1. Build our distributed training, inference, and RL infrastructure
  2. Write libraries to simplify how researchers do large-scale data jobs
  3. Architect the systems that turn Cursor user data into effective training data

Skills

Required

  • infrastructure/distributed systems background
  • architect and ship end-to-end with high ownership
  • strong intuitions about how language models work

Nice to have

  • ML

What the JD emphasized

  • frontier coding agents
  • scale RL on real user data
  • distributed training, inference, and RL infrastructure
  • large-scale data jobs
  • Cursor user data into effective training data

Other signals

  • frontier coding agents
  • scale RL on real user data
  • distributed training, inference, and RL infrastructure
  • large-scale data jobs
  • turn Cursor user data into effective training data
Read full job description

Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.

Research Engineer

Cursor is building the future of coding. We train frontier coding agents and scale RL on real user data to make them increasingly effective.

About the role

We’re looking for Research Engineers to build the training, inference, and data systems behind our frontier coding models. You’ll work directly with researchers to make progress repeatable and iteration fast.

What you’ll do

  • Build our distributed training, inference, and RL infrastructure
  • Write libraries to simplify how researchers do large-scale data jobs
  • Architect the systems that turn Cursor user data into effective training data

You may be a fit if

  • You have a strong infrastructure/distributed systems background
  • You are able to architect and ship end-to-end with high ownership
  • You have strong intuitions about how language models work
  • You’re excited to learn more about ML

#LI-DNI