Research, Post-training

Cognition Cognition · Coding AI · San Francisco, CA · Research & Development

Research role focused on post-training and alignment of AI agents, shaping their behavior and capabilities for real-world usefulness and safety. Blends research and engineering to iterate on datasets, training stages, hyperparameters, and evaluation design, with a focus on understanding and advancing techniques like RLHF/RLAIF for long-horizon tasks.

What you'd actually do

  1. Iterate on the full stack of datasets, training stages, and hyperparameters that determine model behavior. Measure how choices compound across evals and production performance, not just isolated benchmarks.
  2. Build evals that actually capture what matters. The loop never ends: define, optimize, realize the gaps, and rebuild. You'll be responsible for making numbers go up and making sure the numbers mean something.
  3. When training produces results that don't make sense, you dig until you understand why. The goal isn't just to fix it; it's to carry that understanding forward to the next problem.
  4. Apply and advance techniques like RLHF, RLAIF, and constitutional approaches to shape how agents reason, act, and collaborate with humans in long-horizon tasks.
  5. Measure how performance scales with data and compute, and develop new methodologies when existing ones hit ceilings. We expect both rigor and invention.

Skills

Required

  • post-training
  • alignment methods
  • RLHF
  • RLAIF
  • preference modeling
  • reward learning
  • probability
  • statistics
  • ML theory
  • experimental data analysis
  • original contributions
  • large-scale distributed training
  • systems-level thinking
  • fast-moving research environments

Nice to have

  • PhD
  • competitive programmers
  • former founders
  • frontier of AI research

What the JD emphasized

  • track record of advancing ML systems through post-training, alignment, or related methods: RLHF, RLAIF, preference modeling, reward learning, or equivalent
  • Evidence of original contributions: publications at top venues, open-source impact, or equivalent industry results

Other signals

  • building end-to-end software agents
  • iterating on training recipes, evaluations, and alignment methods
  • shaping agent reasoning, action, and collaboration