AI Phd Student Researcher - Fall 2026

Handshake · Enterprise · San Francisco, CA · HAI Research

Handshake AI is seeking a PhD Student Researcher to work on novel RLHF/GRPO pipelines, instruction-following refinements, reasoning-trace supervision, multilingual/long-horizon/domain-specific benchmarks, automatic vs. human preference studies, robustness diagnostics, active-learning loops, data value estimation, synthetic data generation, and low-resource fine-tuning strategies. The goal is to produce an archive-ready manuscript or top-tier conference submission.

What you'd actually do

  1. Novel RLHF / GRPO pipelines, instruction-following refinements, reasoning-trace supervision.
  2. New multilingual, long-horizon, or domain-specific benchmarks; automatic vs. human preference studies; robustness diagnostics.
  3. Active-learning loops, data value estimation, synthetic data generation, and low-resource fine-tuning strategies.

Skills

Required

  • Current PhD student in CS, ML, NLP, or related field
  • Publication track record at top venues (NeurIPS, ICML, ACL, EMNLP, ICLR, etc.)
  • Hands-on experience training and experimenting with LLMs (e.g., PyTorch, JAX, DeepSpeed, distributed training stacks)

Nice to have

  • Prior work on RLHF, evaluation tooling, or data selection methods
  • Contributions to open-source LLM frameworks
  • Public speaking or teaching experience

What the JD emphasized

  • publication track record at top venues (NeurIPS, ICML, ACL, EMNLP, ICLR, etc.)

Other signals

  • AI data business
  • frontier AI lab researchers
  • post-training techniques
  • data spend for AI training will increase
  • LLM Post-Training
  • LLM Evaluation
  • Data Efficiency
  • archive-ready manuscript or top-tier conference submission