Senior/staff Research Scientist - Frontier Benchmarks

Snorkel AI Snorkel AI · Data AI · Redwood City, CA +1 · Remote · 316 - Research

Research Scientist role focused on designing datasets for frontier model training and evaluation, translating benchmark insights, and staying at the forefront of LLM evaluation research. This is a customer-facing role that collaborates cross-functionally and influences company roadmap and external research.

What you'd actually do

  1. Design state of the art datasets that drive frontier model training and evaluation based on current model performance and academic partnerships
  2. Translate benchmark insights into clear, compelling narratives that articulate the ROI of expert-curated data for customer-facing presentations, technical reports, and go-to-market materials.
  3. Work cross-functionally with data operations, product, engineering, and strategy to surface research findings that inform the company roadmap.
  4. Stay at the frontier of LLM evaluation research and bring best practices into Snorkel's workflows
  5. Represent Snorkel's research externally through publications, blog posts, conference talks, and customer engagements that advance the conversation around data-centric AI

Skills

Required

  • AI/ML evaluation
  • NLP
  • rigorous experimental design
  • measuring the impact of training and evaluation data on model behavior
  • communication skills
  • present complex technical findings clearly to both technical and non-technical audiences
  • operating in a fast-moving, cross-functional environment with ambiguous problem spaces

Nice to have

  • Ph.D. in machine learning, NLP, or a related field
  • Genuine interest in GTM strategy, startup dynamics, and the commercial side of AI data services.

What the JD emphasized

  • track record of rigorous experimental design
  • frontier model training and evaluation
  • LLM evaluation research

Other signals

  • design state of the art datasets that drive frontier model training and evaluation
  • stay at the frontier of LLM evaluation research
  • customer-facing role at the intersection of research, company strategy, and go-to-market