Staff/principal Research Scientist

Snorkel AI Snorkel AI · Data AI · Redwood City, CA +1 · Remote · 316 - Research

Research Scientist at Snorkel AI focused on designing, implementing, and validating novel AI techniques for data development, such as synthetic data generation using LLM as a Judge. The role involves prototyping and building end-to-end workflows, integrating research into scalable systems, and collaborating with partners to test solutions in applied settings. Emphasis on rapid iteration and driving innovation from research into production.

What you'd actually do

  1. Design, implement, and validate novel AI techniques for data development** **such as synthetic data generation, utilizing techniques such as LLM as a Judge
  2. Prototype and build end-to-end workflows, integrating research ideas into scalable systems.
  3. Write high-quality, maintainable code, ensuring robust implementation of research-driven innovations.
  4. Move fast and adapt—iterating on solutions in response to new challenges, customer needs, and emerging research.
  5. Work closely with real-world design partners, testing solutions in applied settings with measurable impact.

Skills

Required

  • Python
  • machine learning frameworks (NumPy, Scikit-learn, Pandas, PyTorch, TensorFlow, etc.)
  • software engineering best practices (e.g., clean coding, modular design, version control)
  • ML infrastructure, cloud platforms (AWS, Google Cloud), and accelerators (GPUs, TPUs)

Nice to have

  • AI, NLP, multi-modal models, LLMs, and generative AI, with an emphasis on applied research and system-building
  • PhD in Computer Science, Machine Learning, AI, or a related field, with 4+ years of industry or postdoctoral research experience; or equivalent experience
  • impactful research publications in top-tier conferences or journals (e.g., NeurIPS, ICML, ACL), demonstrating thought leadership in AI/ML
  • Experience in developing, experimenting, and deploying AI models at scale

What the JD emphasized

  • impactful research publications in top-tier conferences or journals
  • developing, experimenting, and deploying AI models at scale
  • PhD in Computer Science, Machine Learning, AI, or a related field, with 4+ years of industry or postdoctoral research experience; or equivalent experience

Other signals

  • transform expert knowledge into specialized AI at scale
  • translate research breakthroughs into scalable, practical applications
  • rapid iteration, solving open-ended challenges, and driving innovation from research into production