Staff Genai Research Scientist - Retrieval

Databricks Databricks · Data AI · San Francisco, CA · Engineering - Pipeline

Staff Research Scientist focused on Information Retrieval for enterprise agents, RAG, and embeddings. The role involves defining and leading research agendas, driving algorithmic innovations, and publishing at top ML/systems conferences.

What you'd actually do

  1. Define and lead independent research agendas on Information Retrieval for reusable Agent components, conducting experiments to empirically validate hypotheses and benchmark against state-of-the-art approaches.
  2. Drive research and technical strategy across enterprise retrieval and search, including workspace search, vector search.
  3. Drive algorithmic innovations for Retrieval within Large Language Models.
  4. Experience with Agent Benchmarks for retrieval, text-embeddings and LLM Data Quality.

Skills

Required

  • PhD in Computer Science or related field
  • strong foundations in Natural Language Processing
  • strong foundations in Information Retrieval
  • Developing and implementing methods that extend and improve model capabilities, reliability, and safety.

Nice to have

  • first-author publications at top ML/systems conferences (EMNLP, ICLR, ACL, NeurIPs) focused on optimization or efficiency

What the JD emphasized

  • first-author publications at top ML/systems conferences (EMNLP, ICLR, ACL, NeurIPs) focused on optimization or efficiency

Other signals

  • advancing the scientific frontier
  • creating new techniques
  • state of the art LLMs and AI systems