Member of Technical Staff - Retrieval-augmented Generation (rag)

Microsoft Microsoft · Big Tech · Mountain View, CA +1 · Software Engineering

This role focuses on building large-scale Copilot memory and personalization systems using RAG, search, embeddings, retrieval, and ranking. The individual will be responsible for designing and implementing these systems, collaborating with stakeholders, and mentoring engineers. Experience with LLM applications and large-scale embedding/retrieval/ranking systems is required.

What you'd actually do

  1. Design and build large-scale Copilot memory and personalization systems leveraging search, embeddings, retrieval, ranking, and Retrieval-Augmented Generation (RAG).
  2. Apply subject-matter expertise in cross-product features, collaborating with appropriate stakeholders to drive project plans, release plans, and deliverables across multiple groups.
  3. Hold accountability as a Designated Responsible Individual (DRI), mentoring engineers across products/solutions, working on-call to monitor system/product/service for degradation, downtime, or interruptions.
  4. Proactively seek out new knowledge and adapt to new trends, technical solutions, and patterns that improve the availability, reliability, efficiency, observability, and performance of products while also driving consistency in monitoring and operations at scale, and share knowledge with other engineers.

Skills

Required

  • Bachelor's degree in computer science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
  • Experience building APIs and creating pipelines for large-scale products
  • Experience building and deploying machine learning or large language model (LLM) applications at scale.
  • Experience designing and implementing large-scale embedding, retrieval, and ranking systems.

Nice to have

  • master's degree in computer science or related technical field AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
  • equivalent experience.

What the JD emphasized

  • large-scale Copilot memory and personalization systems
  • Retrieval-Augmented Generation (RAG)
  • large-scale embedding, retrieval, and ranking systems
  • machine learning or large language model (LLM) applications at scale

Other signals

  • Copilot memory and personalization
  • Retrieval-Augmented Generation (RAG)
  • large-scale embedding, retrieval, and ranking systems