Principal Applied Scientist

Microsoft Microsoft · Big Tech · Redmond, WA +1 · Applied Sciences

This role focuses on building and productionizing machine learning and generative AI systems for conversational commerce experiences within Microsoft Copilot. It involves developing models for product discovery, ranking, personalization, and reasoning, as well as LLM-based systems for conversational shopping, including RAG and tool orchestration. The role also addresses quality and trust challenges and defines evaluation frameworks, aiming to translate models into low-latency, reliable user-facing experiences.

What you'd actually do

  1. Design, build, and productionize machine learning models for product discovery, ranking, recommendation, and personalization using large-scale commerce and behavioral data.
  2. Develop LLM-based systems for conversational shopping, including prompt design, retrieval-augmented generation, tool orchestration, and grounding against structured commerce data.
  3. Address quality and trust challenges such as hallucination risk, stale data, and recommendation reliability.
  4. Define evaluation frameworks and experimentation strategies for conversational and proactive shopping scenarios, including offline metrics and online experiments.
  5. Partner closely with product, engineering, and design teams to translate models into low-latency, reliable Copilot experiences.

Skills

Required

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience
  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 4+ years related experience
  • Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years related experience
  • equivalent experience
  • programming for data science (e.g. using Python or R for data analysis and modeling)
  • experience with data querying languages (e.g. SQL)
  • Hands-on experience with large-scale data processing using tools like Apache Spark or Azure Databricks for training and inference workflows.

Nice to have

  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 9+ years related experience
  • Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 6+ years related experience
  • equivalent experience
  • 3+ years of hands-on experience developing machine learning or statistical models to solve real-world problems (in industry or academic projects), including building and validating algorithms such as regressions, classifiers, or clustering models.
  • Advanced Analytics: Skilled in time-series analysis and anomaly detection techniques (e.g., ARIMA, isolation forests) applied to business contexts for actionable insights.
  • Practical experience with prompt engineering, fine-tuning GPT-like models, and applying LLMs in domain-heavy areas (healthcare, agriculture, social sciences) while ensuring privacy and Responsible AI compliance.

What the JD emphasized

  • productionize machine learning models
  • LLM-based systems
  • retrieval-augmented generation
  • tool orchestration
  • grounding against structured commerce data
  • hallucination risk
  • stale data
  • recommendation reliability
  • evaluation frameworks
  • experimentation strategies
  • low-latency, reliable Copilot experiences

Other signals

  • LLM-based systems
  • product discovery, ranking, personalization
  • conversational shopping
  • low-latency, reliable Copilot experiences