Snr Applied Scientist

Microsoft Microsoft · Big Tech · Egypt · Applied Sciences

Lead the science behind Copilot Discover's ranking and content-quality stack, combining LLMs, multimodal models, and large-scale recommender systems to drive measurable gains in engagement, satisfaction, and trust. This role involves setting technical direction, mentoring scientists, and partnering with engineering, PM, UXR, and policy to ship end-to-end outcomes.

What you'd actually do

  1. Lead content‑quality understanding at scale. Design and deploy models that assess credibility, usefulness, freshness, safety, and diversity across modalities; reduce misinformation/toxicity error rates through prompt‑ and model‑level innovations; build human‑in‑the‑loop and active‑learning pipelines that get better over time.
  2. Champion safety & trust. Partner with policy and platform teams to encode safety standards and editorial principles into the ML system; create red‑teaming, adversarial, and safeguard layers for generative and curated experiences.
  3. Scale E2E ML systems. Collaborate with engineering on data contracts, feature stores, distributed training/inference, and automated rollout/rollback; drive architectural investments that increase agility and reliability of Discover’s AI platform.
  4. Own evaluation and experimentation. Define offline metrics (e.g., Rejection Rate, ERR, Defect Rate) and online methodologies (A/B tests, interleaving, counterfactual & bandit approaches) to confidently attribute business impact and guard against regressions.
  5. Mentor & influence. Provide technical leadership across problem framing, methodology selection, code quality, and publishing/knowledge‑sharing; uplevel peers through design reviews, deep‑dives, and principled decision‑

Skills

Required

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field & related experience
  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering experience
  • Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field experience
  • equivalent experience
  • Experience working with natural language understanding
  • Experience in Python and at least one major deep learning framework (PyTorch/TensorFlow) with large‑scale data processing and training
  • Experience with evaluation & experimentation (offline metrics, A/B testing, bandits) and ML model development lifecycle

Nice to have

  • Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND related experience
  • Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND related experience
  • equivalent experience
  • Have publications at top AI/ML conferences (e.g., KDD, SIGIR, EMNLP, NIPS, ICML, ICLR, RecSys, ACL, CIKM, CVPR, ICCV, etc.)
  • Expertise with LLMs (prompting, RAG, Parameter-Efficient Fine-Tuning), multimodal modeling, and retrieval‑augmented recommendation; familiarity with counterfactual learning and multi‑objective optimization
  • Experience building content integrity/safety systems (e.g., misinformation, harmful content, low‑quality/duplicate detection) and quality‑aware ranking
  • Demonstrated ability to lead cross‑disciplinary efforts (PM, ENG, UXR, editorial/policy) from idea to shipped business impact; mentoring scientists and setting technical vision
  • Familiarity with Microsoft stack (e.g., Azure ML, Kusto, Synapse, Azure AI Foundry)

What the JD emphasized

  • Lead content‑quality understanding at scale
  • Champion safety & trust
  • Scale E2E ML systems
  • Own evaluation and experimentation
  • Mentor & influence
  • Stay close to users
  • Experience working with natural language understanding
  • Experience with evaluation & experimentation (offline metrics, A/B testing, bandits) and ML model development lifecycle
  • Have publications at top AI/ML conferences
  • Expertise with LLMs (prompting, RAG, Parameter-Efficient Fine-Tuning), multimodal modeling, and retrieval‑augmented recommendation
  • Experience building content integrity/safety systems
  • Demonstrated ability to lead cross‑disciplinary efforts

Other signals

  • LLMs
  • multimodal models
  • recommender systems
  • content quality
  • safety
  • evaluation