Senior Applied Scientist

Microsoft · Big Tech · Hyderabad, TS, IN +1 · Applied Sciences

Senior Applied Scientist role focused on building and scaling an AI-driven testing platform for Microsoft's AI products. The role involves using LLMs, prompt engineering, and agent-based workflows to validate product quality, design testing strategies, develop evaluation frameworks, and create actionable insights. The goal is to evolve the platform towards more autonomous and agentic workflows.

What you'd actually do

  1. Build and scale AI‑driven testing capabilities using LLMs, prompts, and agent‑based workflows to validate MAI (Microsoft AI) products across scenarios, geographies, and product surfaces.
  2. Design and optimize prompts, models, and agent behaviors to perform functional, quality, and experience‑focused testing at scale.
  3. Collaborate closely with product and engineering teams across MAI and beyond to understand testing needs and translate them into efficient, AI‑powered testing workflows.
  4. Develop metrics and evaluation frameworks to measure test quality, coverage, effectiveness, and signal accuracy across AI‑driven testing pipelines.
  5. Create actionable outputs and insights (issues, summaries, trends, and recommendations) that product owners can directly consume to fix defects and improve product quality.

Skills

Required

  • Bachelor's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 7+ years of related experience OR Master's Degree in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 5+ years of related experience OR Doctorate in Statistics, Econometrics, Computer Science, Electrical or Computer Engineering, or related field AND 3+ years of related experience OR equivalent experience.
  • 4+ years of solid experience in Data Science, Applied AI, or Machine Learning, with a track record of building solutions that operate at scale.
  • Hands‑on experience with LLMs, prompt engineering, and/or agentic AI systems.
  • Solid foundation in statistics, experimentation, and metrics design, especially for evaluating AI system quality.
  • Experience working with data pipelines, model evaluation, and production systems.
  • Ability to work across multiple product teams, influence without authority, and translate ambiguous testing needs into concrete AI solutions.
  • Solid communication skills to explain complex AI outputs clearly to engineering and product stakeholders.

Nice to have

  • 3+ years of experience creating publications (e.g., patents, libraries, peer-reviewed academic papers).
  • 3+ years of experience developing and deploying live production systems as part of a product team.
  • 3+ years of experience developing and deploying products or systems at multiple points in the product cycle, from ideation to shipping.

What the JD emphasized

  • Hands‑on experience with LLMs, prompt engineering, and/or agentic AI systems.
  • Experience working with data pipelines, model evaluation, and production systems.
  • Ability to work across multiple product teams, influence without authority, and translate ambiguous testing needs into concrete AI solutions.

Other signals

  • AI-powered testing platform
  • Agentic AI
  • LLMs
  • prompt engineering
  • scalable testing