What you'd actually do

Deliver world-class and transformative speech solutions for Microsoft first-party and third-party products and services.

Set technical direction for multilingual speech models, speech LLMs, and model customization, with impact on accuracy, latency, and compute.

Build novel data generation solutions to synthesize complex speech scenarios and finetune models.

Build data analysis metrics and solutions to understand the model results, identify gaps, and guide solutions.

Collaborate with global Microsoft teams, drive innovative solutions for significant customer asks, and deliver sustained, large-scale impact.

Skills

Required

BS/MS/PhD degree in CS/EE or a related field with a focus on machine learning, AI, or speech technologies.
10+ years of demonstrated experience in speech or machine learning in an academic or industrial setting, with skills and aptitude for software design, coding, and quality.
Demonstration of excellent problem-solving skills in speech and machine learning areas.
Proven track record of delivering impactful results and high-quality solutions in complex technical environments.
Strong programming skills in Python, C++ or similar languages, with experience in large-scale data processing and distributed computing.
Effective communication skills, both verbal and written.

Nice to have

Experience with speech/audio processing, multilingual model development, or voice agent technologies.
Familiarity with Azure, cloud-based AI platforms, or enterprise-scale deployment of speech solutions.
Contributions to open-source projects, patents, or publications in top-tier conferences/journals.
Demonstrated leadership in driving technical direction, influencing cross-functional teams, and mentoring peers.

What the JD emphasized

advanced multilingual speech models

AOAI finetuning

multimodal generative AI

speech recognition

generative AI

scale model quality

breakthrough technologies

AI speech technologies

multilingual speech model

speech LLMs

model customization

novel data generation solutions

finetune models

data analysis metrics and solutions

customer asks

sustained large impacts

academic or industrial setting with skills and aptitude for software design, coding and quality

excellent problem-solving skills in speech and machine learning areas

Proven track record of delivering impactful results and high-quality solutions in complex technical environments.

Overview

Shape the future of AI speech—join us to build transformative speech technologies for multilingual intelligent experiences that reach billions.

Microsoft is pioneering next-generation AI-driven speech solutions for voice agents, transcription, and call centre analytics.

As a Principal Applied Scientist in Microsoft’s Azure Speech team, you will lead the development of advanced multilingual speech models, Azure OpenAI (AOAI) finetuning, and multimodal generative AI powering real-time and batch transcription, intelligent voice agents, and multilingual technologies across Microsoft products and enterprise solutions. Your work will impact billions, enabling next-generation human–machine experiences for diverse markets, with a special focus on India.

In this strategic role, you will set technical direction and drive innovation in speech recognition, AOAI customisation, and generative AI. You’ll collaborate with top scientists and engineers to scale model quality and deliver breakthrough technologies across AI speech technologies.

Based in Hyderabad, this on-site role offers opportunities to mentor, grow, and shape the future of multimodal interaction for Indian and global audiences.

Microsoft’s mission is to empower every person and organisation to achieve more. We embrace a growth mindset and encourage teams and leaders to bring their best. Join us to shape the future of speech and multimodal LLM technology.

Responsibilities

Deliver world-class and transformative speech solutions for Microsoft first-party and third-party products and services.
Set technical direction for multilingual speech models, speech LLMs, and model customization, with impact on accuracy, latency, and compute.
Build novel data generation solutions to synthesize complex speech scenarios and finetune models.
Build data analysis metrics and solutions to understand the model results, identify gaps, and guide solutions.
Collaborate with global Microsoft teams, drive innovative solutions for significant customer asks, and deliver sustained, large-scale impact.
Mentor and influence peers, sharing expertise and fostering a growth-oriented inclusive team culture.
Contribute to patents and publications at top-tier conferences and represent the team’s technical leadership within and outside Microsoft.

Qualifications

Required Qualifications:

BS/MS/PhD degree in CS/EE or a related field with a focus on machine learning, AI, or speech technologies.
10+ years of demonstrated experience in speech or machine learning in an academic or industrial setting, with skills and aptitude for software design, coding, and quality.
Demonstration of excellent problem-solving skills in speech and machine learning areas.
Proven track record of delivering impactful results and high-quality solutions in complex technical environments.
Strong programming skills in Python, C++ or similar languages, with experience in large-scale data processing and distributed computing.
Effective communication skills, both verbal and written.

Preferred Qualifications:

Experience with speech/audio processing, multilingual model development, or voice agent technologies.
Familiarity with Azure, cloud-based AI platforms, or enterprise-scale deployment of speech solutions.
Contributions to open-source projects, patents, or publications in top-tier conferences/journals.
Demonstrated leadership in driving technical direction, influencing cross-functional teams, and mentoring peers.

Inclusivity & Compliance:

Commitment to fostering an inclusive, growth-oriented team culture.
Adherence to Microsoft’s EEO and diversity guidelines.

This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.