Speech recognition, synthesis, and audio understanding — TTS, ASR, voice agents, and audio-native LLMs.
Primary AI lifecycle stage: application.
As of today, 301 active AI roles across 54 companies in our index reference Audio & speech. Hiring concentrates at the post-training (24%) and agents (20%) stages. Most common sectors: Big Tech, AI Frontier, Enterprise.
Speech recognition, synthesis, and audio understanding — TTS, ASR, voice agents, and audio-native LLMs. Primary AI lifecycle stage: application.
301 active AI roles across 54 companies in our index reference Audio & speech as of today.
The companies with the most active Audio & speech listings are: Google (62 roles), Amazon (49 roles), xAI (32 roles), Meta (12 roles), Apple (10 roles).
Audio & speech primarily belongs to the application stage of the AI lifecycle. In current hiring, Audio & speech roles concentrate at: post-training (24%), agents (20%).
The sectors with the most active Audio & speech hiring are: Big Tech, AI Frontier, Enterprise.
2 AI roles tagged audio_speech.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Expedia | Senior Machine Learning Engineer (Gen AI & Multi-Agentic Systems) | Hospitality | 9 | Agent orchestration · RAG · Vector DB · Fine-tuning · RL post-training · Inference infra · Model serving · Multimodal · Vision · Code gen · Evals · Guardrails · LLM observability |
| Expedia | Senior Product Manager, Agentic Voice Experience | Hospitality | 7 | Agent orchestration |