Speech recognition, synthesis, and audio understanding — TTS, ASR, voice agents, and audio-native LLMs. Primary AI lifecycle stage: application.
301 active AI roles across 54 companies in our index reference Audio & speech as of today.
The companies with the most active Audio & speech listings are Google (62 roles), Amazon (49 roles), xAI (32 roles), Meta (12 roles), and Apple (10 roles).
Audio & speech belongs primarily to the application stage of the AI lifecycle. In current hiring, Audio & speech roles concentrate at the post-training (24%) and agents (20%) stages.
The sectors with the most active Audio & speech hiring are Big Tech, AI Frontier, and Enterprise.
4 AI roles tagged audio_speech.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Black Forest Labs | Member of Technical Staff - Pretraining | Multimodal | 10 | Pretraining · Multimodal · Frontier research · Model serving · Vision |
| Synthesia | Senior Research Engineer - Audio Post-Training | Multimodal | 8 | Fine-tuning · RL post-training · Model serving · Inference infra · Multimodal |
| Synthesia | Staff Backend Engineer, Voices | Multimodal | 7 | Model serving · Recommender systems |
| Synthesia | Staff Software Engineer, Voices | Multimodal | 7 | Model serving · Inference infra · LLM observability · Evals · Recommender systems |