268 AI roles tagged audio_speech.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| Meta | Staff Research Scientist, FAIR (RL / LLM's) | Big Tech | 10 | Frontier research · Pretraining · Vision · Multimodal · Code gen |
| Senior Staff Software Engineer, Cognitive Architecture, Special Projects | Big Tech | 10 | Interpretability · Agent orchestration · Agent research · RL robotics · Model serving · Evals | |
| Apple | AIML - Machine Learning Researcher, MLR | Big Tech | 10 | Frontier research · Pretraining · RL post-training · Multimodal |
| Meta | AI Research Scientist - Meta Superintelligence Labs | Big Tech | 10 | Frontier research · Pretraining · RL post-training · Fine-tuning · Evals |
| Meta | AI Research Scientist, Audio-Visual Understanding, FAIR | Big Tech | 10 | Multimodal · Vision · Frontier research · Evals |
| Meta | Research Scientist Intern, FAIR - Language & Multimodal Foundations (PhD) | Big Tech | 10 | Frontier research · Pretraining · Multimodal · Vision |
| Senior Software Engineer, AI/ML, LLM Modeling | Big Tech | 9 | Fine-tuning · RL post-training · Model serving · Inference infra · Evals · LLM observability · Agent orchestration · Tool use · RAG | |
| Research Software Engineer | Big Tech | 9 | Model serving · Inference infra · LLM observability | |
| Amazon | Principal Solutions Architect, Generative AI, AWS Industries, Telco | Big Tech | 9 | Agent orchestration · Tool use · RAG · Fine-tuning · Model serving · Multimodal · Guardrails |
| Meta | Research Scientist Intern, Photorealistic Telepresence (PhD) | Big Tech | 9 | Multimodal · Vision · Agent research · Frontier research · Fine-tuning |
| Research Software Engineer, Multimodal AI | Big Tech | 9 | Agent orchestration · Multimodal · Vision · LLM observability · Fine-tuning · Evals | |
| Staff Software Engineer, On-Device Hybrid Multimodal AI | Big Tech | 9 | Agent orchestration · Multimodal · Model serving · Inference infra · Vision | |
| Amazon | Applied Scientist II | Big Tech | 9 | Fine-tuning |
| Applied AI Engineer, Audio, XR | Big Tech | 9 | Fine-tuning · Model serving · Inference infra | |
| Senior Software Engineer | Big Tech | 9 | Model serving · Inference infra · Multimodal · LLM observability | |
| Senior Technical Program Manager Lead, Gemini Audio, DeepMind | Big Tech | 9 | Evals · Model serving · Fine-tuning · Frontier research | |
| Gemini Audio Research Scientist, DeepMind | Big Tech | 9 | RL post-training · Evals · Multimodal | |
| Amazon | Sr. Applied Scientist, Trust CX Innovations&AI Policy | Big Tech | 9 | Multimodal · Frontier research · Fine-tuning · Model serving |
| Senior Staff Software Engineer, AI/ML, Applied AI | Big Tech | 9 | Agent orchestration · Multimodal · Model serving · Evals | |
| Research Scientist, Frontier Health, DeepMind | Big Tech | 9 | Agent orchestration · Multimodal · RL post-training · Reward modeling · Evals · Tool use · Vision | |
| Amazon | Applied Scientist II, Amazon AWS Agentic AI, AWS AI Fundamental Research | Big Tech | 9 | Agent research · Frontier research · Multimodal · Vision · Agent orchestration · Fine-tuning |
| Amazon | Applied Science Manager, Alexa Edge AI | Big Tech | 9 | Multimodal · Model serving · Inference infra |
| Amazon | Applied Scientist, Alexa Edge AI | Big Tech | 9 | Vision · Multimodal · Fine-tuning · Frontier research · Model serving · Inference infra |
| Amazon | Applied Scientist, Alexa Edge AI | Big Tech | 9 | Multimodal · Vision · Fine-tuning · Model serving · Inference infra |
| Amazon | Applied Scientist, Alexa Edge AI | Big Tech | 9 | Vision · Multimodal · Fine-tuning · Model serving · Inference infra |
| Senior Software Engineer, Applied AI Commerce | Big Tech | 9 | Agent orchestration · Multimodal · Evals · Guardrails · RAG · LLM observability · Tool use · Vision | |
| Apple | Sr. Machine Learning Research Engineer, Siri Speech | Big Tech | 9 | Multimodal · Fine-tuning · Model serving · Inference infra · Frontier research |
| Meta | Research Scientist Intern, Multimodal AI (PhD) | Big Tech | 9 | Multimodal · Evals · Fine-tuning · LLM observability |
| Senior Staff Software Engineer, Applied AI | Big Tech | 9 | Agent orchestration · Model serving · Inference infra · Fine-tuning · Evals · RL robotics | |
| Apple | Machine Learning Architect - Conversational Speech | Big Tech | 9 | Multimodal · Model serving · Fine-tuning · Inference infra |
Speech recognition, synthesis, and audio understanding — TTS, ASR, voice agents, and audio-native LLMs.
Primary AI lifecycle stage: application.
As of today, 301 active AI roles across 54 companies in our index reference Audio & speech. Hiring concentrates at the post-training (24%) and agents (20%) stages. Most common sectors: Big Tech, AI Frontier, Enterprise.
Speech recognition, synthesis, and audio understanding — TTS, ASR, voice agents, and audio-native LLMs. Primary AI lifecycle stage: application.
301 active AI roles across 54 companies in our index reference Audio & speech as of today.
The companies with the most active Audio & speech listings are: Google (62 roles), Amazon (49 roles), xAI (32 roles), Meta (12 roles), Apple (10 roles).
Audio & speech primarily belongs to the application stage of the AI lifecycle. In current hiring, Audio & speech roles concentrate at: post-training (24%), agents (20%).
The sectors with the most active Audio & speech hiring are: Big Tech, AI Frontier, Enterprise.