Principal Voice AI Engineer

Zendesk · Enterprise · Berlin, Germany +1

Zendesk is seeking a Principal Voice AI Engineer to lead and accelerate its voice and conversational AI initiatives, with a focus on speech and natural language processing (NLP). The role spearheads the development and deployment of AI/ML technologies for voice-enabled customer experiences, oversees research in ASR, TTS, LLMs, and conversational systems, and turns advanced AI models into production systems.

What you'd actually do

  1. Lead the research, design, and engineering of next-generation Voice AI solutions including noise-robust multilingual ASR, neural TTS, and advanced QA dialog systems fine-tuned with state-of-the-art pretrained models (e.g., BERT, GPT).
  2. Drive collaboration across research scientists, software engineers, and product teams to transform advanced AI models into robust, scalable production systems.
  3. Oversee large-scale AI research and development projects, ensuring delivery of high-quality, real-world solutions optimized for diverse tasks and computing environments.
  4. Architect and implement AI models leveraging deep learning algorithms such as DNNs, CNNs, RNNs, and Transformer-based architectures across speech and NLP pipelines.
  5. Champion best practices in software development, including CI/CD, code reviews, version control (Git), and refactoring to support efficient and maintainable codebases.

Skills

Required

  • Python
  • C++
  • Java
  • Linux/Shell scripting
  • PyTorch
  • TensorFlow
  • Keras
  • Hugging Face Transformers
  • DNN
  • CNN
  • RNN
  • Transformers
  • fine-tuning large pretrained models
  • ASR
  • diarization
  • TTS
  • NMT
  • dialog systems
  • M.S. in Engineering, Computer Science, or a related field

Nice to have

  • experience developing AI-driven speech technologies for complex domains such as autonomous pilot systems or court reporting

What the JD emphasized

  • noise-robust multilingual ASR
  • neural TTS
  • advanced QA dialog systems fine-tuned with state-of-the-art pretrained models
  • noise robustness and multilingual capabilities

Other signals

  • Lead research, design, and engineering of next-generation Voice AI solutions
  • Oversee researchers innovating across Automatic Speech Recognition (ASR), Text-to-Speech (TTS), Large Language Models (LLM), and voice conversational systems
  • Architect and implement AI models leveraging deep learning algorithms such as DNNs, CNNs, RNNs, and Transformer-based architectures across speech and NLP pipelines
  • Experience deploying voice AI systems in production, including ASR, diarization, TTS, NMT, and dialog systems