(Senior) Staff Research Scientist | Voice

DeepL · AI Frontier · Munich, Germany · Research

Senior Staff Research Scientist at DeepL working on voice translation. The role leads research and development across ASR, MT, TTS, and speech-to-speech translation, with an emphasis on real-time, low-latency streaming systems. Responsibilities include designing, training, and optimizing large-scale models; improving cascaded translation pipelines; developing TTS models; building end-to-end systems; and owning the full lifecycle from prototyping to production deployment. The role also requires close collaboration with engineering, driving inference efficiency, and establishing best practices for evaluation and monitoring.

What you'd actually do

  1. Lead hands-on research and development across ASR, MT, TTS, and speech-to-speech translation for real-time voice products.
  2. Design, train, and optimize large-scale ASR models for multilingual accuracy, robustness, and ultra-low-latency streaming.
  3. Improve cascaded translation pipelines end to end: segmentation, ASR→MT interfaces, streaming MT inference, and incremental decoding.
  4. Develop and refine real-time TTS models with natural prosody, stable speaker characteristics, and fast inference.
  5. Build and experiment with end-to-end and LLM-based speech-to-speech translation systems, including streaming and one-shot approaches.

Skills

Required

  • Deep expertise in speech, audio, or multilingual ML, particularly in ASR, MT, TTS, end-to-end ST, or large speech models.
  • Hands-on builder mindset: enjoys training models, running experiments, debugging pipelines, and integrating ML systems into production.
  • Strong understanding of real-time streaming constraints and how to design models that operate reliably at low latency.
  • Experience shipping ML models to production, maintaining them at scale, and working with engineers on deployment, monitoring, and serving.
  • Ability to lead complex research efforts while staying grounded in product impact, user experience, and real-world performance.
  • Strong coding and experimentation skills (Python, PyTorch/JAX, audio processing libraries).
  • Ability to communicate clearly, collaborate across teams, and align research work with product and engineering priorities.
  • Proven experience mentoring others and elevating technical quality across a fast-moving, applied research team.

What the JD emphasized

  • ultra-low-latency streaming
  • real-time TTS models
  • fast inference
  • end-to-end and LLM-based speech-to-speech translation systems
  • streaming and one-shot approaches
  • real-time systems
  • reliability, uptime, and quality at scale
  • inference efficiency
  • model serving
  • voice UX
  • robustness to real-world acoustic conditions
  • evaluation, reproducibility, monitoring, and continuous model improvement in production
  • real-time streaming constraints
  • low latency
  • shipping ML models to production
  • maintaining them at scale
  • deployment, monitoring, and serving

Other signals

  • leading scientific innovation
  • define long-term scientific strategy
  • prototype rapidly
  • run large-scale experiments
  • drive breakthroughs all the way into production
  • work across ASR, MT, TTS, streaming inference, and large speech models
  • leading both cascaded and emerging end-to-end speech-translation approaches