Machine Learning Research Engineer, Siri Speech

Apple Apple · Big Tech · Aachen · Machine Learning and AI

This role focuses on evaluating, analyzing, and improving state-of-the-art end-to-end speech models for Siri. The engineer will design and implement novel evaluation frameworks, develop tools to measure model performance, analyze model behavior, and explore innovative approaches to advance speech capabilities. The role also involves building automated processes for large-scale model evaluation and analysis, collaborating with cross-functional teams.

What you'd actually do

  1. Evaluate emerging Speech LLMs, design and implement novel evaluation frameworks, and develop new tools to measure and enhance model performance.
  2. Analyze model behavior to identify opportunities for improvement in accuracy, robustness, and naturalness.
  3. Explore innovative approaches and improve existing algorithms to push the boundaries of what speech models can achieve.
  4. Build efficient automated processes and tooling to streamline large-scale model evaluation and analysis workflows.
  5. Collaborate with cross-functional teams of engineers and researchers from diverse backgrounds, working together in a dynamic and fast-paced environment to deliver high-impact improvements to Siri’s core speech technologies.

Skills

Required

  • Deep understanding of Machine Learning (ML) fundamentals.
  • Advanced expertise in deep learning, demonstrated through years of industrial roles related to this field as well as publications in relevant conferences, such as Interspeech, ICASSP, ICLR, NeurIPS, etc.
  • Advanced background in speech technologies and related NLP fields.
  • Proficiency in Python and deep learning frameworks such as JAX, PyTorch and TensorFlow.

Nice to have

  • Deep technical expertise on Speech LLMs, ASR or TTS.
  • A track record in software design, coding and parallel computing.
  • Proven software development experience, preferably in a deep learning field.
  • Experience with large scale machine learning training/evaluation.
  • Experience in distributed privacy-preserving ML systems.

What the JD emphasized

  • Advanced expertise in deep learning, demonstrated through years of industrial roles related to this field as well as publications in relevant conferences, such as Interspeech, ICASSP, ICLR, NeurIPS, etc.

Other signals

  • evaluate emerging Speech LLMs
  • design and implement novel evaluation frameworks
  • develop new tools to measure and enhance model performance
  • analyze model behavior
  • explore innovative approaches and improve existing algorithms
  • build efficient automated processes and tooling to streamline large-scale model evaluation and analysis workflows