Sr Engineer, Machine Learning Engineering

T-Mobile T-Mobile · Telecom · Bellevue, WA +4

Senior Engineer, Machine Learning to advance AI capabilities by designing, developing, and deploying LLMs and generative AI solutions. Focus on building scalable, production-grade AI systems for enterprise automation, personalization, and decision-making. Architect RAG and prompt orchestration pipelines, implement MLOps, and conduct fine-tuning and evaluation. Collaborate with cross-functional teams to integrate LLM-powered solutions.

What you'd actually do

  1. Build and manage the complete machine learning and generative AI lifecycle, including research, design, experimentation, development, deployment, monitoring, and maintenance.
  2. Design, develop, and deploy LLM-based and generative AI models to power scalable and intelligent enterprise applications.
  3. Architect, optimize, and maintain retrieval-augmented generation (RAG), prompt orchestration, and contextual reasoning pipelines to support diverse AI use cases.
  4. Implement scalable MLOps pipelines for model deployment, performance monitoring, and continuous improvement.
  5. Conduct fine-tuning, alignment, and evaluation of LLMs and multimodal models to ensure reliability, efficiency, and fairness.

Skills

Required

  • Bachelor's Degree Computer Science, Data Science, Statistics, Informatics, Information Systems, Machine Learning, or another quantitative field
  • 1+ year of experience in designing, developing, and deploying large language models (LLMs) and generative AI systems in production environments
  • 5+ years of experience building and maintaining end-to-end ML pipelines, including data ingestion, training, deployment, monitoring, and optimization
  • 3+ years of experience applying MLOps practices and leveraging cloud platforms (AWS, GCP, or Azure) for scalable AI solutions
  • 5+ years of experience collaborating with cross-functional teams (engineering, data science, and product) to deliver AI-powered applications
  • 2+ years of experience in programming languages such as Python/R, Java/Scala, and/or Go, with hands-on experience in frameworks such as PyTorch, TensorFlow, LangChain, or Hugging Face

Nice to have

  • Master's/Advanced Degree Computer Science, Data Science, Statistics, Informatics, Information Systems, Machine Learning, or another quantitative field
  • Experience implementing fine-tuning, evaluation, and benchmarking techniques for LLMs and generative AI applications
  • Experience in the telecom or large-scale enterprise domain
  • 5+ years in designing, building, and deploying machine learning and generative AI models
  • 5+ years of experience identifying, troubleshooting, and resolving complex technical and operational challenges
  • 4+ years of strong analytical and problem-solving abilities with attention to model performance, reliability, and responsible AI practices
  • 2+ years of experience with transformer architectures, embeddings, and multimodal learning techniques

What the JD emphasized

  • 1+ year of experience in designing, developing, and deploying large language models (LLMs) and generative AI systems in production environments (Required)
  • 5+ years of experience building and maintaining end-to-end ML pipelines, including data ingestion, training, deployment, monitoring, and optimization (Required)
  • 3+ years of experience applying MLOps practices and leveraging cloud platforms (AWS, GCP, or Azure) for scalable AI solutions (Required)
  • 2+ years of experience in programming languages such as Python/R, Java/Scala, and/or Go, with hands-on experience in frameworks such as PyTorch, TensorFlow, LangChain, or Hugging Face (Required)

Other signals

  • LLM
  • generative AI
  • production-grade AI systems
  • scalable MLOps pipelines
  • fine-tuning
  • alignment
  • evaluation
  • transformer architectures
  • multimodal learning