Sr. Machine Learning Engineer - Apple News

Apple Apple · Big Tech · Cupertino, CA +1 · Software and Services

Machine Learning Engineer to build, operate, and scale systems for intelligent features in Apple News, focusing on model serving, deployment pipelines, and ML platform infrastructure for content tagging, ranking, and personalization.

What you'd actually do

  1. Design, build, and operate infrastructure to host and serve classical ML models (gradient boosting, SVMs) and deep learning models (transformers, neural rankers) in production with a strong focus on latency, reliability, and scalability
  2. Evaluate and select the right tools, frameworks, and infrastructure (Kubernetes, Spark, Cassandra, Solr, Spring Boot, AWS, GCP) for model serving and feature delivery with a strong command of trade-offs across latency, cost, scalability, and reliability
  3. Collaborate with model development teams to manage a shared codebase, build common data processing libraries and profile/optimize ML workloads.
  4. Build scalable and reusable infrastructure components for data pipelines, such as sampling and collecting data for training, labeling via human annotations or LLMs
  5. Design and implement model monitoring, observability, and alerting systems to ensure production ML systems meet reliability and performance SLAs

Skills

Required

  • Java
  • Python
  • Kubernetes
  • Spark
  • Cassandra
  • Solr
  • Spring Boot
  • AWS
  • GCP
  • RAG architectures
  • agentic AI systems

Nice to have

  • quantization
  • batching
  • caching
  • model distillation
  • embedding pipeline infrastructure
  • vector store design
  • approximate nearest neighbor search
  • content personalization
  • recommendation systems

What the JD emphasized

  • build and operate the infrastructure that powers ML-driven product features
  • own the systems that host, serve, and monitor both classical and deep learning models in production
  • drive ML infrastructure from concept to production
  • build and shipping production ML infrastructure: model serving, deployment pipelines, and feature delivery systems using AI/ML workflows
  • Experience with RAG architectures: including retrieval, embedding, chunking, and reranking strategies, and deploying agentic AI systems in production

Other signals

  • shipping ML models
  • ML infrastructure
  • model serving
  • deployment pipelines
  • distributed systems