Machine Learning (mlops) Engineer

Apple Apple · Big Tech · Cupertino, CA · Machine Learning and AI

MLOps Engineer role focused on building and optimizing ML infrastructure, ensuring reliability, scalability, and continuous improvement of AI/ML systems in production. Responsibilities include end-to-end quality initiatives, automated pipelines for training, evaluation, deployment, and championing model observability and governance.

What you'd actually do

  1. Explore, design, and implement advanced ML infrastructure frameworks and tools to accelerate model development and delivery.
  2. Champion model observability, incident response, prompt versioning, and feedback loops to ensure continuous model health and performance.
  3. Design and maintain automated pipelines for model training, evaluation, versioning, and deployment.
  4. Partner closely with ML Engineers and Data Scientists to define metrics, gather requirements, and deliver impactful solutions.
  5. Enforce model governance, validation standards, and best practices across teams to ensure reproducibility and compliance.

Skills

Required

  • Python
  • distributed systems
  • databases (SQL/NoSQL)
  • cloud platforms (AWS, Azure, or GCP)
  • Kubernetes
  • MLOps tooling and platforms (Ray, MLflow, Kubeflow, SageMaker, Vertex AI)
  • ML frameworks (TensorFlow, PyTorch, scikit-learn)
  • CI/CD pipelines for ML workflows (Jenkins, GitHub Actions, ArgoCD)
  • data pipeline orchestration tools (Airflow, Prefect)

Nice to have

  • model monitoring
  • drift detection
  • observability practices
  • LLM-based tools (Claude, Gemini, ChatGPT)
  • LLM-generated outputs validation
  • AI-assisted tools integration

What the JD emphasized

  • shipping and maintaining production-grade ML systems end-to-end
  • large-scale software system design and implementation
  • MLOps tooling and platforms
  • CI/CD pipelines for ML workflows

Other signals

  • MLOps Engineer
  • ML lifecycle
  • production-grade ML systems
  • CI/CD pipelines for ML