Lead Machine Learning Engineer-mlops

JPMorgan Chase JPMorgan Chase · Banking · New York, NY +1 · Consumer & Community Banking

Lead Machine Learning Engineer (MLOps) focused on building and maintaining pipelines for distributed model training, model serving, hyperparameter tuning, monitoring, and production validation for a Recommendation Engine team. The role involves deploying ML models on AWS, supporting personalized experiences across consumer channels, and optimizing inference for LLMs and vector databases.

What you'd actually do

  1. Build, deploy, and maintain robust pipelines for distributed training on GPU-enabled clusters to support scalable machine learning workflows.
  2. Develop and manage pipelines for high-throughput, real-time inference as well as batch inference, ensuring optimal performance and reliability.
  3. Implement quantization techniques and deploy large language models (LLMs) to maximize efficiency and resource utilization.
  4. Oversee the management and optimization of vector databases to support advanced AI and machine learning applications.
  5. Establish and maintain comprehensive monitoring and observability pipelines to ensure system health, performance, and rapid issue resolution.

Skills

Required

  • Python
  • AWS
  • data science fundamentals
  • training and deploying models
  • monitoring and observability tools
  • Ray
  • DuckDB
  • Spark
  • engineering fundamentals
  • analytical mindset

Nice to have

  • recommendation and personalization systems
  • containers (docker ecosystem)
  • container orchestration systems [Kubernetes, ECS]
  • DAG orchestration [Airflow, Kubeflow etc]
  • Databases

What the JD emphasized

  • high throughput, low latency applications
  • large language models (LLMs)
  • vector databases
  • monitoring and observability

Other signals

  • MLOps
  • Recommendation Engine
  • distributed model training
  • model serving
  • monitoring and observability