Lead Software Engineer - Devops / Full-stack / Mlops

JPMorgan Chase JPMorgan Chase · Banking · Columbus, OH +1 · Consumer & Community Banking

Lead Software Engineer focused on building and maintaining a scalable ML platform for model training, deployment, and monitoring within a cloud-native DevOps environment. The role involves coding infrastructure with Terraform, Python automation, Kubernetes, CI/CD pipelines, and applying agentic AI/LLM capabilities to DevSecOps use cases.

What you'd actually do

  1. Design and develop a scalable ML platform to support model training, deployment, and monitoring
  2. Build and maintain infrastructure for automated ML pipelines, ensuring reliability and reproducibility supporting different model frameworks and architectures
  3. Set up monitoring and reliability for both infrastructure and models utilizing Prometheus and Grafana
  4. Code infrastructure with Terraform and utilizing Python for automation
  5. Perform DevOps in Kubernetes (K8s), Docker, Helm, GitOps, and CI/CD pipelines (Jenkins, GitLab CI)

Skills

Required

  • Software engineering concepts
  • Cloud-native delivery
  • Infrastructure as Code with Terraform
  • Python for platform automation
  • CI/CD
  • AWS/public cloud knowledge
  • MLOps tools and platforms
  • Data versioning and ML models lifecycle management
  • Agentic AI/LLM capabilities to DevSecOps
  • Kubernetes
  • Docker
  • Helm
  • GitOps
  • Prometheus
  • Grafana

Nice to have

  • Deploying models using Canary, Blue/Green, or Shadow deployment strategies
  • Deploying & managing ML models
  • Highly regulated environment or industry
  • AWS, Azure, or GCP
  • Postgres experience

What the JD emphasized

  • Expert Infrastructure as Code with Terraform
  • Expert proficiency in Python for platform automation
  • Deep experience with CI/CD
  • Experience with MLOps tools and platforms
  • Practical experience applying agentic AI/LLM capabilities to DevSecOps use cases

Other signals

  • ML platform design and development
  • MLOps tools and platforms
  • agentic AI/LLM capabilities to DevSecOps