Lead Software Engineer - Ai/ml and Iac

JPMorgan Chase JPMorgan Chase · Banking · Plano, TX +1 · Consumer & Community Banking

Lead Software Engineer focused on AI/ML and Infrastructure as Code (IaC) within a financial services technology team. The role involves designing, developing, and deploying AI/ML models and agent-based systems for automation, including IaC development, CI/CD enhancement, and AI-powered observability. Requires strong experience in AI/ML engineering, agent-based systems, IaC automation, observability tools, and programming languages like Python or Java.

What you'd actually do

  1. Designs, develops, and deploys AI/ML models and agent-based systems to automate business and technology processes.
  2. Leads the integration of intelligent agents for workflow automation, decision-making, and process optimization.
  3. Builds and maintain AI-driven tools to automate IAC development, deployment, and management across cloud and on-prem environments.
  4. Develops AI-powered observability solutions to monitor, analyze, and proactively manage application and infrastructure health.
  5. Automates alerting, root cause analysis, and incident response using advanced ML techniques.

Skills

Required

  • software engineering concepts
  • system design
  • application development
  • testing
  • operational stability
  • AI/ML engineering
  • agent-based systems
  • automation
  • automating IAC development
  • observability tools
  • Python
  • Java
  • ML frameworks (TensorFlow, PyTorch, Scikit-learn)
  • cloud platforms (AWS, Azure, GCP)
  • containerization (Docker, Kubernetes)
  • agile methodologies
  • CI/CD
  • Application Resiliency
  • Security
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field

Nice to have

  • automation
  • continuous delivery methods
  • financial services industry
  • IT systems
  • cloud native experience

What the JD emphasized

  • 5+ years of experience in AI/ML engineering, with proven expertise in agent-based systems and automation.
  • Strong experience in automating IAC development (e.g., Terraform, Ansible, CloudFormation) using AI/ML.
  • Deep understanding of observability tools (e.g., Prometheus, Grafana, ELK stack) and automation using AI/ML.

Other signals

  • Designs, develops, and deploys AI/ML models and agent-based systems
  • Leads the integration of intelligent agents for workflow automation
  • Builds and maintain AI-driven tools to automate IAC development
  • Develops AI-powered observability solutions
  • Automates alerting, root cause analysis, and incident response using advanced ML techniques