Lead Infrastructure Engineer - Mainframe Mq

JPMorgan Chase JPMorgan Chase · Banking · Plano, TX +1 · Corporate Sector

Lead Infrastructure Engineer responsible for engineering, securing, and optimizing IBM MQ on z/OS, driving automation, and advancing AIOps/operational analytics. The role involves global on-call support, incident resolution, and integrating telemetry into enterprise observability workflows. Emphasis on an AI mindset for operational improvements and responsible AI principles.

What you'd actually do

  1. Provide global on-call support as part of a shared rotation; lead structured incident/problem resolution and continuous improvements.
  2. Design, develop, and deploy changes while following firm processes for change management, issue resolution, design governance, and JIRA.
  3. Engineer, secure, and optimize IBM MQ on z/OS (queue managers, queues, channels, clusters, QSG, logging, HA across LPARs/sysplex); assess upstream/downstream impacts and mitigation actions.
  4. Build and integrate telemetry from MQ and platform sources (e.g., queue depth, put/get rates, channel states, DLQ events, plus SMF/RMF and logs) into enterprise observability/SRE workflows.
  5. Deliver AIOps and automation: anomaly detection, capacity forecasting, intelligent alerting/noise reduction, and codified runbooks for repeatable operations and remediation.

Skills

Required

  • Formal training/certification in Infrastructure Engineering concepts and 5+ years applied experience (or equivalent).
  • Strong experience on IBM zSeries / IBM Z with z/OS fundamentals (e.g., sysplex concepts, WLM, JES2/3, USS, SAF/RACF, SMF dataset management).
  • Understanding of MQ concepts: queue managers, queues, channels, and MQ object administration.
  • Understanding of RACF/ACF2 security, least privilege, and secure change practices.
  • Strong critical thinking, problem-solving, and communication skills; ability to collaborate across roles and teams.
  • Demonstrated curiosity, continuous learning, and comfort working across infrastructure, automation, and analytics domains.

Nice to have

  • MQ systems programming depth: installation/maintenance/implementation on z/OS, clusters/QSG, tuning using SMF records and monitoring data.
  • Data/AI engineering for ops telemetry: Python, time-series/log shaping, schema governance; streaming/batch pipelines using firm-approved platforms (e.g., Kafka, Spark, enterprise observability).
  • Automation: REXX, CLIST, JCL, Ansible, z/OSMF/Zowe workflows; CI/CD and controlled release in regulated environments.
  • Familiarity with CICS, DB2, IMS, and cross-platform MQ integrations.
  • Experience productionizing ML (lifecycle, monitoring/drift, explainability, rollback) and applying Responsible AI principles.
  • IBM MQ on z/OS experience preferred, but not mandatory if you bring strong adjacent experience (IBM Systems, messaging, observability/AIOps, automation) and willingness to learn MQ rapidly.

What the JD emphasized

  • IBM MQ on z/OS
  • AIOps
  • automation
  • observability
  • Responsible AI