Senior Lead Infrastructure Engineer - Capacity Management

JPMorgan Chase JPMorgan Chase · Banking · Wilmington, DE +1 · Consumer & Community Banking

This role focuses on Capacity Management for critical applications within a large enterprise, leveraging AI capabilities to analyze infrastructure signals, drive automation, and optimize performance, cost, and resilience. The engineer will own the capacity strategy, lead cross-functional planning, and advance observability and analytics.

What you'd actually do

  1. Own the end‑end Capacity Management strategy and operating model for critical CCB applications, including standards, KPIs, governance, and continuous improvement.
  2. Lead cross‑functional capacity planning and risk management across Application, Infrastructure, and SRE teams; proactively identify risks and drive remediation to closure.
  3. Uses enterprise-authorized AI capabilities within the work environment to accelerate analysis of complex infrastructure signals and documentation of mitigation options, validating outputs and handling operational data according to sensitivity and security requirements.
  4. Set and enforce capacity governance across platforms and applications; chair or co‑lead working groups to align priorities and remove blockers.
  5. Advance observability and analytics: design consolidated views/dashboards, real‑time KPIs, forecasts, peak calendars, event flags, and SLA‑backed alerting.

Skills

Required

  • Formal training or certification on Infrastructure Engineering concepts and 5+ years applied experience
  • Proven expertise in managing or governing technology capacity (or adjacent ITIL processes) at enterprise scale.
  • Strong understanding of infrastructure KPIs (CPU, memory, I/O, network), application performance, and monitoring/observability tools.
  • Demonstrated experience using enterprise-authorized AI capabilities within the work environment to support infrastructure engineering workflows with strong validation habits and awareness of data sensitivity.
  • Ability to review and validate AI-assisted recommendations before implementation, escalating when uncertain and ensuring outcomes align to resiliency, security, and auditability expectations.
  • Hands‑on experience building and leveraging dashboards and analytics (e.g., Grafana, Dynatrace, ELK) and BI platforms (Power BI, Qlik, Tableau) for multi‑source data analysis.
  • Proficiency with one or more scripting/programming languages (e.g., Python, Go, JavaScript) to automate data pipelines, reporting, and runbooks.
  • Experience leading the lifecycle of cross‑functional projects, integrating automated systems, and delivering resilient, scalable solutions.
  • Cloud experience (e.g., AWS) with the ability to design, develop, and deploy secure, scalable infrastructure; familiarity with multi‑region patterns and SRE practices.
  • Exceptional communication and stakeholder management; able to influence decisions, build partnerships, and drive change across diverse teams.
  • Track record of improving IT processes to maximize efficiency and user experience; strong documentation discipline and governance mindset.

What the JD emphasized

  • Uses enterprise-authorized AI capabilities within the work environment to accelerate analysis of complex infrastructure signals and documentation of mitigation options, validating outputs and handling operational data according to sensitivity and security requirements.
  • Demonstrated experience using enterprise-authorized AI capabilities within the work environment to support infrastructure engineering workflows with strong validation habits and awareness of data sensitivity.
  • Ability to review and validate AI-assisted recommendations before implementation, escalating when uncertain and ensuring outcomes align to resiliency, security, and auditability expectations.
  • Proven expertise in managing or governing technology capacity (or adjacent ITIL processes) at enterprise scale.