Monitoring Engineering Production Services Specialist Ll

Bank of America Bank of America · Banking · Chandler, AZ

This role supports end users and responds to incidents and problems for multiple applications, focusing on leading triage activities for business-impacting incidents. Responsibilities include ensuring compliance with incident and problem management policies, serving as a key focal point for customer experience, and restoring impacts regardless of root cause. The role involves leading production support triage, managing bridge line troubleshooting, technical research, escalating issues, documenting impacts, providing status updates, interpreting monitors, dashboards, and logs, and analyzing incident management activities.

What you'd actually do

  1. Leads production support triage efforts, manages bridge line troubleshooting, engages in technical research, and escalates issues to leadership as needed
  2. Ensures all impacts are accurately recorded and documented in the system of record, verifies documents and wikis are updated and available for use during triage, and supports on call responsibilities for incidents, the documentation of application flows, impacts during outages, the customer experience, and contacts for support needs
  3. Provides status updates and technical detail for awareness communications, such as infrastructure, application and client impact, and component points of failure, oversees accuracy of all communications sent, and ensures any necessary reconvenes are scheduled
  4. Identifies business impact, interprets monitors, dashboards, and logs, and writes queries to accurately calculate and communicate impacts to leadership in partnership with senior team members or specialists within Technology Services
  5. Promotes and enforces production governance during triage/testing, and identifies production failure scenarios, vulnerabilities, and opportunities for improvement, determines appropriate actions, and escalate issues as needed

Skills

Required

  • Production Support
  • Risk Management
  • Analytical Thinking
  • DevOps Practices
  • Solution Delivery Process
  • Stakeholder Management

Nice to have

  • Adaptability
  • Influence
  • Automation
  • Collaboration
  • Innovative Thinking
  • Result Orientation
  • Solution Design
  • Business Acumen
  • Project Management