Senior Manager of Site Reliability Engineering

JPMorgan Chase · Banking · Jersey City, NJ +1 · Corporate Sector

Senior Manager of Site Reliability Engineering within Enterprise Technology, Liquidity Risk team, responsible for non-functional requirements, strategic planning, and driving improvements in customer experience, resiliency, security, scalability, monitoring, instrumentation, and automation for applications. Focuses on leading teams, implementing SRE principles, and fostering a culture of continuous improvement.

What you'd actually do

  1. Demonstrates expertise in site reliability principles and an understanding of the fine balance between features, efficiency, and stability
  2. Effectively negotiates with peers and executive partners to ensure optimal outcomes for all
  3. Drives the adoption of site reliability practices throughout the organization
  4. Ensures your teams follow site reliability best practices and can demonstrate this empirically through stability and reliability metrics
  5. Drives a culture of continual improvement and solicits real-time feedback to improve the customer’s experience

Skills

Required

  • Formal training or certification on software engineering concepts and 5+ years applied experience
  • Advanced proficiency in site reliability culture and principles, with the ability to demonstrate how to implement site reliability across application and platform teams while avoiding common pitfalls
  • Experience leading technologists to manage and solve complex technological issues at a firmwide level
  • Ability to influence the team’s culture by championing innovation and change for success
  • Experience hiring, developing, and recognizing talent
  • Proficiency in at least one programming language (e.g., Python, Java Spring Boot, .NET, etc.)
  • Demonstrated proficiency in software applications and technical processes within a technical discipline (e.g., cloud, artificial intelligence, machine learning, mobile, etc.)
  • Proficiency in continuous integration and continuous delivery tools (e.g., Jenkins, GitLab, Terraform, etc.)
  • Experience with container and container orchestration (e.g., ECS, Kubernetes, Docker, etc.)
  • Experience with troubleshooting common networking technologies and issues

Nice to have

  • Ability to code and demonstrate data fluency

What the JD emphasized

  • non-functional requirement owner
  • resiliency, security, scalability, monitoring, instrumentation, and automation
  • blameless, data-driven manner
  • stability and reliability metrics
  • blameless, data-driven, post-mortem strategies