Site Reliability Engineer III

JPMorgan Chase · Banking · Plano, TX +1 · Commercial & Investment Bank

This Site Reliability Engineer III role focuses on automating, troubleshooting, and monitoring AWS-based applications and infrastructure within the Data Solutions team at JPMorgan Chase. It emphasizes improving reliability, performance, and scalability, and delivering impactful solutions for the financial services business. Key responsibilities include designing and implementing solutions, managing infrastructure as code, driving the adoption of SRE principles and best practices, driving automation, troubleshooting AWS issues, and enhancing observability.

What you'd actually do

  1. Guides and assists others in building effective designs and achieving consensus within the team
  2. Collaborates with software engineers and teams to implement automated CI/CD pipelines for deployment
  3. Designs, develops, tests, and implements solutions to improve availability, reliability, and scalability
  4. Implements infrastructure, configuration, and network as code for assigned applications and platforms
  5. Works with technical experts, stakeholders, and team members to resolve complex issues

Skills

Required

  • site reliability engineering principles
  • cloud environments
  • Python
  • Java/Spring Boot
  • .NET
  • observability tools (Grafana, Dynatrace, Prometheus, Datadog, Splunk, etc.)
  • CI/CD and infrastructure-as-code tools such as Jenkins, GitLab, or Terraform
  • AWS platform
  • container orchestration (EKS)
  • networking technologies
  • cloud security and compliance practices
  • infrastructure automation tools (Ansible, Chef, Puppet)
  • distributed systems
  • microservices architecture
  • agile development environments

Nice to have

  • Experience with AWS platform and container orchestration (EKS)
  • Familiarity with troubleshooting common networking technologies and issues
  • Exposure to cloud security and compliance practices
  • Experience with infrastructure automation tools (Ansible, Chef, Puppet)
  • Knowledge of distributed systems and microservices architecture
  • Experience working in agile development environments