Software Engineering III - Amdp

JPMorgan Chase JPMorgan Chase · Banking · LONDON, LONDON, United Kingdom · Corporate Sector

Software Engineering III role focused on Site Reliability Engineering for AI/ML Data Platforms at JPMorgan Chase. Responsibilities include building and supporting scalable, resilient data solutions, troubleshooting, incident management, root cause analysis, and implementing production changes. Requires proficiency in SRE principles, observability tools, Python/PySpark, and system design. Experience with Databricks, Snowflake, and AWS is preferred.

What you'd actually do

  1. Develop and support AI/ML solutions for troubleshooting and incident resolution
  2. Coordinate incident management coverage to ensure effective resolution of application issues
  3. Collaborate with cross-functional teams to perform root cause analysis and implement production changes
  4. Apply expertise in application development and support using technologies such as Databricks, Snowflake, AWS, and Kubernetes
  5. Mentor and guide team members to drive strategic change

Skills

Required

  • site reliability culture and principles
  • running production incident calls
  • observability
  • SLI/SLO/SLA and error budgets
  • Python or PySpark
  • automate tasks
  • system design
  • resiliency
  • testing
  • operational stability
  • disaster recovery
  • risk controls
  • compliance with organizational standards

Nice to have

  • SRE or production support role
  • AWS Cloud
  • Databricks
  • Snowflake
  • AWS certifications
  • Databricks certifications

What the JD emphasized

  • AI/ML Data Platforms
  • AI/ML solutions
  • AI/ML platforms

Other signals

  • AI/ML data platforms
  • scalable, resilient data solutions
  • reliability and performance of our AI/ML platforms