Lead Software Engineer| Risk Data Platform & Strategy

JPMorgan Chase JPMorgan Chase · Banking · Jersey City, NJ +1 · Corporate Sector

Lead Software Engineer for a Risk Data Platform & Strategy team at JPMorgan Chase, focusing on advanced data engineering solutions with a strong SRE mindset. Responsibilities include designing, developing, and delivering data engineering solutions, championing reliability, scalability, and operational excellence, leading critical technology initiatives, and mentoring other engineers. Requires proficiency in engineering, architecture, AI/ML (for operational stability of data platforms), SRE principles, distributed systems, cloud-native architectures, large-scale data processing, Java, Python, and big data technologies. Preferred qualifications include experience with modern data technologies, microservices, API design, and financial services industry IT systems.

What you'd actually do

  1. Executes creative software solutions, design, development, and technical troubleshooting with ability to think beyond routine or conventional approaches to build solutions or break down technical problems
  2. Develops secure high-quality production code for data-intensive applications, and reviews and debugs code written by others
  3. Drives the implementation of SRE best practices, including automated monitoring, alerting, and self-healing mechanisms to ensure high availability and reliability of data platforms
  4. Leads root cause analysis and post-incident reviews, collaborating with other engineering team members to develop long-term solutions that prevent recurrence and improve system resiliency
  5. Mentors and guides other engineering team members in adopting SRE principles, fostering a culture of reliability, automation, and continuous improvement

Skills

Required

  • Engineering & Architecture
  • AI/ML
  • SRE principles
  • distributed systems
  • cloud-native architectures
  • large-scale data processing
  • Java
  • Python
  • Spark/PySpark
  • Databricks
  • Snowflake
  • system design
  • application development
  • testing
  • operational stability
  • developing, debugging, and maintaining code
  • modern programming languages
  • database querying languages
  • observability tools
  • incident management
  • performance optimization
  • agile methodologies
  • CI/CD
  • Application Resiliency
  • Security

Nice to have

  • automation
  • continuous delivery methods
  • Software Development Life Cycle
  • data engineering
  • cloud
  • artificial intelligence
  • machine learning
  • mobile
  • Databricks
  • Snowflake
  • large scale data processing
  • micro services
  • API design
  • Kafka
  • Redis
  • MemCached
  • Observability (Dynatrace , Splunk, Grafana or similar)
  • Orchestration (Airflow, Temporal)
  • financial services industry
  • IT systems

What the JD emphasized

  • AI/ML with hands-on experience in designing, implementing, testing, and ensuring the operational stability of large-scale enterprise data platforms and solutions
  • Demonstrated expertise in applying SRE principles to drive reliability engineering, automation, and operational excellence within complex technical environment
  • Deep understanding of distributed systems, cloud-native architectures, and large-scale data processing
  • Hands-on practical experience delivering system design, application development, testing, and operational stability