Data Engineer III - Python, Databricks - Senior Associate

JPMorgan Chase JPMorgan Chase · Banking · LONDON, LONDON, United Kingdom · Commercial & Investment Bank

Data Engineer III role at JPMorgan Chase in London, focusing on designing and delivering data collection, storage, access, and analytics solutions. The role involves developing and maintaining data pipelines and architectures using Python and Databricks, with a significant emphasis on utilizing and validating enterprise-authorized AI capabilities to assist in data engineering workflows, SDLC routines, and documentation.

What you'd actually do

  1. Develop workflows and ELT pipelines using Python and Databricks.
  2. Uses enterprise-authorized AI capabilities within the work environment to accelerate data pipeline/design analysis and documentation, validating outputs and handling data according to sensitivity and security requirements.
  3. Support review of controls to ensure sufficient protection of enterprise data.
  4. Implement data security using entitlements frameworks.
  5. Update logical or physical data models based on new use cases.

Skills

Required

  • Formal training or certification on software engineering concepts and 3 years applied experience.
  • Good working knowledge of AWS, Databricks, and Python.
  • Experience across the data lifecycle.
  • Demonstrated experience using enterprise-authorized AI capabilities within the work environment to support data engineering workflows with strong validation habits and awareness of data sensitivity.
  • Ability to review and validate AI-assisted outputs (e.g., query suggestions, test ideas, or model change summaries) before use, escalating when uncertain and following data handling requirements.
  • Advanced at SQL, including joins and aggregations.
  • Working understanding of NoSQL databases.
  • Significant experience with statistical data analysis and ability to determine appropriate tools and data patterns for analysis.
  • Utilize AWS Cloud Services for developing, deploying, and managing applications at scale.
  • Good understanding and working knowledge of software development lifecycle tools used for configuration management, CI/CD pipelines, unit testing, regression testing, and performance testing.

Nice to have

  • Familiarity with the Standardized data layer practices (Medallion architecture)
  • Exposure to Aurora Postgres and MongoDB
  • Skills in designing efficient data models including normalization, denormalization, and schema design and an understanding around relational and star schemas.

What the JD emphasized

  • enterprise-authorized AI capabilities
  • validating outputs
  • data handling requirements
  • AI-assisted practices
  • control validation
  • resiliency and security expectations
  • strong validation habits
  • awareness of data sensitivity
  • review and validate AI-assisted outputs
  • escalating when uncertain