Lead Software Engineer- (python)

JPMorgan Chase JPMorgan Chase · Banking · Jersey City, NJ +1 · Commercial & Investment Bank

Lead Software Engineer role focused on building and deploying production-grade machine learning pipelines and systems at enterprise scale within JPMorgan Chase's ML Center of Excellence. The role involves designing, developing, and maintaining software, engineering data pipelines, feeding data into ML models, and deploying complete systems into production environments with a focus on scalability and reliability.

What you'd actually do

  1. Designs, develops and maintains production grade software
  2. Engineers data pipelines to ingest and transform large volumes of data.
  3. Feeds processed data into machine learning model pipelines.
  4. Stores resulting data in data warehouses and data lakes.
  5. Design sand implements end-to-end machine learning model pipelines, from data input to serving outputs to a large user base.

Skills

Required

  • Python coding
  • Infrastructure design for large-scale machine learning model deployment using tools like Terraform or AWS Infrastructure as Code.
  • Building metrics and setting up AWS CloudWatch monitors and alarms for infrastructure and application performance.
  • Working with data lakes (Amazon S3) and data warehouses (AWS Redshift).
  • Utilizing AWS services and CI/CD pipelines for deploying and maintaining machine learning applications.
  • Developing dynamic, interactive dashboards with Tableau or Qlik Sense, including advanced visualization, ETL automation, and ODBC connectors.
  • Data manipulation, structuring, design flow, and query optimization using SQL and Python.
  • Processing large datasets with data containers, multithreading, and multiprocessing in PySpark and TensorFlow.
  • Developing software or microservices deployed as REST APIs.
  • Using AWS Kinesis and Firehose for large-scale data ingestion and ETL with AWS Glue.
  • Developing and automating high-performance, large-scale data processing systems.

Nice to have

  • Familiarity with recent large language model technologies
  • Familiarity with engineering systems using large language models
  • Familiarity with LLM tools such as Langchain or Haystack

What the JD emphasized

  • production grade software
  • machine learning model pipelines
  • deploy systems into production
  • enterprise scale

Other signals

  • production grade software
  • machine learning model pipelines
  • deploy systems into production
  • enterprise scale