Machine Learning Engineer – Document Digitization (llms)-senior Associate

JPMorgan Chase JPMorgan Chase · Banking · Jersey City, NJ +1 · Corporate Sector

Machine Learning Engineer focused on applying LLMs and AI to document digitization within a financial services context. Responsibilities include the full ML lifecycle from training to deployment and monitoring, building scalable pipelines, and ensuring security and compliance.

What you'd actually do

  1. Design, development, and integration of AI-powered document digitization solutions, focusing on extracting information and insights from diverse document types.
  2. Manage the end-to-end AI/ML lifecycle: model training, validation, deployment, monitoring, and continuous improvement in production environments.
  3. Employ generative AI, and large language models (LLMs) to automate and optimize document workflows.
  4. Build and maintain scalable document digitization pipelines using Python, AI frameworks, and cloud technologies.
  5. Provision and manage cloud resources using infrastructure as code tools (Terraform) and AWS services (SageMaker, Bedrock).

Skills

Required

  • Python programming
  • AI/ML model development, deployment, and MLOps practices
  • machine learning frameworks (TensorFlow, PyTorch, Scikit-learn, PyTorch Lightning)
  • generative AI models (GANs, VAEs, transformers, diffusion models) and LLMs
  • AWS cloud platforms (SageMaker, Bedrock)
  • containerization (Docker, Kubernetes, Amazon EKS)
  • infrastructure as code (Terraform)
  • NoSQL databases (Mongo Atlas, ElasticSearch, OpenSearch, Neo4J)
  • agentic coding approaches, autonomous code agents, and last sprinting code generation
  • SDLC, CI/CD, resiliency, and security practices
  • accelerate development using AI technologies
  • deploying and maintaining AI/ML solutions in large-scale, production environments
  • problem-solving, communication, and collaboration skills

Nice to have

  • Java
  • React.JS, AngularJS
  • financial services, especially investment banking or credit risk operations
  • agentic AI frameworks, prompt optimization, and fine-tuning SLMs
  • distributed computing, data sharing, and DDP training
  • design/code reviews and mentoring teams
  • AWS Generative AI Developer Professional certification

What the JD emphasized

  • secure, scalable, and innovative technology products
  • extract, analyze, and manage information
  • automate and optimize document workflows
  • scalable document digitization pipelines
  • scalability, reliability, security, and compliance
  • reimagine legacy document processing systems
  • monitor digitization accuracy, workflow efficiency, and business impact
  • AI/ML solutions in large-scale, production environments

Other signals

  • LLMs
  • document digitization
  • production deployment
  • MLOps