Applied AI ML Director - Agent Builder Platform

JPMorgan Chase JPMorgan Chase · Banking · Palo Alto, CA +1 · Corporate Sector

Director of ML Engineering to lead the technical vision and execution for a core Agent SDK and the agentic systems it enables, focusing on building and scaling foundational toolkits for enterprise AI.

What you'd actually do

  1. Define and drive the technical vision and roadmap for the Agent SDK and long-running agentic workflows.
  2. Set architectural direction for SDK components, including task orchestration, state management, checkpointing, and retry logic.
  3. Champion data science rigor by establishing measurement, experimentation, and evaluation frameworks for agent performance.
  4. Oversee the design and optimization of ML pipelines for training, fine-tuning, and inference of models powering agent intelligence.
  5. Direct the instrumentation strategy for observability, feedback loops, and continuous improvement of autonomous agents.

Skills

Required

  • 10 years of experience in machine learning engineering, applied data science, or ML platform development.
  • 3 years of experience in a leadership role managing teams of engineers and/or data scientists.
  • Strong technical depth across the ML and data science stack, including ML frameworks (PyTorch, TensorFlow, JAX, scikit-learn) and LLM serving and fine-tuning toolchains.
  • Proven experience designing and delivering SDKs, platforms, or agent development kit, including API design and documentation strategy.
  • Expertise in distributed and long-running systems, including state machines, workflow orchestration, checkpointing, and fault-tolerant design.
  • Fluency in LLM-based agent architectures, prompt engineering, tool use, and multi-agent coordination patterns.
  • Demonstrated ability to craft and drive a technical vision that maximizes business impact and influences decision-making.
  • Proven ability to build, mentor, and retain senior technical talent and foster a collaborative team culture.
  • Strong foundation in experimental design, statistical analysis, and evaluation methodology.
  • Excellent communication skills, with the ability to explain complex technical concepts to both technical and non-technical audiences.
  • Experience integrating data science rigor and responsible AI practices into production systems.

Nice to have

  • Experience building or contributing to open-source ML or agent frameworks such as LangChain, AutoGen, Haystack, or MLflow.
  • Background in ML evaluation and monitoring at scale, including drift detection, A/B testing, and automated regression testing.
  • Deep familiarity with multi-agent system design, including communication protocols, task delegation, and conflict resolution.
  • Experience overseeing AI workload deployment on managed ML platforms such as AWS SageMaker or Bedrock.
  • Background leading AI engineering in regulated or high-reliability environments, especially financial services or asset and wealth management.
  • Experience integrating user and stakeholder feedback loops into continuous model and system improvement processes.
  • Experience designing developer experience, writing technical documentation, and supporting internal developer adoption.

What the JD emphasized

  • long-running, autonomous AI agents
  • agentic system design
  • Agent SDK
  • agentic workflows
  • multi-step reasoning
  • tool use
  • multi-agent coordination
  • responsible AI practices
  • guardrails
  • policy enforcement mechanisms

Other signals

  • Agent SDK
  • long-running, autonomous AI agents
  • agentic system design
  • agentic workflows
  • multi-step reasoning
  • tool use
  • multi-agent coordination