Lead Machine Learning Engineer (manager Ic)

Capital One Capital One · Banking · McLean, VA +2

Lead Machine Learning Engineer at Capital One's Risk Tech division, focusing on building and deploying AI-powered risk management solutions. The role involves designing, developing, testing, deploying, and supporting AI software components, including fine-tuning models, managing LLM inference, similarity search, guardrails, governance, observability, and agentic AI. Responsibilities include contributing to the technical roadmap, leveraging AI technologies, informing ML infrastructure decisions, maintaining production models, and constructing data pipelines, with an emphasis on Responsible and Explainable AI.

What you'd actually do

  1. Partner with a cross-functional team of engineers, data scientists, product managers, and designers to deliver AI-powered products that change how our associates work and provide value to our customers.
  2. Design, develop, test, deploy, and support AI software components utilizing machine learning models, including model evaluation and experimentation, large language model inference, similarity search, guardrails, governance, observability and agentic AI.
  3. Fine-tune, develop and evaluate machine learning and foundation models,
  4. Collaborate as part of a cross-functional Agile team to create and enhance software that utilizes state-of-the-art AI and ML capabilities
  5. Contribute thought leadership and technical vision to the long term roadmap of pioneering AI systems at Capital One.

Skills

Required

  • Python
  • Scala
  • Java
  • designing and building data-intensive solutions using distributed computing
  • building, scaling, and optimizing ML systems
  • machine learning models
  • model evaluation and experimentation
  • large language model inference
  • similarity search
  • guardrails
  • governance
  • observability
  • agentic AI
  • fine-tune machine learning and foundation models
  • develop machine learning and foundation models
  • evaluate machine learning and foundation models
  • ML infrastructure decisions
  • ML modeling techniques
  • Retrain models in production
  • maintain models in production
  • monitor models in production
  • Construct optimized data pipelines to feed ML models
  • code management
  • model governance
  • Responsible AI
  • Explainable AI

Nice to have

  • Master's or Doctoral Degree in computer science, electrical engineering, mathematics, or a similar field
  • designing, developing, delivering, and supporting AI services at scale
  • building production-ready data pipelines that feed ML models
  • industry recognized ML framework such as scikit-learn, PyTorch, Dask, Spark, or TensorFlow
  • developing AI and ML algorithms or technologies using Python
  • Retrieval Augmented Generation (RAG)
  • data gathering and preparation for ML models
  • people leader experience
  • leading teams developing ML solutions using industry best practices, patterns, and automation
  • designing, implementing, and scaling complex data pipelines for ML models
  • evaluating ML model performance
  • leveraging interactive AI tooling

What the JD emphasized

  • state-of-the-art AI technology
  • state-of-the-art AI and ML capabilities
  • pioneering AI systems
  • Responsible and Explainable AI

Other signals

  • deploy proprietary solutions for Risk management that are powered by state-of-the-art AI technology
  • build and deploy proprietary solutions for Risk management that are powered by state-of-the-art AI technology
  • design, develop, test, deploy, and support AI software components utilizing machine learning models
  • large language model inference
  • similarity search
  • guardrails
  • governance
  • observability
  • agentic AI
  • fine-tune, develop and evaluate machine learning and foundation models
  • state-of-the-art AI and ML capabilities
  • pioneering AI systems
  • ML infrastructure decisions
  • Retrain, maintain, and monitor models in production
  • Construct optimized data pipelines to feed ML models
  • Responsible and Explainable AI