Lead Machine Learning Engineer - (bank Tech)

Capital One Capital One · Banking · McLean, VA

Lead Machine Learning Engineer at Capital One focused on productionizing ML applications and systems at scale within a fintech domain. Responsibilities include designing, building, and delivering ML models, optimizing ML infrastructure, writing and testing code, automating tests and deployment, retraining and monitoring models in production, leveraging cloud architectures, constructing data pipelines, and ensuring responsible AI practices. Requires experience in data-intensive solutions, programming, and building/scaling ML systems.

What you'd actually do

  1. Design, build, and/or deliver ML models and components that solve real-world business problems, while working in collaboration with the Product and Data Science teams.
  2. Inform your ML infrastructure decisions using your understanding of ML modeling techniques and issues, including choice of model, data, and feature selection, model training, hyperparameter tuning, dimensionality, bias/variance, and validation).
  3. Solve complex problems by writing and testing application code, developing and validating ML models, and automating tests and deployment.
  4. Retrain, maintain, and monitor models in production.
  5. Leverage or build cloud-based architectures, technologies, and/or platforms to deliver optimized ML models at scale.

Skills

Required

  • Python
  • Scala
  • Java
  • designing and building data-intensive solutions using distributed computing
  • building, scaling, and optimizing ML systems

Nice to have

  • Master's or Doctoral Degree in computer science, electrical engineering, mathematics, or a similar field
  • building production-ready data pipelines that feed ML models
  • industry recognized ML framework such as scikit-learn, PyTorch, Dask, Spark, or TensorFlow
  • developing performant, resilient, and maintainable code
  • data gathering and preparation for ML models
  • people leader experience
  • leading teams developing ML solutions using industry best practices, patterns, and automation
  • developing and deploying ML solutions in a public cloud such as AWS, Azure, or Google Cloud Platform
  • designing, implementing, and scaling complex data pipelines for ML models and evaluating their performance
  • ML industry impact through conference presentations, papers, blog posts, open source contributions, or patents

What the JD emphasized

  • productionizing machine learning applications and systems at scale
  • build, scale, and optimize ML systems
  • designing and building data-intensive solutions
  • building production-ready data pipelines that feed ML models
  • developing and deploying ML solutions in a public cloud

Other signals

  • productionizing machine learning applications and systems at scale
  • develop and review model and application code
  • ensure high availability and performance of our machine learning applications
  • build, scale, and optimize ML systems