Senior Associate, Data Scientist - Anti-money Laundering

Capital One Capital One · Banking · McLean, VA +3

Senior Associate Data Scientist role focused on developing and deploying machine learning models for Anti-Money Laundering (AML) detection within a financial services company. The role involves end-to-end model development, including data sourcing, training, evaluation, validation, and implementation, utilizing tools like Python, AWS, and Spark. The primary output is a productionized AI/ML system for risk management.

What you'd actually do

  1. Build machine learning models and AI tools through all phases of development, from design through training, evaluation, validation, and implementation
  2. Leverage a broad stack of tools and technologies — Python, Conda, AWS, Spark, dbt, and more — to build production-ready pipelines for data sourcing, model development, and model scoring
  3. Partner with a cross-functional team of data scientists, software engineers, business analysts, risk managers, and product owners to deliver industry-leading risk management products
  4. Flex your interpersonal skills to translate the complexity of your work into tangible business goals

Skills

Required

  • Python
  • SQL
  • AWS
  • Spark
  • dbt
  • Machine Learning
  • Data Analytics
  • Quantitative field degree (Statistics, Economics, Operations Research, Analytics, Mathematics, Computer Science)

Nice to have

  • Master’s Degree in STEM field or PhD
  • Experience in AML modeling or related domain (e.g. Fraud, Credit Risk)
  • Experience with clustering, classification, sentiment analysis, time series, and deep learning

What the JD emphasized

  • production-ready pipelines
  • model development
  • model scoring
  • model outputs
  • model developers
  • production models
  • machine learning models
  • model training
  • model evaluation
  • model validation
  • machine learning

Other signals

  • Develops and deploys machine learning models for anti-money laundering detection.
  • Works with large-scale data and cloud platforms (AWS, Snowflake, Spark).
  • Focuses on productionizing models and ensuring their ongoing monitoring.