Senior Associate, Data Scientist - Audit Data Science

Capital One Capital One · Banking · Richmond, VA +1

This role focuses on building and implementing ML and NLP models, including LLM-based chatbots and GenAI applications, within the fintech domain for audit and risk management. The position involves the full ML lifecycle from design to productionization and monitoring, leveraging various technologies like Python, Hugging Face, LangChain, and AWS.

What you'd actually do

  1. Build machine learning and NLP models through all phases of development, from design through training, evaluation, validation, and implementation.
  2. Productionize highly scalable data pipelines for faster feature changes and updates, and implementing data validation framework and quality tests
  3. Investigate new technology to advance data management, model development and deployment and drive change in enterprise ML products
  4. Leverage a broad stack of technologies (Python, Conda, Flask, Dash, Hugging Face, LangChain, AWS, H2O, Spark, and more) to reveal the insights hidden within huge volumes of numeric and textual data
  5. Flex your interpersonal skills to translate the complexity of your work into tangible business goals

Skills

Required

  • Python
  • SQL
  • machine learning
  • data analytics
  • quantitative field degree

Nice to have

  • AWS
  • Scala
  • R
  • Hugging Face
  • LangChain
  • Spark
  • H2O
  • Flask
  • Dash
  • Master’s Degree in STEM
  • PhD in STEM

What the JD emphasized

  • build creative ML solutions
  • LLM based chatbots
  • GenAI powered applications
  • AML/Fraud identification
  • Customer call transcripts intelligence
  • NLP and Generative AI technologies
  • build machine learning and NLP models through all phases of development, from design through training, evaluation, validation, and implementation.

Other signals

  • build creative ML solutions
  • LLM based chatbots
  • GenAI powered applications
  • AML/Fraud identification
  • Customer call transcripts intelligence
  • NLP and Generative AI technologies