Applied Researcher Ii, AI Foundations

Capital One Capital One · Banking · San Jose, CA

Applied Researcher II focused on AI Foundations, responsible for building AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation. The role involves partnering with cross-functional teams to deliver AI-powered products and engaging in applied research to push state-of-the-art AI into customer experiences. Requires experience in building large deep learning models, training optimization, self-supervised learning, robustness, explainability, or RLHF, with a track record of delivering models at scale.

What you'd actually do

  1. Build AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation.
  2. Engage in high impact applied research to take the latest AI developments and push them into the next generation of customer experiences.
  3. Leverage a broad stack of technologies — Pytorch, AWS Ultraclusters, Huggingface, Lightning, VectorDBs, and more — to reveal the insights hidden within huge volumes of numeric and textual data.
  4. Partner with a cross-functional team of data scientists, software engineers, machine learning engineers and product managers to deliver AI-powered products that change how customers interact with their money.
  5. Flex your interpersonal skills to translate the complexity of your work into tangible business goals.

Skills

Required

  • Deep understanding of AI methodologies
  • Experience building large deep learning models (language, images, events, or graphs)
  • Expertise in training optimization, self-supervised learning, robustness, explainability, or RLHF
  • Engineering mindset with a track record of delivering models at scale
  • Experience delivering libraries, platform level code or solution level code
  • Ability to own and pursue a research agenda
  • PhD in Electrical Engineering, Computer Engineering, Computer Science, AI, Mathematics, or related fields with 2 years of experience OR MS with 4 years of experience

Nice to have

  • LLM
  • NLP
  • geometric deep learning
  • Graph Neural Networks
  • Sequential Models
  • Multivariate Time Series
  • deep learning theory
  • optimization (training & inference)
  • model sparsification
  • quantization

What the JD emphasized

  • track record of delivering models at scale
  • track record of coming up with new ideas or improving upon existing ideas in machine learning
  • publications

Other signals

  • building AI foundation models
  • applied research
  • delivering models at scale