Senior Lead AI Engineer (LLM Gateway, FM Hosting)

Capital One · Banking · McLean, VA +3

This Senior Lead AI Engineer role focuses on building and optimizing LLM inference infrastructure (Gateway, FM Hosting) and related AI components, such as similarity search, guardrails, evaluation, and observability, for enterprise-scale AI products at Capital One. It involves designing, developing, testing, deploying, and supporting these components, with a strong emphasis on improving the performance (scalability, cost, latency, throughput) of large-scale production AI systems.

What you'd actually do

  1. Design, develop, test, deploy, and support AI software components, including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability.
  2. Leverage a broad stack of open-source and SaaS AI technologies such as AWS Ultraclusters, Hugging Face, VectorDBs, NeMo Guardrails, PyTorch, and more.
  3. Invent and introduce state-of-the-art LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems.
  4. Contribute to the technical vision and the long-term roadmap of foundational AI systems at Capital One.

Skills

Required

  • Python
  • Go
  • Scala
  • Java
  • AI software components
  • large language model inference
  • similarity search
  • guardrails
  • model evaluation
  • experimentation
  • governance
  • observability
  • LLM optimization techniques
  • performance optimization
  • scalability
  • cost optimization
  • latency optimization
  • throughput optimization

Nice to have

  • AWS
  • Google Cloud
  • Azure
  • Hugging Face
  • VectorDBs
  • NeMo Guardrails
  • PyTorch
  • C++
  • C#
  • cloud platforms
  • AI systems design
  • AI systems development
  • AI systems integration
  • AI systems delivery
  • AI systems support
  • training optimization
  • inference optimization
  • hardware utilization
  • team leadership
  • mentoring
  • stakeholder influence
  • communication skills
  • presentation skills

What the JD emphasized

  • large-scale production AI systems
  • foundational AI systems

Other signals

  • LLM inference
  • optimization techniques
  • production AI systems
  • foundational AI systems