AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Currently tracking 241 active AI roles, down 26% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $123k–$392k (avg $231k).

Hiring
241 / 262
Momentum (4w)
↓-218 -26%
622 opens last 4w · 840 prior 4w
Salary range · avg $231k
$123k–$392k
USD · disclosed roles only
Tracked since
Aug '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
May 19
1 new role
26
1 new role
Jul 21
1 new role
Aug 25
2 new roles
Sep 8
2 new roles
15
2 new roles
29
1 new role
Oct 13
1 new role
20
5 new roles
27
3 new roles
Nov 3
2 new roles
17
1 new role
24
3 new roles
Dec 1
1 new role
8
5 new roles
15
1 new role
22
1 new role
29
18 new roles
Jan 5
29 new roles
12
12 new roles
19
23 new roles
26
28 new roles
Feb 2
24 new roles
9
22 new roles
16
36 new roles
23
45 new roles
Mar 2
49 new roles
9
57 new roles
16
74 new roles
23
88 new roles
30
129 new roles
Apr 6
135 new roles
13
188 new roles
20
259 new roles
27
314 new roles
May 4
206 new roles
11
158 new roles
18
162 new roles
25
182 new roles
Jun 1
199 new roles
8
155 new roles
15
86 new roles
22

Capital One currently has 293 active AI-related job listings. The majority of these roles are focused on serving infrastructure, accounting for 28% of the total, followed closely by agents at 26% and post-training at 23%. Engineering is the dominant function, with 234 roles, and hiring is primarily concentrated in the United States. Frequent tech tags include model_serving, vector_db, and llm_observability, suggesting a focus on the operational aspects of AI deployment. In the last 30 days, Capital One posted 124 new AI roles, representing a 22% increase compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is Capital One hiring for?

    Capital One currently has 305 active AI-related roles in our index. The most common open titles are: Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (9), Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (8), Applied Researcher I (6), Distinguished Engineer (6), Applied Researcher II (5). Most positions are in Engineering and Research.

  • What stage of AI development does Capital One focus on?

    Capital One's active AI hiring is concentrated in: serving infrastructure (28%), agents (27%), post-training (23%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is Capital One hiring AI talent?

    Capital One is hiring AI talent in: United States (299 roles), United Kingdom (3 roles), Canada (2 roles), Philippines (1 role).

  • What technologies does Capital One's AI team work with?

    Job postings at Capital One most frequently reference: model serving, vector db, fine tuning, llm observability, inference infra.

  • How many AI roles has Capital One posted recently?

    In the past 30 days, Capital One has posted 96 new AI-related roles. That is a -26% change versus the prior 30 days (130 → 96).

Jobs (326)

245 AI · 1392 total active
Show
Active onlyAI only (≥ 7)
Stage
AllData · 5Pretrain · 12Post-train · 86Serve · 100Agent · 79Ship · 44
Function
AllEngineering · 272Research · 39Product · 15
Country
AllUnited States · 321United Kingdom · 4Canada · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Director, AI Engineering (Agentic AI Platform)
Director of AI Engineering focused on building and managing an Agentic AI Platform. The role involves defining strategic vision, technical leadership for agentic workflows, overseeing the platform lifecycle, guiding technical teams, and ensuring scalability, reliability, and compliance. It requires strong leadership, technical expertise in AI/ML, and collaboration with cross-functional teams to deliver AI-powered products and foundational AI systems.
AgentEngineeringSan Jose, CA +15w ago8
Senior Lead AI Engineer (Gen AI Platform Services)
Senior Lead AI Engineer role focused on building and optimizing Gen AI platform services, including foundation model training, LLM inference, similarity search, guardrails, evaluation, and governance. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging cloud platforms and AI technologies, and optimizing performance for scalability, cost, latency, and throughput.
Serve
51–100 of 326← Prev1234567Next →
Agent
Engineering
San Jose, CA +2
5w ago
8
Distinguished AI Engineer
Distinguished AI Engineer role focused on designing, developing, and deploying AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput in large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and PyTorch. The candidate will contribute to the technical vision and roadmap of foundational AI systems.
ServePost-trainEngineeringBangalore, IN5w ago8
Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch, and optimizing LLM performance for scalability, cost, latency, and throughput.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role focuses on optimizing LLM performance for scalability, cost, latency, and throughput in production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails. This role is part of the Intelligent Foundations and Experiences (IFX) team, aiming to advance AI science and engineering and deploy proprietary solutions.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Lead AI Engineer (LLM Gateway, FM Hosting)
Senior Lead AI Engineer role focused on building and optimizing LLM inference infrastructure (Gateway, FM Hosting) and related AI components like similarity search, guardrails, evaluation, and observability for enterprise-scale AI products at Capital One. The role involves designing, developing, testing, deploying, and supporting these AI software components, with a strong emphasis on improving performance (scalability, cost, latency, throughput) of large-scale production AI systems.
ServeAgentEngineeringMcLean, VA +36w ago8
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on optimizing LLM inference for scalable, cost-effective production AI systems within an enterprise setting. The role involves designing, developing, and deploying AI software components, including foundation model training, inference, similarity search, guardrails, evaluation, and observability, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +26w ago8
Sr. Lead AI Engineer (GenAI Platform)
Sr. Lead AI Engineer focused on building and scaling GenAI platforms, including foundation model training, LLM inference, similarity search, guardrails, evaluation, governance, and observability. The role involves optimizing performance, cost, and latency of large-scale production AI systems using various open-source and cloud technologies.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Distinguished Engineer, AI Compute (Remote Eligible)
Senior Distinguished Engineer focused on architecting and building the AI compute infrastructure for Capital One's enterprise machine learning platform. This role involves developing scalable, high-performance systems for diverse AI workloads including LLM pre-training, fine-tuning, inference, and agentic applications, leveraging distributed compute frameworks like Ray and Spark on cloud substrates.
ServePretrainEngineeringSan Francisco, CA +5 · Remote7w ago8
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, agentic AI, similarity search, guardrails, and model evaluation. The role involves optimizing performance, scalability, cost, and latency of large-scale production AI systems.
AgentServeEngineeringSan Jose, CA +47w ago8
Distinguished AI Engineer (Remote)
Distinguished AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing performance, scalability, cost, and latency of large-scale production AI systems, and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringMcLean, VA +1 · Remote7w ago8
Lead Machine Learning Engineer (Gen AI, Python, Go, AWS)
Lead Machine Learning Engineer focused on designing, building, and productionizing Generative AI applications and Agentic Workflow systems at scale. The role involves building robust ML serving architecture, developing high-performance code, and ensuring low latency and high availability of AI solutions, with a strong emphasis on cloud-native platforms and MLOps.
ServeAgentEngineeringSan Francisco, CA +37w ago8
Senior Manager, Data Science - LLM Customization Team
Senior Manager of Data Science focused on LLM customization within Capital One's AI Foundations team. The role involves partnering with cross-functional teams to deliver AI-powered products, leveraging technologies like PyTorch, Hugging Face, LangChain, and VectorDBs. Responsibilities include adapting and fine-tuning LLMs for business applications, building NLP models through development phases, and operationalizing them in production systems. The ideal candidate has experience in training language models, expertise in areas like self-supervised learning or RLHF, and a track record of delivering models at scale.
Post-trainServeEngineeringNew York, NY +27w ago8
Lead AI/ML Engineer (Platform, kubeflow)
Lead AI/ML Engineer focused on building and optimizing AI platforms and infrastructure, including foundation model training, LLM inference, similarity search, guardrails, and evaluation. The role involves designing, developing, and deploying AI software components, leveraging various AI technologies, and improving the performance of large-scale production AI systems.
ServeAgentEngineeringSan Jose, CA +37w ago8
Distinguished AI Engineer
Distinguished AI Engineer role focused on designing, developing, and deploying AI software components including foundation model training, LLM inference, similarity search, guardrails, evaluation, and observability. The role involves optimizing performance, scalability, cost, and latency of large-scale production AI systems, leveraging cloud platforms and AI technologies. It requires a strong engineering and mathematics foundation, with experience in Python, Go, Scala, or Java, and cloud deployment.
ServeAgentEngineeringMcLean, VA +57w ago8
Lead AI Engineer (Gen AI Platform Services)
Lead AI Engineer role focused on building and optimizing Gen AI Platform Services. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging various AI technologies and optimizing large-scale production AI systems for performance, scalability, cost, and latency.
ServeAgentEngineeringSan Jose, CA +18w ago8
Applied Researcher I
Applied Researcher I role focused on building AI foundation models through all phases of development, from design through training, evaluation, validation, and implementation. The role involves high-impact applied research to push the latest AI developments into next-generation customer experiences, leveraging technologies like Pytorch, AWS Ultraclusters, Huggingface, Lightning, and VectorDBs. Experience with training optimization, self-supervised learning, robustness, explainability, and RLHF is desired, with a track record of delivering models at scale.
Post-trainPretrainResearchNew York, NY +38w ago8
Distinguished Engineer - Agentic AI
Distinguished Engineer role focused on building agentic AI systems for real-time marketing and personalized sales within the enterprise banking domain. The role involves prototyping, defining technical strategy, and scaling these systems, with a focus on influencing architecture and evangelizing AI adoption.
AgentEngineeringMcLean, VA8w ago8
Manager, Data Science - AI Software Engineering
Manager of Data Science focused on AI Software Engineering, designing and building scalable AI architectures for the software development lifecycle using multi-agent solutions. The role involves partnering with cross-functional teams, leveraging technologies like Python, AWS, and Spark, and building ML models through all phases of development. Experience with agentic platforms, RAG, and advanced model customization is preferred.
AgentPost-trainEngineeringMcLean, VA +28w ago8
Manager, Data Science - GenAI Digital Assistant
Manager, Data Science role focused on GenAI for a digital assistant, involving research, development, fine-tuning LLMs, inference optimization, and multi-agentic workflows. Leverages Python, AWS, LangChain, LangGraph, HuggingFace, vLLM, and VectorDBs. Aims to improve customer experience through intelligent digital assistance.
AgentPost-trainEngineeringSan Jose, CA +2Apr 248
Manager, Data Scientist - Recommendation & Personalization Systems
Manager, Data Scientist role focused on building and deploying personalized recommendation engines using Foundation Models and Reinforcement Learning. The role involves partnering with cross-functional teams, leveraging technologies like Python and AWS, and building ML models through all phases of development. Expertise in Transformer-based architectures and scalable systems is required.
AgentPost-trainEngineeringMcLean, VA +2Apr 238
Senior Lead AI Engineer, AI Foundations
This role focuses on designing, developing, testing, deploying, and supporting AI software components for foundational AI systems at Capital One. Key responsibilities include foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role also involves optimizing LLM performance for scalability, cost, latency, and throughput, leveraging technologies like AWS, Huggingface, VectorDBs, and PyTorch. The goal is to build and deploy proprietary AI solutions that deliver value to millions of customers and enhance products with AI capabilities.
ServeAgentEngineeringNew York, NY +3Apr 238
Lead AI Engineer, AI Foundations
Lead AI Engineer focused on building and optimizing AI Foundations, including foundation model training, LLM inference, similarity search, guardrails, evaluation, governance, and observability. The role involves designing, developing, testing, deploying, and supporting AI software components, with a strong emphasis on improving performance (scalability, cost, latency, throughput) of large-scale production AI systems using state-of-the-art LLM optimization techniques. The role also touches on agentic systems through guardrails and similarity search, and model evaluation.
ServeAgentEngineeringNew York, NY +3Apr 238
Senior Lead AI Engineer (Gen AI Platform Services)
Senior Lead AI Engineer role focused on building and scaling Gen AI Platform Services, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves optimizing AI systems for performance, cost, and latency, and contributing to the technical vision for foundational AI systems at Capital One.
ServeAgentEngineeringSan Jose, CA +1Apr 238
Applied Researcher I
Applied Researcher I role focused on building AI foundation models and delivering AI-powered products, leveraging state-of-the-art AI developments for customer experiences. The role involves research, training, evaluation, and implementation of large deep learning models, with a focus on optimization, self-supervised learning, robustness, explainability, and RLHF.
Post-trainPretrainResearchNew York, NY +4Apr 228
Senior Lead AI Engineer
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. The role leverages various AI technologies and requires strong engineering and mathematical foundations.
ServeAgentEngineeringNew York, NY +4Apr 228
Lead AI Engineer
Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. It requires experience with AI/ML algorithms, programming languages like Python, and cloud platforms, with a focus on deploying scalable and responsible AI solutions.
ServeAgentEngineeringNew York, NY +4Apr 228
Principal Associate, Data Scientist - LLM Customization Team
Capital One is seeking a Principal Associate Data Scientist to join their LLM Customization Team. This role involves partnering with cross-functional teams to deliver AI-powered products, leveraging technologies like Pytorch, Hugging Face, LangChain, and VectorDBs. The primary focus is on adapting and fine-tuning LLMs for business-specific applications, building NLP models through all phases of development, and operationalizing them in production systems.
Post-trainServeEngineeringNew York, NY +1Apr 208
Applied Researcher II (AI Foundations, LLM Core and Agentic AI)
Applied Researcher II focused on AI Foundations, LLM Core, and Agentic AI at Capital One. The role involves partnering with cross-functional teams to deliver AI-powered products, leveraging technologies like Pytorch, AWS, Huggingface, and VectorDBs. Responsibilities include building AI foundation models through all development phases (design, training, evaluation, validation, implementation) and conducting high-impact applied research to improve customer experiences. The ideal candidate has a strong technical background, experience building large deep learning models, expertise in areas like training optimization, self-supervised learning, robustness, explainability, or RLHF, and a track record of delivering models at scale. Experience with LLM pre-training, optimization, or fine-tuning is highly preferred.
Post-trainAgentResearchNew York, NY +4Apr 178
Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services)
Senior Manager, AI Engineering (People Leader) for Gen AI Platform Services at Capital One. This role involves overseeing the design, development, testing, deployment, and support of AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The candidate will make build-vs-buy decisions, optimize LLM performance (scalability, cost, latency, throughput), contribute to the technical vision and roadmap of foundational AI systems, and lead/mentor an AI engineering team. Experience with cloud platforms and deploying scalable AI solutions is required.
ServePost-trainEngineeringSan Jose, CA +1Apr 148
Manager, Product Management - AI Strategy & Enablement (Business Cards & Payments)
Product Manager to execute technical AI vision, scale AI use cases, data infrastructure, development velocity, and knowledge governance. Design agentic workflows, evaluate LLM models, build agentic-first architectures, and connect AI to business goals.
AgentPost-trainProductMcLean, VA +2Apr 148
Sr. Distinguished AI Engineer
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will invent and introduce LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch. The role involves contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringCambridge, MA +5Apr 108
Lead AI Engineer ( MLX, Gen AI Platform Services, Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. It requires leveraging a broad stack of AI technologies and optimizing LLM performance for scalability, cost, and latency.
ServeAgentEngineeringNew York, NY +4Apr 98
Distinguished AI Engineer
This role focuses on architecting and launching conversational AI experiences for millions of customers, involving foundation model training, LLM inference, and optimization of AI systems. It requires partnering with cross-functional teams to deliver AI-powered products and leveraging a broad stack of AI technologies.
AgentServeEngineeringCambridge, MA +2Apr 88
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails, and optimizing LLM performance for scalability, cost, and latency. The position requires a strong engineering foundation and experience in AI/ML algorithm development and deployment on cloud platforms.
AgentServeEngineeringNew York, NY +4Apr 78
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, optimization, and agentic AI platforms. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging various AI technologies, and contributing to the technical vision and roadmap.
ServeAgentEngineeringCambridge, MA +4Apr 68
Sr. Lead AI Engineer (Gen AI Platform Services)
This role focuses on engineering AI-powered products and platforms, specifically within Generative AI. Responsibilities include designing, developing, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role also involves optimizing LLM performance for scalability, cost, latency, and throughput, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringSan Jose, CA +3Apr 68
Sr. Distinguished Machine Learning Engineer (Remote-Eligible)
This role focuses on building and scaling the intelligence and infrastructure for real-time, personalized customer experiences using ML and GenAI systems. It involves defining technical strategy, partnering with data science and ML platform teams, developing a rules engine, building ML infrastructure for end-to-end workflows, architecting low-latency event-driven systems, driving MLOps, and optimizing LLM performance for production AI systems. The role also involves providing technical leadership and leveraging various AI technologies.
AgentServeEngineeringMcLean, VA +1 · RemoteApr 68
Lead AI Engineer (AI Foundations)
Lead AI Engineer focused on AI Foundations, responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role involves optimizing large-scale production AI systems for performance (scalability, cost, latency, throughput) using various AI technologies and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringCambridge, MA +3Apr 38
Senior Lead AI Engineer,(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies and optimizing LLM performance for scalability, cost, and latency.
ServeAgentEngineeringSan Jose, CA +3Apr 38
Senior Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and scaling Gen AI platform services, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves optimizing performance (scalability, cost, latency, throughput) of large-scale production AI systems and contributing to the technical vision for foundational AI systems. Requires strong engineering and AI expertise, with experience in cloud platforms and programming languages like Python.
ServeAgentEngineeringSan Jose, CA +3Apr 38
Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, similarity search, guardrails, and model evaluation, with an emphasis on optimizing performance and scalability for enterprise use.
ServeAgentEngineeringNew York, NY +4Apr 38
Senior Lead AI Engineer (FM Hosting, LLM Inference)
Senior Lead AI Engineer focused on LLM inference and hosting infrastructure, optimizing performance, scalability, cost, and latency for large-scale production AI systems. The role involves designing, developing, and deploying AI software components, including foundation model training, inference, similarity search, guardrails, evaluation, governance, and observability, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +3Apr 28
Sr. Lead AI Engineer (AI Foundations)
This role focuses on engineering AI foundations, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, and optimization techniques for scalability, cost, latency, and throughput. It involves leveraging AI technologies and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringNew York, NY +3Apr 18
Lead AI Engineer (AI Foundations)
Lead AI Engineer focused on building and optimizing foundational AI systems, including LLM inference, similarity search, guardrails, and model evaluation, to enhance customer and associate experiences within a large enterprise.
ServeAgentEngineeringNew York, NY +4Apr 18
Director, Data Scientist - Generative AI Systems
Capital One is seeking a Director, Data Scientist to lead the Generative AI Systems team. This role involves building and operationalizing AI-powered products, specifically focusing on LLMs for customer-facing applications in dialogue, summarization, comprehension, speech, and image processing. The position requires leading a team of specialists, experimenting with generative AI, and contributing to research. The role emphasizes partnering with cross-functional teams, leveraging technologies like PyTorch, AWS, Hugging Face, LangChain, and VectorDBs, and managing the full ML lifecycle from design to production for over 80 million customers.
AgentPost-trainEngineeringMcLean, VA +1Mar 318
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails, and optimizing LLM performance for scalability, cost, and latency. The position requires a strong engineering foundation and experience in AI/ML algorithm development and deployment on cloud platforms.
AgentServeEngineeringNew York, NY +4Mar 318
Distinguished AI Engineer
This role focuses on engineering and deploying AI-powered products and foundational AI systems, including large language model inference, optimization, and related components like similarity search and guardrails. The primary focus is on the serving and optimization of AI models in production, with a secondary involvement in agentic systems.
ServeAgentEngineeringSan Francisco, CA +4Mar 308
Senior Distinguished AI Engineer
Senior Distinguished AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves inventing and introducing state-of-the-art LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems, and contributing to the technical vision and roadmap of foundational AI systems. Requires strong engineering and mathematics foundation, expertise in hardware, software, and AI, and experience with cloud platforms and programming languages like Python, Go, Scala, or Java.
ServeAgentEngineeringSan Francisco, CA +5Mar 258
Lead AI Engineer (MLX)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput using various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +4Mar 258