AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Capital One currently has 293 active AI-related job listings. The majority of these roles are focused on serving infrastructure, accounting for 28% of the total, followed closely by agents at 26% and post-training at 23%. Engineering is the dominant function, with 234 roles, and hiring is primarily concentrated in the United States. Frequent tech tags include model_serving, vector_db, and llm_observability, suggesting a focus on the operational aspects of AI deployment. In the last 30 days, Capital One posted 124 new AI roles, representing a 22% increase compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 241 active AI roles, down 26% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $123k–$392k (avg $231k).

Hiring
241 / 262
Momentum (4w)
↓-218 -26%
622 opens last 4w · 840 prior 4w
Salary range · avg $231k
$123k–$392k
USD · disclosed roles only
Tracked since
Aug '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
May 19
1 new role
26
1 new role
Jul 21
1 new role
Aug 25
2 new roles
Sep 8
2 new roles
15
2 new roles
29
1 new role
Oct 13
1 new role
20
5 new roles
27
3 new roles
Nov 3
2 new roles
17
1 new role
24
3 new roles
Dec 1
1 new role
8
5 new roles
15
1 new role
22
1 new role
29
18 new roles
Jan 5
29 new roles
12
12 new roles
19
23 new roles
26
28 new roles
Feb 2
24 new roles
9
22 new roles
16
36 new roles
23
45 new roles
Mar 2
49 new roles
9
57 new roles
16
74 new roles
23
88 new roles
30
129 new roles
Apr 6
135 new roles
13
188 new roles
20
259 new roles
27
314 new roles
May 4
206 new roles
11
158 new roles
18
162 new roles
25
182 new roles
Jun 1
199 new roles
8
155 new roles
15
86 new roles
22

Frequently asked questions

  • What AI roles is Capital One hiring for?

    Capital One currently has 305 active AI-related roles in our index. The most common open titles are: Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (9), Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (8), Applied Researcher I (6), Distinguished Engineer (6), Applied Researcher II (5). Most positions are in Engineering and Research.

  • What stage of AI development does Capital One focus on?

    Capital One's active AI hiring is concentrated in: serving infrastructure (28%), agents (27%), post-training (23%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is Capital One hiring AI talent?

    Capital One is hiring AI talent in: United States (299 roles), United Kingdom (3 roles), Canada (2 roles), Philippines (1 role).

  • What technologies does Capital One's AI team work with?

    Job postings at Capital One most frequently reference: model serving, vector db, fine tuning, llm observability, inference infra.

  • How many AI roles has Capital One posted recently?

    In the past 30 days, Capital One has posted 96 new AI-related roles. That is a -26% change versus the prior 30 days (130 → 96).

Jobs (272)

245 AI · 1392 total active
FilteredFunctionEngineering×
Show
Active onlyAI only (≥ 7)
Stage
AllData · 5Pretrain · 12Post-train · 86Serve · 100Agent · 79Ship · 44
Function
AllEngineering · 272Research · 39Product · 15
Country
AllUnited States · 321United Kingdom · 4Canada · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Senior Lead AI Engineer (Gen AI Platform Services)
Senior Lead AI Engineer role focused on building and scaling Gen AI Platform Services, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves optimizing AI systems for performance, cost, and latency, and contributing to the technical vision for foundational AI systems at Capital One.
ServeAgentEngineeringSan Jose, CA +1Apr 238
Senior Lead AI Engineer
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. The role leverages various AI technologies and requires strong engineering and mathematical foundations.
51–100 of 272← Prev123456Next →
ServeAgent
Engineering
New York, NY +4
Apr 22
8
Lead AI Engineer
Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. It requires experience with AI/ML algorithms, programming languages like Python, and cloud platforms, with a focus on deploying scalable and responsible AI solutions.
ServeAgentEngineeringNew York, NY +4Apr 228
Principal Associate, Data Scientist - LLM Customization Team
Capital One is seeking a Principal Associate Data Scientist to join their LLM Customization Team. This role involves partnering with cross-functional teams to deliver AI-powered products, leveraging technologies like Pytorch, Hugging Face, LangChain, and VectorDBs. The primary focus is on adapting and fine-tuning LLMs for business-specific applications, building NLP models through all phases of development, and operationalizing them in production systems.
Post-trainServeEngineeringNew York, NY +1Apr 208
Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services)
Senior Manager, AI Engineering (People Leader) for Gen AI Platform Services at Capital One. This role involves overseeing the design, development, testing, deployment, and support of AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The candidate will make build-vs-buy decisions, optimize LLM performance (scalability, cost, latency, throughput), contribute to the technical vision and roadmap of foundational AI systems, and lead/mentor an AI engineering team. Experience with cloud platforms and deploying scalable AI solutions is required.
ServePost-trainEngineeringSan Jose, CA +1Apr 148
Sr. Distinguished AI Engineer
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will invent and introduce LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch. The role involves contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringCambridge, MA +5Apr 108
Lead AI Engineer ( MLX, Gen AI Platform Services, Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. It requires leveraging a broad stack of AI technologies and optimizing LLM performance for scalability, cost, and latency.
ServeAgentEngineeringNew York, NY +4Apr 98
Distinguished AI Engineer
This role focuses on architecting and launching conversational AI experiences for millions of customers, involving foundation model training, LLM inference, and optimization of AI systems. It requires partnering with cross-functional teams to deliver AI-powered products and leveraging a broad stack of AI technologies.
AgentServeEngineeringCambridge, MA +2Apr 88
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails, and optimizing LLM performance for scalability, cost, and latency. The position requires a strong engineering foundation and experience in AI/ML algorithm development and deployment on cloud platforms.
AgentServeEngineeringNew York, NY +4Apr 78
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, optimization, and agentic AI platforms. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging various AI technologies, and contributing to the technical vision and roadmap.
ServeAgentEngineeringCambridge, MA +4Apr 68
Sr. Lead AI Engineer (Gen AI Platform Services)
This role focuses on engineering AI-powered products and platforms, specifically within Generative AI. Responsibilities include designing, developing, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role also involves optimizing LLM performance for scalability, cost, latency, and throughput, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringSan Jose, CA +3Apr 68
Sr. Distinguished Machine Learning Engineer (Remote-Eligible)
This role focuses on building and scaling the intelligence and infrastructure for real-time, personalized customer experiences using ML and GenAI systems. It involves defining technical strategy, partnering with data science and ML platform teams, developing a rules engine, building ML infrastructure for end-to-end workflows, architecting low-latency event-driven systems, driving MLOps, and optimizing LLM performance for production AI systems. The role also involves providing technical leadership and leveraging various AI technologies.
AgentServeEngineeringMcLean, VA +1 · RemoteApr 68
Lead AI Engineer (AI Foundations)
Lead AI Engineer focused on AI Foundations, responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role involves optimizing large-scale production AI systems for performance (scalability, cost, latency, throughput) using various AI technologies and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringCambridge, MA +3Apr 38
Senior Lead AI Engineer,(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies and optimizing LLM performance for scalability, cost, and latency.
ServeAgentEngineeringSan Jose, CA +3Apr 38
Senior Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and scaling Gen AI platform services, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves optimizing performance (scalability, cost, latency, throughput) of large-scale production AI systems and contributing to the technical vision for foundational AI systems. Requires strong engineering and AI expertise, with experience in cloud platforms and programming languages like Python.
ServeAgentEngineeringSan Jose, CA +3Apr 38
Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, similarity search, guardrails, and model evaluation, with an emphasis on optimizing performance and scalability for enterprise use.
ServeAgentEngineeringNew York, NY +4Apr 38
Senior Lead AI Engineer (FM Hosting, LLM Inference)
Senior Lead AI Engineer focused on LLM inference and hosting infrastructure, optimizing performance, scalability, cost, and latency for large-scale production AI systems. The role involves designing, developing, and deploying AI software components, including foundation model training, inference, similarity search, guardrails, evaluation, governance, and observability, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +3Apr 28
Sr. Lead AI Engineer (AI Foundations)
This role focuses on engineering AI foundations, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, and optimization techniques for scalability, cost, latency, and throughput. It involves leveraging AI technologies and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringNew York, NY +3Apr 18
Lead AI Engineer (AI Foundations)
Lead AI Engineer focused on building and optimizing foundational AI systems, including LLM inference, similarity search, guardrails, and model evaluation, to enhance customer and associate experiences within a large enterprise.
ServeAgentEngineeringNew York, NY +4Apr 18
Director, Data Scientist - Generative AI Systems
Capital One is seeking a Director, Data Scientist to lead the Generative AI Systems team. This role involves building and operationalizing AI-powered products, specifically focusing on LLMs for customer-facing applications in dialogue, summarization, comprehension, speech, and image processing. The position requires leading a team of specialists, experimenting with generative AI, and contributing to research. The role emphasizes partnering with cross-functional teams, leveraging technologies like PyTorch, AWS, Hugging Face, LangChain, and VectorDBs, and managing the full ML lifecycle from design to production for over 80 million customers.
AgentPost-trainEngineeringMcLean, VA +1Mar 318
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails, and optimizing LLM performance for scalability, cost, and latency. The position requires a strong engineering foundation and experience in AI/ML algorithm development and deployment on cloud platforms.
AgentServeEngineeringNew York, NY +4Mar 318
Distinguished AI Engineer
This role focuses on engineering and deploying AI-powered products and foundational AI systems, including large language model inference, optimization, and related components like similarity search and guardrails. The primary focus is on the serving and optimization of AI models in production, with a secondary involvement in agentic systems.
ServeAgentEngineeringSan Francisco, CA +4Mar 308
Senior Distinguished AI Engineer
Senior Distinguished AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves inventing and introducing state-of-the-art LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems, and contributing to the technical vision and roadmap of foundational AI systems. Requires strong engineering and mathematics foundation, expertise in hardware, software, and AI, and experience with cloud platforms and programming languages like Python, Go, Scala, or Java.
ServeAgentEngineeringSan Francisco, CA +5Mar 258
Lead AI Engineer (MLX)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput using various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +4Mar 258
Lead AI Engineer
Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies, inventing LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems, and contributing to the technical vision and roadmap of foundational AI systems. Requires strong engineering and mathematics foundation, expertise in hardware, software, and AI, and experience with cloud platforms and AI/ML algorithms.
ServeAgentEngineeringMcLean, VA +3Mar 208
Senior Lead AI Engineer
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems within an enterprise setting. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, and observability. The role emphasizes optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringSan Jose, CA +3Mar 208
Lead AI Engineer (Gen AI Platform, Agentic AI & LLM Infrastructure & Orchestration)
Lead AI Engineer role focused on building and scaling Gen AI platforms, agentic AI systems, and LLM infrastructure. The role involves designing, developing, and deploying AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, and observability. It emphasizes optimizing LLM performance for scalability, cost, latency, and throughput, and leveraging a broad stack of AI technologies.
AgentServeEngineeringSan Jose, CA +4Mar 198
Senior Lead AI Engineer
Senior Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves inventing and introducing state-of-the-art LLM optimization techniques to improve the performance of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails. This position is key to bringing AI capabilities to life at Capital One, empowering teams across the company and delivering value to millions of customers.
ServeAgentEngineeringNew York, NY +3Mar 188
Lead AI Engineer (AI Foundations, LLM Customization and Finetuning)
Lead AI Engineer focused on AI Foundations, LLM Customization and Finetuning within Capital One's Intelligent Foundations and Experiences (IFX) team. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. It also emphasizes optimizing LLM performance for scalability, cost, and latency in production AI systems.
ServePost-trainEngineeringNew York, NY +4Mar 178
Lead AI Engineer
Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies and optimizing LLM performance for scalability, cost, latency, and throughput in production AI systems within an enterprise setting.
ServeAgentEngineeringSan Jose, CA +3Mar 58
Lead AI Engineer
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems within a fintech company. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, and observability. The role emphasizes optimizing LLM performance (scalability, cost, latency, throughput) and contributing to the technical vision and roadmap for AI systems. Requires experience with AI/ML algorithms, programming languages like Python, and cloud platforms, with a focus on deploying scalable and responsible AI solutions.
ServeAgentEngineeringSan Jose, CA +3Feb 278
Distinguished AI Engineer (Agentic AI Platform)
The role is for a Distinguished AI Engineer focused on building an enterprise Generative AI Platform. The engineer will design the agentic workflow framework, shared services (memory, guardrails, vector search, SDKs), and blueprints to enable product teams to compose AI capabilities. Key responsibilities include evaluating agentic frameworks, developing an end-to-end GenAI SDK/CLI, implementing central guardrail services, optimizing orchestration for performance, and mentoring other engineers. The role emphasizes creating scalable, safe, and explainable AI solutions for millions of users.
AgentServeEngineeringSan Jose, CA +1Feb 248
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Senior Lead AI Engineer focused on AI Foundations, LLM Core, and Agentic AI. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. It also focuses on optimizing LLM performance (scalability, cost, latency, throughput) for production AI systems and contributing to the technical vision and roadmap of foundational AI systems.
AgentServeEngineeringNew York, NY +4Feb 238
Senior Lead AI Engineer (Gen AI Platform Services)
This role focuses on engineering and optimizing AI software components, particularly large language model inference and related platform services, to improve performance, scalability, cost, and latency in a production environment. It involves designing, developing, testing, deploying, and supporting these components, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringSan Jose, CA +2Feb 198
Senior Manager AI Engineer (GenAI Platform Services)
Senior Manager AI Engineer role focused on building and deploying GenAI Platform Services. Responsibilities include overseeing AI software components like foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves making build-vs-buy decisions, optimizing LLM performance, and contributing to the technical vision of foundational AI systems. Requires people leadership and experience in deploying scalable AI solutions on cloud platforms.
ServeAgentEngineeringSan Jose, CA +1Feb 188
Senior Lead AI Engineer (GenAI Platform Services)
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, and latency, and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringSan Jose, CA +1Feb 188
Senior Lead AI Engineer (FM Hosting)
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will invent and introduce state-of-the-art LLM optimization techniques to improve the performance of large-scale production AI systems and contribute to the technical vision and roadmap of foundational AI systems. The role involves leveraging various AI technologies and optimizing for scalability, cost, latency, and throughput.
ServeAgentEngineeringNew York, NY +3Feb 98
Sr Distinguished Engineer
Capital One is seeking a Sr. Distinguished Engineer to lead the development and scaling of agentic AI systems for business card marketing, underwriting, and sales. The role involves defining technical strategy, hands-on prototyping, and influencing enterprise architecture to create hyper-personalized marketing systems, assistive technologies, and autonomous sales agents.
AgentEngineeringMcLean, VA +1Feb 58
Distinguished Engineer
Distinguished Engineer role focused on building and scaling agentic AI systems for marketing and sales within an enterprise context. Responsibilities include prototyping, defining technical strategy, influencing architecture, and mentoring teams, with a focus on real-time, hyper-personalized customer engagement and assistive technologies for sales teams.
AgentEngineeringRichmond, VA +1Feb 48
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on LLM inference and optimization for AI-powered products within a large enterprise. The role involves designing, developing, and deploying AI software components, with a strong emphasis on improving the performance, scalability, cost, and latency of production AI systems.
ServeAgentEngineeringNew York, NY +3Feb 48
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
Senior Lead AI Engineer role focused on building and deploying Gen AI platform services, including agentic AI systems. Responsibilities span foundation model training, LLM inference, similarity search, guardrails, evaluation, experimentation, governance, and observability, leveraging technologies like AWS, Huggingface, VectorDBs, and Nemo Guardrails. The role emphasizes optimizing large-scale production AI systems for performance, cost, and latency, and contributing to the technical vision and roadmap for foundational AI systems.
AgentServeEngineeringSan Jose, CA +3Feb 48
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Senior Lead AI Engineer focused on AI Foundations, LLM Core, and Agentic AI. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. It also requires inventing and introducing LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems. The role leverages AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch.
ServeAgentEngineeringCambridge, MA +4Feb 38
Sr. Lead AI Engineer
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will also invent and introduce LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails. The role is within the Intelligent Foundations and Experiences (IFX) team, aiming to advance AI science and engineering and deploy proprietary solutions.
ServeAgentEngineeringMcLean, VA +3Feb 38
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails, and optimizing LLM performance for scalability, cost, and latency. The position requires a strong engineering foundation and experience in AI/ML algorithm development and deployment on cloud platforms.
AgentServeEngineeringNew York, NY +4Feb 38
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Senior Lead AI Engineer role focused on AI Foundations, LLM Core, and Agentic AI. Responsibilities include designing, developing, testing, deploying, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, and governance. The role involves optimizing LLM performance for scalability, cost, and latency, and contributing to the technical vision for foundational AI systems. It requires experience with cloud platforms and AI/ML algorithms, particularly LLM inference, similarity search, vector databases, and guardrails.
ServeAgentEngineeringCambridge, MA +3Feb 38
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer focused on AI Foundations, LLM Core, and Agentic AI. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. Key responsibilities include optimizing LLM performance for scalability, cost, latency, and throughput using various AI technologies and techniques.
ServeAgentEngineeringSan Jose, CA +3Feb 38
Senior Lead AI Engineer (LLM Customization and Finetuning)
Senior Lead AI Engineer focused on LLM customization and finetuning within Capital One's Intelligent Foundations and Experiences (IFX) team. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. It requires leveraging AI technologies like Huggingface, VectorDBs, and PyTorch, and optimizing LLM performance for scalability, cost, latency, and throughput in production AI systems. The candidate should have a strong engineering and mathematics foundation, experience with cloud platforms, and the ability to lead and mentor teams.
Post-trainServeEngineeringCambridge, MA +3Feb 38
Senior Lead AI Engineer (FM Hosting, LLM Inference)
Senior Lead AI Engineer focused on optimizing LLM inference performance, scalability, cost, and latency for production AI systems within Capital One's Intelligent Foundations and Experiences (IFX) team. The role involves designing, developing, and deploying AI software components, including foundation model training, inference, similarity search, guardrails, evaluation, and observability, leveraging cloud platforms and open-source AI technologies.
ServeAgentEngineeringMcLean, VA +2Jan 268
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on optimizing LLM inference for scalability, cost, and latency within an enterprise AI setting. The role involves designing, developing, and deploying AI software components, including foundation model training, inference services, similarity search, guardrails, and model evaluation, leveraging cloud platforms and various AI technologies.
ServeAgentEngineeringNew York, NY +3Jan 238
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Senior Lead AI Engineer role focused on AI Foundations, LLM Core, and Agentic AI. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, and agentic systems. The role involves optimizing LLM performance for scalability, cost, and latency, and contributing to the technical vision and roadmap for foundational AI systems. Requires strong engineering and mathematics foundation, expertise in Python/Go/Scala/Java, and experience with cloud platforms and AI technologies like Huggingface, VectorDBs, and PyTorch.
AgentServeEngineeringNew York, NY +4Jan 158