AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Currently tracking 241 active AI roles, down 26% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $123k–$392k (avg $231k).

Hiring
241 / 262
Momentum (4w)
↓-218 -26%
622 opens last 4w · 840 prior 4w
Salary range · avg $231k
$123k–$392k
USD · disclosed roles only
Tracked since
Aug '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
May 19
1 new role
26
1 new role
Jul 21
1 new role
Aug 25
2 new roles
Sep 8
2 new roles
15
2 new roles
29
1 new role
Oct 13
1 new role
20
5 new roles
27
3 new roles
Nov 3
2 new roles
17
1 new role
24
3 new roles
Dec 1
1 new role
8
5 new roles
15
1 new role
22
1 new role
29
18 new roles
Jan 5
29 new roles
12
12 new roles
19
23 new roles
26
28 new roles
Feb 2
24 new roles
9
22 new roles
16
36 new roles
23
45 new roles
Mar 2
49 new roles
9
57 new roles
16
74 new roles
23
88 new roles
30
129 new roles
Apr 6
135 new roles
13
188 new roles
20
259 new roles
27
314 new roles
May 4
206 new roles
11
158 new roles
18
162 new roles
25
182 new roles
Jun 1
199 new roles
8
155 new roles
15
86 new roles
22
Capital One

Capital One

Banking · Banking

HQ
McLean, US
Founded
1994
Website
capitalone.com

Capital One currently has 293 active AI-related job listings. The majority of these roles are focused on serving infrastructure, accounting for 28% of the total, followed closely by agents at 26% and post-training at 23%. Engineering is the dominant function, with 234 roles, and hiring is primarily concentrated in the United States. Frequent tech tags include model_serving, vector_db, and llm_observability, suggesting a focus on the operational aspects of AI deployment. In the last 30 days, Capital One posted 124 new AI roles, representing a 22% increase compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is Capital One hiring for?

    Capital One currently has 305 active AI-related roles in our index. The most common open titles are: Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (9), Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (8), Applied Researcher I (6), Distinguished Engineer (6), Applied Researcher II (5). Most positions are in Engineering and Research.

  • What stage of AI development does Capital One focus on?

    Capital One's active AI hiring is concentrated in: serving infrastructure (28%), agents (27%), post-training (23%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is Capital One hiring AI talent?

    Capital One is hiring AI talent in: United States (299 roles), United Kingdom (3 roles), Canada (2 roles), Philippines (1 role).

  • What technologies does Capital One's AI team work with?

    Job postings at Capital One most frequently reference: model serving, vector db, fine tuning, llm observability, inference infra.

  • How many AI roles has Capital One posted recently?

    In the past 30 days, Capital One has posted 96 new AI-related roles. That is a -26% change versus the prior 30 days (130 → 96).

Jobs (693)

245 AI · 1392 total active
FilteredFunctionEngineering×
Show
Active onlyAI only (≥ 7)
Stage
AllData · 8Pretrain · 11Post-train · 70Serve · 84Agent · 82Eval Gate · 1Ship · 49
Function
AllEngineering · 693Product · 660Research · 33
Country
AllUnited States · 1257Canada · 52United Kingdom · 38Philippines · 31Mexico · 14
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Sr. Distinguished AI Engineer (Agentic AI Platform)
Senior Distinguished AI Engineer focused on building and scaling an Agentic AI Platform at Capital One. The role involves contributing to platform architecture, standardizing agentic workflows using frameworks like LangGraph and AutoGen, developing GenAI SDKs/CLIs, implementing central guardrail services for trust and safety, optimizing orchestration for cost reduction, and driving innovation in areas like multimodal RAG and hierarchical agent memory. The role also includes coaching and evangelizing the platform vision.
AgentEngineeringSan Jose, CA +48w ago9
Senior Director, Software Engineering - AI
This role leads multiple teams of AI/ML software engineers to develop and manage enterprise LLM orchestration, generative AI pipelines, and low-latency inference microservices. It involves scaling production-grade ML systems and traditional architectures, mentoring engineers, and ensuring robust AI engineering practices for ethical deployment.
1–50 of 693← Prev12…14Next →
Agent
Serve
Engineering
Plano, TX
1w ago
8
Senior Lead AI Engineer (GenAI Platform Services)
This role focuses on designing, developing, testing, deploying, and supporting AI software components for GenAI Platform Services. It involves foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role also emphasizes optimizing large-scale production AI systems for performance (scalability, cost, latency, throughput) and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringSan Jose, CA +32w ago8
Lead AI Engineer (Vision model customization, VML)
Lead AI Engineer focused on vision model customization and VML, responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing large-scale production AI systems for performance (scalability, cost, latency, throughput) and contributing to the technical vision and roadmap of foundational AI systems at Capital One, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails.
ServeAgentEngineeringNew York, NY +32w ago8
Lead Machine Learning Engineer (Manager IC)
Lead Machine Learning Engineer at Capital One focused on building and productionizing foundation models using self-supervised learning for transformer architectures. The role involves large-scale training, representation learning, and serving models in production for applications like fraud, marketing, and servicing. Responsibilities include technical design, development, implementation, model/application code, ML architectural decisions, and ensuring high availability and performance.
PretrainServeEngineeringMcLean, VA +32w ago8
Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services)
Senior Manager of AI Engineering leading a team focused on building and deploying Gen AI Platform Services. The role involves overseeing the design, development, and support of AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, and observability. It also requires making build-vs-buy decisions, optimizing LLM performance, and contributing to the technical vision and roadmap for foundational AI systems.
ServeAgentEngineeringSan Jose, CA +42w ago8
Senior Lead AI Engineer, Gen AI Platform
This role focuses on engineering and optimizing large-scale production AI systems, specifically within the Generative AI Platform at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role also involves inventing and applying state-of-the-art LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of these systems. The ideal candidate is deeply technical, experienced in AI/ML algorithms and technologies, and skilled in programming languages like Python, Go, Scala, or Java, with a strong foundation in engineering and mathematics.
ServeAgentEngineeringNew York, NY +22w ago8
Senior Associate, Data Scientist - NLP
Senior Associate Data Scientist focused on NLP and LLMs for a financial services company's mobile app. The role involves building, adapting, and fine-tuning LLMs for customer-facing features, operationalizing models in production systems, and leveraging technologies like PyTorch, Hugging Face, LangChain, and VectorDBs. The position requires experience in model development phases from design to validation and operationalization at scale for a large customer base.
Post-trainServeEngineeringMcLean, VA +22w ago8
Manager, Data Science - GenAI Digital Assistant
Manager, Data Science role focused on GenAI and conversational AI for a digital assistant, involving research, fine-tuning LLMs, inference optimization, and multi-agentic workflows within a fintech company.
AgentPost-trainEngineeringSan Jose, CA +22w ago8
Senior Manager, Data Science - AI Foundations
Senior Manager, Data Science - AI Foundations at Capital One. This role focuses on building and shipping AI/ML solutions for the company's mobile app, leveraging technologies like PyTorch, AWS, Hugging Face, LangChain, and VectorDBs. The position involves adapting and fine-tuning LLMs for customer-facing applications, building ML and NLP models through all development phases, and operationalizing them in production systems serving over 80 million customers. The ideal candidate has experience in training language models, computer vision models, and expertise in areas like training optimization, self-supervised learning, explainability, and RLHF, with a track record of delivering models at scale.
Post-trainServeEngineeringMcLean, VA +22w ago8
Lead AI Engineer (Vision model customization, VLM)
Lead AI Engineer focused on customizing vision models (VLMs) and optimizing large-scale AI systems, including foundation model training and LLM inference. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging technologies like AWS, Huggingface, VectorDBs, and Nemo Guardrails. Emphasis is placed on improving performance (scalability, cost, latency, throughput) of production AI systems and contributing to the technical vision for foundational AI systems.
ServePost-trainEngineeringNew York, NY +32w ago8
Lead Machine Learning Engineer
Lead Machine Learning Engineer at Capital One, focused on building and deploying AI-powered risk management solutions. The role involves designing, developing, testing, and deploying AI software components, including LLM inference, similarity search, guardrails, governance, observability, and agentic AI. Responsibilities include fine-tuning, developing, and evaluating ML and foundation models, contributing to technical vision, and leveraging a broad stack of AI technologies. The role also requires retraining, maintaining, and monitoring production models, constructing optimized data pipelines, and ensuring responsible and explainable AI practices.
AgentPost-trainEngineeringCambridge, MA +22w ago8
Manager, Data Scientist - Recommendation & Personalization Systems
Manager, Data Scientist role focused on building and deploying personalized recommendation engines using Foundation Models, Reinforcement Learning, and Transformer-based architectures for a large-scale fintech company. The role involves partnering with cross-functional teams, leveraging technologies like Python, AWS, and Spark, and building ML models through all phases of development.
AgentPost-trainEngineeringMcLean, VA +22w ago8
Lead Machine Learning Engineer (Manager IC)
Lead Machine Learning Engineer at Capital One's Risk Tech division, focusing on building and deploying AI-powered risk management solutions. The role involves designing, developing, testing, deploying, and supporting AI software components, including fine-tuning models, managing LLM inference, similarity search, guardrails, governance, observability, and agentic AI. Responsibilities include contributing to the technical roadmap, leveraging AI technologies, informing ML infrastructure decisions, maintaining production models, and constructing data pipelines, with an emphasis on Responsible and Explainable AI.
AgentPost-trainEngineeringMcLean, VA +22w ago8
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Senior Lead AI Engineer role focused on AI Foundations, LLM Core, and Agentic AI. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch, and optimizing LLM performance for scalability, cost, latency, and throughput in production AI systems.
AgentServeEngineeringNew York, NY +32w ago8
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and optimizing AI systems, including foundation models, LLM inference, agentic AI, and related infrastructure. The role involves designing, developing, testing, deploying, and supporting AI software components, with a strong emphasis on improving performance, scalability, cost, and latency of large-scale production AI systems. It requires leveraging various AI technologies and contributing to the technical vision and roadmap for foundational AI systems.
AgentServeEngineeringNew York, NY +32w ago8
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
This role focuses on optimizing the performance, scalability, cost, and latency of large-scale production AI systems, specifically for foundation model training and large language model inference. It involves designing, developing, and deploying AI software components, including inference services, and contributing to the AI platform. The role also touches upon aspects of foundation model training and agentic systems (via guardrails, similarity search).
ServeAgentEngineeringSan Jose, CA +43w ago8
Director, Data Scientist
Director of Data Science for the Generative AI Systems team at Capital One, focusing on building and delivering state-of-the-art generative AI solutions for internal efficiency and customer-facing applications. The role involves leading a team of NLP, speech, and computer vision specialists, experimenting with emerging generative AI technologies, and contributing to research.
ShipPost-trainEngineeringMcLean, VA +13w ago8
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on LLM inference and optimization for AI systems within a large enterprise. The role involves designing, developing, and deploying AI software components, with a strong emphasis on improving the performance, scalability, cost, and latency of production AI systems.
ServeEngineeringNew York, NY +33w ago8
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on LLM inference and optimization for AI systems within a large enterprise. The role involves designing, developing, and deploying AI software components, with a strong emphasis on improving the performance, scalability, cost, and latency of production AI systems.
ServeEngineeringNew York, NY +33w ago8
Lead AI Engineer (GenAI Platform, AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and deploying GenAI platforms, LLM core, and agentic AI systems within an enterprise setting. Responsibilities include designing, developing, and supporting AI software components, optimizing LLM performance, and contributing to the technical vision for foundational AI systems. Requires experience with AI/ML algorithms, programming languages, and cloud platforms, with a focus on deploying scalable and responsible AI solutions.
AgentServeEngineeringMcLean, VA +44w ago8
Senior Data Scientist, AI Foundations
Senior Data Scientist focused on building and shipping AI/ML solutions for a mobile app, including adapting and fine-tuning LLMs for customer-facing applications. The role involves building ML and NLP models through all development phases, from design to training, evaluation, and validation, and operationalizing them in production systems serving millions of customers. Experience with LLMs, NLP, training language models, and delivering models at scale is required.
Post-trainServeEngineeringNew York, NY +14w ago8
Distinguished AI Engineer
Distinguished AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance (scalability, cost, latency, throughput) for large-scale production AI systems and contributing to the technical vision and roadmap of foundational AI systems. It requires strong engineering and mathematics foundations, expertise in Python/Go/Scala/Java, and experience with cloud platforms and AI technologies like Huggingface, VectorDBs, and PyTorch.
ServeAgentEngineeringMcLean, VA +45w ago8
Director, AI Engineering (Agentic AI Platform)
Director of AI Engineering focused on building and managing an Agentic AI Platform. The role involves defining strategic vision, technical leadership for agentic workflows, overseeing the platform lifecycle, guiding technical teams, and ensuring scalability, reliability, and compliance. It requires strong leadership, technical expertise in AI/ML, and collaboration with cross-functional teams to deliver AI-powered products and foundational AI systems.
AgentEngineeringSan Jose, CA +15w ago8
Senior Lead AI Engineer (Gen AI Platform Services)
Senior Lead AI Engineer role focused on building and optimizing Gen AI platform services, including foundation model training, LLM inference, similarity search, guardrails, evaluation, and governance. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging cloud platforms and AI technologies, and optimizing performance for scalability, cost, latency, and throughput.
ServeAgentEngineeringSan Jose, CA +25w ago8
Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch, and optimizing LLM performance for scalability, cost, latency, and throughput.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role focuses on optimizing LLM performance for scalability, cost, latency, and throughput in production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails. This role is part of the Intelligent Foundations and Experiences (IFX) team, aiming to advance AI science and engineering and deploy proprietary solutions.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Lead AI Engineer (LLM Gateway, FM Hosting)
Senior Lead AI Engineer role focused on building and optimizing LLM inference infrastructure (Gateway, FM Hosting) and related AI components like similarity search, guardrails, evaluation, and observability for enterprise-scale AI products at Capital One. The role involves designing, developing, testing, deploying, and supporting these AI software components, with a strong emphasis on improving performance (scalability, cost, latency, throughput) of large-scale production AI systems.
ServeAgentEngineeringMcLean, VA +36w ago8
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on optimizing LLM inference for scalable, cost-effective production AI systems within an enterprise setting. The role involves designing, developing, and deploying AI software components, including foundation model training, inference, similarity search, guardrails, evaluation, and observability, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +26w ago8
Sr. Lead AI Engineer (GenAI Platform)
Sr. Lead AI Engineer focused on building and scaling GenAI platforms, including foundation model training, LLM inference, similarity search, guardrails, evaluation, governance, and observability. The role involves optimizing performance, cost, and latency of large-scale production AI systems using various open-source and cloud technologies.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Distinguished Engineer, AI Compute (Remote Eligible)
Senior Distinguished Engineer focused on architecting and building the AI compute infrastructure for Capital One's enterprise machine learning platform. This role involves developing scalable, high-performance systems for diverse AI workloads including LLM pre-training, fine-tuning, inference, and agentic applications, leveraging distributed compute frameworks like Ray and Spark on cloud substrates.
ServePretrainEngineeringSan Francisco, CA +5 · Remote7w ago8
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, agentic AI, similarity search, guardrails, and model evaluation. The role involves optimizing performance, scalability, cost, and latency of large-scale production AI systems.
AgentServeEngineeringSan Jose, CA +47w ago8
Lead AI/ML Engineer (Platform, kubeflow)
Lead AI/ML Engineer focused on building and optimizing AI platforms and infrastructure, including foundation model training, LLM inference, similarity search, guardrails, and evaluation. The role involves designing, developing, and deploying AI software components, leveraging various AI technologies, and improving the performance of large-scale production AI systems.
ServeAgentEngineeringSan Jose, CA +37w ago8
Lead AI Engineer (Gen AI Platform Services)
Lead AI Engineer role focused on building and optimizing Gen AI Platform Services. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging various AI technologies and optimizing large-scale production AI systems for performance, scalability, cost, and latency.
ServeAgentEngineeringSan Jose, CA +18w ago8
Distinguished Engineer - Agentic AI
Distinguished Engineer role focused on building agentic AI systems for real-time marketing and personalized sales within the enterprise banking domain. The role involves prototyping, defining technical strategy, and scaling these systems, with a focus on influencing architecture and evangelizing AI adoption.
AgentEngineeringMcLean, VA8w ago8
Manager, Data Science - GenAI Digital Assistant
Manager, Data Science role focused on GenAI for a digital assistant, involving research, development, fine-tuning LLMs, inference optimization, and multi-agentic workflows. Leverages Python, AWS, LangChain, LangGraph, HuggingFace, vLLM, and VectorDBs. Aims to improve customer experience through intelligent digital assistance.
AgentPost-trainEngineeringSan Jose, CA +2Apr 248
Manager, Data Scientist - Recommendation & Personalization Systems
Manager, Data Scientist role focused on building and deploying personalized recommendation engines using Foundation Models and Reinforcement Learning. The role involves partnering with cross-functional teams, leveraging technologies like Python and AWS, and building ML models through all phases of development. Expertise in Transformer-based architectures and scalable systems is required.
AgentPost-trainEngineeringMcLean, VA +2Apr 238
Senior Lead AI Engineer, AI Foundations
This role focuses on designing, developing, testing, deploying, and supporting AI software components for foundational AI systems at Capital One. Key responsibilities include foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role also involves optimizing LLM performance for scalability, cost, latency, and throughput, leveraging technologies like AWS, Huggingface, VectorDBs, and PyTorch. The goal is to build and deploy proprietary AI solutions that deliver value to millions of customers and enhance products with AI capabilities.
ServeAgentEngineeringNew York, NY +3Apr 238
Lead AI Engineer, AI Foundations
Lead AI Engineer focused on building and optimizing AI Foundations, including foundation model training, LLM inference, similarity search, guardrails, evaluation, governance, and observability. The role involves designing, developing, testing, deploying, and supporting AI software components, with a strong emphasis on improving performance (scalability, cost, latency, throughput) of large-scale production AI systems using state-of-the-art LLM optimization techniques. The role also touches on agentic systems through guardrails and similarity search, and model evaluation.
ServeAgentEngineeringNew York, NY +3Apr 238
Senior Lead AI Engineer (Gen AI Platform Services)
Senior Lead AI Engineer role focused on building and scaling Gen AI Platform Services, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves optimizing AI systems for performance, cost, and latency, and contributing to the technical vision for foundational AI systems at Capital One.
ServeAgentEngineeringSan Jose, CA +1Apr 238
Senior Lead AI Engineer
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. The role leverages various AI technologies and requires strong engineering and mathematical foundations.
ServeAgentEngineeringNew York, NY +4Apr 228
Lead AI Engineer
Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. It requires experience with AI/ML algorithms, programming languages like Python, and cloud platforms, with a focus on deploying scalable and responsible AI solutions.
ServeAgentEngineeringNew York, NY +4Apr 228
Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services)
Senior Manager, AI Engineering (People Leader) for Gen AI Platform Services at Capital One. This role involves overseeing the design, development, testing, deployment, and support of AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The candidate will make build-vs-buy decisions, optimize LLM performance (scalability, cost, latency, throughput), contribute to the technical vision and roadmap of foundational AI systems, and lead/mentor an AI engineering team. Experience with cloud platforms and deploying scalable AI solutions is required.
ServePost-trainEngineeringSan Jose, CA +1Apr 148
Sr. Distinguished AI Engineer
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will invent and introduce LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch. The role involves contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringCambridge, MA +5Apr 108
Lead AI Engineer ( MLX, Gen AI Platform Services, Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. It requires leveraging a broad stack of AI technologies and optimizing LLM performance for scalability, cost, and latency.
ServeAgentEngineeringNew York, NY +4Apr 98
Distinguished AI Engineer
This role focuses on architecting and launching conversational AI experiences for millions of customers, involving foundation model training, LLM inference, and optimization of AI systems. It requires partnering with cross-functional teams to deliver AI-powered products and leveraging a broad stack of AI technologies.
AgentServeEngineeringCambridge, MA +2Apr 88
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails, and optimizing LLM performance for scalability, cost, and latency. The position requires a strong engineering foundation and experience in AI/ML algorithm development and deployment on cloud platforms.
AgentServeEngineeringNew York, NY +4Apr 78
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, optimization, and agentic AI platforms. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging various AI technologies, and contributing to the technical vision and roadmap.
ServeAgentEngineeringCambridge, MA +4Apr 68
Sr. Lead AI Engineer (Gen AI Platform Services)
This role focuses on engineering AI-powered products and platforms, specifically within Generative AI. Responsibilities include designing, developing, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role also involves optimizing LLM performance for scalability, cost, latency, and throughput, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringSan Jose, CA +3Apr 68
Sr. Distinguished Machine Learning Engineer (Remote-Eligible)
This role focuses on building and scaling the intelligence and infrastructure for real-time, personalized customer experiences using ML and GenAI systems. It involves defining technical strategy, partnering with data science and ML platform teams, developing a rules engine, building ML infrastructure for end-to-end workflows, architecting low-latency event-driven systems, driving MLOps, and optimizing LLM performance for production AI systems. The role also involves providing technical leadership and leveraging various AI technologies.
AgentServeEngineeringMcLean, VA +1 · RemoteApr 68