AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown
Capital One

Capital One

Banking · Banking

HQ
McLean, US
Founded
1994
Website
capitalone.com

Currently tracking 241 active AI roles, down 26% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $123k–$392k (avg $231k).

Hiring
241 / 262
Momentum (4w)
↓-218 -26%
622 opens last 4w · 840 prior 4w
Salary range · avg $231k
$123k–$392k
USD · disclosed roles only
Tracked since
Aug '25
last role 4w ago
Hiring velocityscroll left for older weeks
1 new role
May 19
1 new role
26
1 new role
Jul 21
1 new role
Aug 25
2 new roles
Sep 8
2 new roles
15
2 new roles
29
1 new role
Oct 13
1 new role
20
5 new roles
27
3 new roles
Nov 3
2 new roles
17
1 new role
24
3 new roles
Dec 1
1 new role
8
5 new roles
15
1 new role
22
1 new role
29
18 new roles
Jan 5
29 new roles
12
12 new roles
19
23 new roles
26
28 new roles
Feb 2
24 new roles
9
22 new roles
16
36 new roles
23
45 new roles
Mar 2
49 new roles
9
57 new roles
16
74 new roles
23
88 new roles
30
129 new roles
Apr 6
135 new roles
13
188 new roles
20
259 new roles
27
314 new roles
May 4
206 new roles
11
158 new roles
18
162 new roles
25
182 new roles
Jun 1
199 new roles
8
155 new roles
15
86 new roles
22

Capital One currently has 293 active AI-related job listings. The majority of these roles are focused on serving infrastructure, accounting for 28% of the total, followed closely by agents at 26% and post-training at 23%. Engineering is the dominant function, with 234 roles, and hiring is primarily concentrated in the United States. Frequent tech tags include model_serving, vector_db, and llm_observability, suggesting a focus on the operational aspects of AI deployment. In the last 30 days, Capital One posted 124 new AI roles, representing a 22% increase compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Frequently asked questions

  • What AI roles is Capital One hiring for?

    Capital One currently has 305 active AI-related roles in our index. The most common open titles are: Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (9), Lead AI Engineer (AI Foundations, LLM Core and Agentic AI) (8), Applied Researcher I (6), Distinguished Engineer (6), Applied Researcher II (5). Most positions are in Engineering and Research.

  • What stage of AI development does Capital One focus on?

    Capital One's active AI hiring is concentrated in: serving infrastructure (28%), agents (27%), post-training (23%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is Capital One hiring AI talent?

    Capital One is hiring AI talent in: United States (299 roles), United Kingdom (3 roles), Canada (2 roles), Philippines (1 role).

  • What technologies does Capital One's AI team work with?

    Job postings at Capital One most frequently reference: model serving, vector db, fine tuning, llm observability, inference infra.

  • How many AI roles has Capital One posted recently?

    In the past 30 days, Capital One has posted 96 new AI-related roles. That is a -26% change versus the prior 30 days (130 → 96).

Jobs (79)

245 AI · 1392 total active
FilteredStageServe×
Show
Active onlyAI only (≥ 7)
Stage
AllPretrain · 11Post-train · 62Serve · 79Agent · 64Ship · 29
Function
AllEngineering · 204Research · 31Product · 10
Country
AllUnited States · 241United Kingdom · 3Canada · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Senior Lead AI Engineer (GenAI Platform Services)
This role focuses on designing, developing, testing, deploying, and supporting AI software components for GenAI Platform Services. It involves foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role also emphasizes optimizing large-scale production AI systems for performance (scalability, cost, latency, throughput) and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringSan Jose, CA +32w ago8
Lead AI Engineer (Vision model customization, VML)
Lead AI Engineer focused on vision model customization and VML, responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing large-scale production AI systems for performance (scalability, cost, latency, throughput) and contributing to the technical vision and roadmap of foundational AI systems at Capital One, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails.
1–50 of 79← Prev12Next →
ServeAgent
Engineering
New York, NY +3
2w ago
8
Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services)
Senior Manager of AI Engineering leading a team focused on building and deploying Gen AI Platform Services. The role involves overseeing the design, development, and support of AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, and observability. It also requires making build-vs-buy decisions, optimizing LLM performance, and contributing to the technical vision and roadmap for foundational AI systems.
ServeAgentEngineeringSan Jose, CA +42w ago8
Senior Lead AI Engineer, Gen AI Platform
This role focuses on engineering and optimizing large-scale production AI systems, specifically within the Generative AI Platform at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role also involves inventing and applying state-of-the-art LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of these systems. The ideal candidate is deeply technical, experienced in AI/ML algorithms and technologies, and skilled in programming languages like Python, Go, Scala, or Java, with a strong foundation in engineering and mathematics.
ServeAgentEngineeringNew York, NY +22w ago8
Lead AI Engineer (Vision model customization, VLM)
Lead AI Engineer focused on customizing vision models (VLMs) and optimizing large-scale AI systems, including foundation model training and LLM inference. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging technologies like AWS, Huggingface, VectorDBs, and Nemo Guardrails. Emphasis is placed on improving performance (scalability, cost, latency, throughput) of production AI systems and contributing to the technical vision for foundational AI systems.
ServePost-trainEngineeringNew York, NY +32w ago8
Sr. Lead AI Engineer (Inference Optimization, FM hosting, AI Platform)
This role focuses on optimizing the performance, scalability, cost, and latency of large-scale production AI systems, specifically for foundation model training and large language model inference. It involves designing, developing, and deploying AI software components, including inference services, and contributing to the AI platform. The role also touches upon aspects of foundation model training and agentic systems (via guardrails, similarity search).
ServeAgentEngineeringSan Jose, CA +43w ago8
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on LLM inference and optimization for AI systems within a large enterprise. The role involves designing, developing, and deploying AI software components, with a strong emphasis on improving the performance, scalability, cost, and latency of production AI systems.
ServeEngineeringNew York, NY +33w ago8
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on LLM inference and optimization for AI systems within a large enterprise. The role involves designing, developing, and deploying AI software components, with a strong emphasis on improving the performance, scalability, cost, and latency of production AI systems.
ServeEngineeringNew York, NY +33w ago8
Distinguished AI Engineer
Distinguished AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance (scalability, cost, latency, throughput) for large-scale production AI systems and contributing to the technical vision and roadmap of foundational AI systems. It requires strong engineering and mathematics foundations, expertise in Python/Go/Scala/Java, and experience with cloud platforms and AI technologies like Huggingface, VectorDBs, and PyTorch.
ServeAgentEngineeringMcLean, VA +45w ago8
Senior Lead AI Engineer (Gen AI Platform Services)
Senior Lead AI Engineer role focused on building and optimizing Gen AI platform services, including foundation model training, LLM inference, similarity search, guardrails, evaluation, and governance. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging cloud platforms and AI technologies, and optimizing performance for scalability, cost, latency, and throughput.
ServeAgentEngineeringSan Jose, CA +25w ago8
Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch, and optimizing LLM performance for scalability, cost, latency, and throughput.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Lead AI Engineer(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role focuses on optimizing LLM performance for scalability, cost, latency, and throughput in production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails. This role is part of the Intelligent Foundations and Experiences (IFX) team, aiming to advance AI science and engineering and deploy proprietary solutions.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Lead AI Engineer (LLM Gateway, FM Hosting)
Senior Lead AI Engineer role focused on building and optimizing LLM inference infrastructure (Gateway, FM Hosting) and related AI components like similarity search, guardrails, evaluation, and observability for enterprise-scale AI products at Capital One. The role involves designing, developing, testing, deploying, and supporting these AI software components, with a strong emphasis on improving performance (scalability, cost, latency, throughput) of large-scale production AI systems.
ServeAgentEngineeringMcLean, VA +36w ago8
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on optimizing LLM inference for scalable, cost-effective production AI systems within an enterprise setting. The role involves designing, developing, and deploying AI software components, including foundation model training, inference, similarity search, guardrails, evaluation, and observability, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +26w ago8
Sr. Lead AI Engineer (GenAI Platform)
Sr. Lead AI Engineer focused on building and scaling GenAI platforms, including foundation model training, LLM inference, similarity search, guardrails, evaluation, governance, and observability. The role involves optimizing performance, cost, and latency of large-scale production AI systems using various open-source and cloud technologies.
ServeAgentEngineeringSan Jose, CA +46w ago8
Senior Distinguished Engineer, AI Compute (Remote Eligible)
Senior Distinguished Engineer focused on architecting and building the AI compute infrastructure for Capital One's enterprise machine learning platform. This role involves developing scalable, high-performance systems for diverse AI workloads including LLM pre-training, fine-tuning, inference, and agentic applications, leveraging distributed compute frameworks like Ray and Spark on cloud substrates.
ServePretrainEngineeringSan Francisco, CA +5 · Remote7w ago8
Lead AI/ML Engineer (Platform, kubeflow)
Lead AI/ML Engineer focused on building and optimizing AI platforms and infrastructure, including foundation model training, LLM inference, similarity search, guardrails, and evaluation. The role involves designing, developing, and deploying AI software components, leveraging various AI technologies, and improving the performance of large-scale production AI systems.
ServeAgentEngineeringSan Jose, CA +37w ago8
Lead AI Engineer (Gen AI Platform Services)
Lead AI Engineer role focused on building and optimizing Gen AI Platform Services. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging various AI technologies and optimizing large-scale production AI systems for performance, scalability, cost, and latency.
ServeAgentEngineeringSan Jose, CA +18w ago8
Senior Lead AI Engineer, AI Foundations
This role focuses on designing, developing, testing, deploying, and supporting AI software components for foundational AI systems at Capital One. Key responsibilities include foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role also involves optimizing LLM performance for scalability, cost, latency, and throughput, leveraging technologies like AWS, Huggingface, VectorDBs, and PyTorch. The goal is to build and deploy proprietary AI solutions that deliver value to millions of customers and enhance products with AI capabilities.
ServeAgentEngineeringNew York, NY +3Apr 238
Lead AI Engineer, AI Foundations
Lead AI Engineer focused on building and optimizing AI Foundations, including foundation model training, LLM inference, similarity search, guardrails, evaluation, governance, and observability. The role involves designing, developing, testing, deploying, and supporting AI software components, with a strong emphasis on improving performance (scalability, cost, latency, throughput) of large-scale production AI systems using state-of-the-art LLM optimization techniques. The role also touches on agentic systems through guardrails and similarity search, and model evaluation.
ServeAgentEngineeringNew York, NY +3Apr 238
Senior Lead AI Engineer (Gen AI Platform Services)
Senior Lead AI Engineer role focused on building and scaling Gen AI Platform Services, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves optimizing AI systems for performance, cost, and latency, and contributing to the technical vision for foundational AI systems at Capital One.
ServeAgentEngineeringSan Jose, CA +1Apr 238
Senior Lead AI Engineer
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. The role leverages various AI technologies and requires strong engineering and mathematical foundations.
ServeAgentEngineeringNew York, NY +4Apr 228
Lead AI Engineer
Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems. It requires experience with AI/ML algorithms, programming languages like Python, and cloud platforms, with a focus on deploying scalable and responsible AI solutions.
ServeAgentEngineeringNew York, NY +4Apr 228
Senior Manager, AI Engineering (People Leader) (Gen AI Platform Services)
Senior Manager, AI Engineering (People Leader) for Gen AI Platform Services at Capital One. This role involves overseeing the design, development, testing, deployment, and support of AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The candidate will make build-vs-buy decisions, optimize LLM performance (scalability, cost, latency, throughput), contribute to the technical vision and roadmap of foundational AI systems, and lead/mentor an AI engineering team. Experience with cloud platforms and deploying scalable AI solutions is required.
ServePost-trainEngineeringSan Jose, CA +1Apr 148
Sr. Distinguished AI Engineer
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will invent and introduce LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch. The role involves contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringCambridge, MA +5Apr 108
Lead AI Engineer ( MLX, Gen AI Platform Services, Agentic AI)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. It requires leveraging a broad stack of AI technologies and optimizing LLM performance for scalability, cost, and latency.
ServeAgentEngineeringNew York, NY +4Apr 98
Senior Lead AI Engineer (Gen AI Platform Services, Agentic AI)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, optimization, and agentic AI platforms. The role involves designing, developing, testing, deploying, and supporting AI software components, leveraging various AI technologies, and contributing to the technical vision and roadmap.
ServeAgentEngineeringCambridge, MA +4Apr 68
Sr. Lead AI Engineer (Gen AI Platform Services)
This role focuses on engineering AI-powered products and platforms, specifically within Generative AI. Responsibilities include designing, developing, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role also involves optimizing LLM performance for scalability, cost, latency, and throughput, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringSan Jose, CA +3Apr 68
Lead AI Engineer (AI Foundations)
Lead AI Engineer focused on AI Foundations, responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. The role involves optimizing large-scale production AI systems for performance (scalability, cost, latency, throughput) using various AI technologies and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringCambridge, MA +3Apr 38
Senior Lead AI Engineer,(MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems at Capital One. Responsibilities include designing, developing, testing, deploying, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies and optimizing LLM performance for scalability, cost, and latency.
ServeAgentEngineeringSan Jose, CA +3Apr 38
Senior Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)
Senior Lead AI Engineer role focused on building and scaling Gen AI platform services, including foundation model training, LLM inference, similarity search, guardrails, and model evaluation. The role involves optimizing performance (scalability, cost, latency, throughput) of large-scale production AI systems and contributing to the technical vision for foundational AI systems. Requires strong engineering and AI expertise, with experience in cloud platforms and programming languages like Python.
ServeAgentEngineeringSan Jose, CA +3Apr 38
Lead AI Engineer (MLX, Agentic AI, Gen AI platform Services)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems, including LLM inference, similarity search, guardrails, and model evaluation, with an emphasis on optimizing performance and scalability for enterprise use.
ServeAgentEngineeringNew York, NY +4Apr 38
Senior Lead AI Engineer (FM Hosting, LLM Inference)
Senior Lead AI Engineer focused on LLM inference and hosting infrastructure, optimizing performance, scalability, cost, and latency for large-scale production AI systems. The role involves designing, developing, and deploying AI software components, including foundation model training, inference, similarity search, guardrails, evaluation, governance, and observability, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +3Apr 28
Sr. Lead AI Engineer (AI Foundations)
This role focuses on engineering AI foundations, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, and optimization techniques for scalability, cost, latency, and throughput. It involves leveraging AI technologies and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringNew York, NY +3Apr 18
Lead AI Engineer (AI Foundations)
Lead AI Engineer focused on building and optimizing foundational AI systems, including LLM inference, similarity search, guardrails, and model evaluation, to enhance customer and associate experiences within a large enterprise.
ServeAgentEngineeringNew York, NY +4Apr 18
Distinguished AI Engineer
This role focuses on engineering and deploying AI-powered products and foundational AI systems, including large language model inference, optimization, and related components like similarity search and guardrails. The primary focus is on the serving and optimization of AI models in production, with a secondary involvement in agentic systems.
ServeAgentEngineeringSan Francisco, CA +4Mar 308
Senior Distinguished AI Engineer
Senior Distinguished AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves inventing and introducing state-of-the-art LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems, and contributing to the technical vision and roadmap of foundational AI systems. Requires strong engineering and mathematics foundation, expertise in hardware, software, and AI, and experience with cloud platforms and programming languages like Python, Go, Scala, or Java.
ServeAgentEngineeringSan Francisco, CA +5Mar 258
Lead AI Engineer (MLX)
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, latency, and throughput using various AI technologies and cloud platforms.
ServeAgentEngineeringNew York, NY +4Mar 258
Lead AI Engineer
Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves leveraging AI technologies, inventing LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems, and contributing to the technical vision and roadmap of foundational AI systems. Requires strong engineering and mathematics foundation, expertise in hardware, software, and AI, and experience with cloud platforms and AI/ML algorithms.
ServeAgentEngineeringMcLean, VA +3Mar 208
Senior Lead AI Engineer
Senior Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems within an enterprise setting. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, and observability. The role emphasizes optimizing LLM performance for scalability, cost, latency, and throughput, and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringSan Jose, CA +3Mar 208
Senior Lead AI Engineer
Senior Lead AI Engineer role focused on designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves inventing and introducing state-of-the-art LLM optimization techniques to improve the performance of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails. This position is key to bringing AI capabilities to life at Capital One, empowering teams across the company and delivering value to millions of customers.
ServeAgentEngineeringNew York, NY +3Mar 188
Lead AI Engineer
Lead AI Engineer role focused on building and deploying AI-powered products and foundational AI systems within a fintech company. Responsibilities include designing, developing, testing, deploying, and supporting AI software components such as foundation model training, LLM inference, similarity search, guardrails, model evaluation, and observability. The role emphasizes optimizing LLM performance (scalability, cost, latency, throughput) and contributing to the technical vision and roadmap for AI systems. Requires experience with AI/ML algorithms, programming languages like Python, and cloud platforms, with a focus on deploying scalable and responsible AI solutions.
ServeAgentEngineeringSan Jose, CA +3Feb 278
Senior Lead AI Engineer (Gen AI Platform Services)
This role focuses on engineering and optimizing AI software components, particularly large language model inference and related platform services, to improve performance, scalability, cost, and latency in a production environment. It involves designing, developing, testing, deploying, and supporting these components, leveraging various AI technologies and cloud platforms.
ServeAgentEngineeringSan Jose, CA +2Feb 198
Senior Lead AI Engineer (GenAI Platform Services)
Senior Lead AI Engineer responsible for designing, developing, testing, deploying, and supporting AI software components including foundation model training, large language model inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The role involves optimizing LLM performance for scalability, cost, and latency, and contributing to the technical vision and roadmap of foundational AI systems.
ServeAgentEngineeringSan Jose, CA +1Feb 188
Senior Lead AI Engineer (FM Hosting)
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will invent and introduce state-of-the-art LLM optimization techniques to improve the performance of large-scale production AI systems and contribute to the technical vision and roadmap of foundational AI systems. The role involves leveraging various AI technologies and optimizing for scalability, cost, latency, and throughput.
ServeAgentEngineeringNew York, NY +3Feb 98
Lead AI Engineer (FM Hosting, LLM Inference)
Lead AI Engineer focused on LLM inference and optimization for AI-powered products within a large enterprise. The role involves designing, developing, and deploying AI software components, with a strong emphasis on improving the performance, scalability, cost, and latency of production AI systems.
ServeAgentEngineeringNew York, NY +3Feb 48
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Senior Lead AI Engineer focused on AI Foundations, LLM Core, and Agentic AI. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, governance, and observability. It also requires inventing and introducing LLM optimization techniques to improve performance (scalability, cost, latency, throughput) of large-scale production AI systems. The role leverages AI technologies like AWS Ultraclusters, Huggingface, VectorDBs, Nemo Guardrails, and PyTorch.
ServeAgentEngineeringCambridge, MA +4Feb 38
Sr. Lead AI Engineer
This role focuses on designing, developing, testing, deploying, and supporting AI software components, including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. The engineer will also invent and introduce LLM optimization techniques to improve the performance (scalability, cost, latency, throughput) of large-scale production AI systems, leveraging technologies like AWS Ultraclusters, Huggingface, VectorDBs, and Nemo Guardrails. The role is within the Intelligent Foundations and Experiences (IFX) team, aiming to advance AI science and engineering and deploy proprietary solutions.
ServeAgentEngineeringMcLean, VA +3Feb 38
Senior Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Senior Lead AI Engineer role focused on AI Foundations, LLM Core, and Agentic AI. Responsibilities include designing, developing, testing, deploying, and supporting AI software components like foundation model training, LLM inference, similarity search, guardrails, model evaluation, and governance. The role involves optimizing LLM performance for scalability, cost, and latency, and contributing to the technical vision for foundational AI systems. It requires experience with cloud platforms and AI/ML algorithms, particularly LLM inference, similarity search, vector databases, and guardrails.
ServeAgentEngineeringCambridge, MA +3Feb 38
Lead AI Engineer (AI Foundations, LLM Core and Agentic AI)
Lead AI Engineer focused on AI Foundations, LLM Core, and Agentic AI. The role involves designing, developing, testing, deploying, and supporting AI software components including foundation model training, LLM inference, similarity search, guardrails, model evaluation, experimentation, governance, and observability. Key responsibilities include optimizing LLM performance for scalability, cost, latency, and throughput using various AI technologies and techniques.
ServeAgentEngineeringSan Jose, CA +3Feb 38