What AI roles is Weights & Biases hiring for?

Weights & Biases currently has 39 active AI-related roles in our index. The most common open titles are: Account Solution Architect (5), Solutions Architect - HPC/AI/ML (2), Account Solution Architect - Financial Services, Principal Engineer - Perf and Benchmarking, Principal Engineer, Cluster Orchestration. Most positions are in Engineering and Product.

What stage of AI development does Weights & Biases focus on?

Weights & Biases's active AI hiring is concentrated in: serving infrastructure (49%), agents (38%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

Where is Weights & Biases hiring AI talent?

Weights & Biases is hiring AI talent in: United States (36 roles), United Kingdom (1 role), Singapore (1 role), Canada (1 role).

What technologies does Weights & Biases's AI team work with?

Job postings at Weights & Biases most frequently reference: model serving, inference infra, llm observability, agent orchestration, evals.

How many AI roles has Weights & Biases posted recently?

In the past 30 days, Weights & Biases has posted 9 new AI-related roles.

Weights & Biases

Data AI · ML experiment tracking

Weights & Biases currently has 32 active AI-related job listings. The majority of these roles, 59%, are focused on serving infrastructure, with an additional 31% dedicated to agents. Engineering is the most frequent function, with 29 positions. The company is actively hiring for roles involving inference infrastructure, model serving, and agent orchestration. Over the last 30 days, Weights & Biases has added 8 new AI roles, representing a 300% increase compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 19 active AI roles, up 64% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $92k–$341k (avg $209k).

Hiring

19 / 21

Momentum (4w)

↑+34 +64%

87 opens last 4w · 53 prior 4w

Salary range · avg $209k

$92k–$341k

USD · disclosed roles only

Tracked since

Aug '24

last role 4w ago

Hiring velocityscroll left for older weeks

1 new role

Jul 15

1 new role

Aug 19

1 new role

Oct 21

1 new role

Mar 17

1 new role

Apr 7

4 new roles

2 new roles

4 new roles

May 12

2 new roles

Jun 16

1 new role

Jul 7

1 new role

Aug 11

1 new role

2 new roles

Sep 8

1 new role

2 new roles

1 new role

2 new roles

Oct 6

2 new roles

1 new role

Nov 3

6 new roles

2 new roles

Dec 1

2 new roles

6 new roles

1 new role

6 new roles

Jan 5

7 new roles

11 new roles

15 new roles

14 new roles

Feb 2

12 new roles

4 new roles

8 new roles

7 new roles

Mar 2

11 new roles

13 new roles

10 new roles

15 new roles

16 new roles

Apr 6

10 new roles

20 new roles

14 new roles

24 new roles

May 4

12 new roles

11 new roles

6 new roles

25 new roles

Jun 1

30 new roles

24 new roles

8 new roles

Jobs (25)

25 AI · 269 total active

Title	Stage	Function	Location	First seen	AI score
VP of Product, Research and Training Infrastructure VP of Product for Research and Training Infrastructure at an AI cloud provider. This role owns the product strategy and engineering execution for services powering AI research labs, focusing on specialized orchestration, evaluation, and iteration tools for massive-scale pre-training and post-training. Key responsibilities include evolving orchestration tools (SUNK), developing automated training-based evaluation frameworks, and building infrastructure for RL/RLHF pipelines. Requires deep knowledge of HPC, distributed training, and supporting frontier model research.	PretrainPost-train	Product	Bellevue, WA +4	Mar 23	9
Staff Specialist Field Engineer, Robotics Staff Specialist Field Engineer for Robotics at CoreWeave, focusing on deploying AI/ML solutions for robotics customers. The role involves establishing the robotics vertical, defining engagement strategies, leading customer engagements from scoping to deployment, and building/iterating on customer-facing applications using CoreWeave's AI platform. Requires deep understanding of robotics systems, ML capabilities, and the ability to translate field observations into product signals.	ShipAgent	Engineering	San Francisco, CA	6d ago	8
Account Solution Architect This role is for an Account Solutions Architect at CoreWeave, an AI-focused cloud provider. The architect will be the technical partner for prospective customers, designing demos, leading PoCs, and advising on best practices for AI workloads, including training, fine-tuning, evaluating, and deploying deep learning models and LLM-powered applications. The role requires strong Python skills, experience with major cloud platforms, and the ability to solve complex technical problems.	AgentPost-train	Engineering	New York, NY	2w ago	8
Principal Solution Specialist, Infrastructure This role focuses on bringing CoreWeave's AI developer services, such as MLOps platforms and LLM observability tools, to market. It involves defining commercial and technical strategies, driving adoption with early customers, and translating field insights into product roadmap requirements. The role requires deep expertise in the ML development lifecycle, LLM application patterns, and MLOps ecosystem, with a focus on experiment tracking, model lifecycle governance, and observability.	AgentEval Gate	Product	Bellevue, WA +5	2w ago	8
Staff AI Security Engineer Staff AI Security Engineer to define and operationalize security across CoreWeave's AI ecosystem, focusing on secure-by-default foundations for AI development, agentic workflows, and enterprise AI adoption. The role involves building secure infrastructure, developing AI security policies, implementing guardrails for agentic systems, leading secure adoption of AI tools, and conducting adversarial testing.	AgentServe	Engineering	Bellevue, WA +4	8w ago	8
Principal Engineer - Perf and Benchmarking Principal Engineer role focused on leading the Benchmarking & Performance team at CoreWeave, a cloud provider for AI. The role involves defining strategy, leading end-to-end MLPerf submissions (Training & Inference), designing and implementing a Kubernetes-native benchmarking service for latency and throughput, and building CI/CD pipelines for scale. It requires deep expertise in distributed systems, GPU performance, model-serving stacks, and Kubernetes, with a focus on achieving industry-leading performance data and publications.	ServeEval Gate	Engineering	Bellevue, WA +1	Dec '25	8
Senior Software Engineer, Applied AI Senior Software Engineer to design and build production-grade, full-stack AI-native analytics platforms and first-party applications that embed governed data directly into operational workflows. This role involves developing AI-enabled user experiences, scalable backend services, and intuitive interfaces, integrating AI/LLM capabilities into real-world applications, and working across the stack from React frontends to backend services on Kubernetes.	ShipServe	Engineering	Bellevue, WA +2	2w ago	7
Account Solution Architect Account Solutions Architect for a financial services customer portfolio, focusing on AI/ML and LLM workloads on CoreWeave's cloud platform. Responsibilities include deepening platform adoption, designing end-to-end solutions, guiding customers on AI lifecycle, and resolving technical challenges. Requires Python proficiency, experience with deep learning models, LLM applications, and financial services customer engagement.	AgentPost-train	Engineering	Toronto, ON	2w ago	7
Account Solution Architect - Financial Services Account Solutions Architect for Financial Services at CoreWeave, a cloud provider focused on AI workloads. This role involves being a technical partner to existing financial services customers, helping them deepen platform adoption, identify expansion opportunities, and scale their AI workloads in production. The role requires understanding customer needs around model development, deployment, governance, and infrastructure efficiency, and working with sales, product, and engineering teams to ensure customer success.	AgentServe	Engineering	New York, NY	2w ago	7
Account Solution Architect Account Solutions Architect for financial services customers, focusing on scaling AI/ML workloads on CoreWeave's cloud platform. Responsibilities include technical partnership, platform adoption, identifying expansion opportunities, and advising on model development, deployment, and infrastructure efficiency for AI/ML-intensive organizations.	AgentServe	Engineering	San Francisco, CA	3w ago	7
Account Solution Architect Account Solutions Architect for financial services customers, focusing on deepening platform adoption for AI/ML workloads, identifying expansion opportunities, and serving as a trusted advisor for production AI scaling. Requires hands-on experience with training, fine-tuning, evaluating, and deploying deep learning models and LLM-powered applications.	AgentServe	Engineering	Sunnyvale, CA	3w ago	7
Account Solution Architect Account Solution Architect for CoreWeave, focusing on AI/ML infrastructure and MLOps solutions for customers in Northern EMEA. The role involves technical discovery, solution design, proof-of-concept engagements, and acting as a customer advocate to internal teams. Requires strong knowledge of ML training/inference, MLOps platforms, and underlying infrastructure like GPUs, networking, and Kubernetes.	ServeData	Engineering	London, United Kingdom	3w ago	7
Staff Software Engineer, Inference Staff Software Engineer on the Inference Platform Team at CoreWeave, focusing on building and operating a Kubernetes-native inference platform for AI workloads. The role involves technical leadership in architecture, performance optimization (latency, throughput, GPU utilization), and system reliability for low-latency, high-throughput systems at massive scale, with deep work in distributed systems and Kubernetes infrastructure.	Serve	Engineering	Bellevue, WA +1	7w ago	7
Staff Technical Program Manager - Cluster Orchestration & Applied Training Staff Technical Program Manager to lead cross-functional programs for AI/ML Platform Services, focusing on Cluster Orchestration (scheduling, launching, managing AI workloads) and Applied Training (enabling researchers to use infrastructure for pre-training, fine-tuning, RL, evaluations). The role involves partnering with engineering, product, and research teams to improve workload execution and user interaction with training platforms, driving delivery across various AI training workflows and ensuring successful launches and operational ownership.	ServePost-train	Engineering	Bellevue, WA	7w ago	7
Principal Engineer, Cluster Orchestration CoreWeave is seeking a Principal Engineer to lead the design and evolution of their AI infrastructure's cluster orchestration systems, including Slurm, Kubernetes, and SUNK. This role involves defining long-term architecture, solving scaling problems, and ensuring the reliability and efficiency of GPU resource utilization for AI training and inference workloads.	Serve	Engineering	Bellevue, WA +1	Feb 27	7
Staff Product Manager, Insights Staff Product Manager for CoreWeave's Insights team, focusing on developing AI-powered observability experiences for AI workloads. The role involves defining strategy, roadmaps, and metrics for dashboards, alerts, and AI-driven insights to help customers understand performance, reliability, and cost in their cloud environments. Key responsibilities include translating telemetry into actionable insights and driving proactive surfacing of information, particularly for cost optimization and workload efficiency.	AgentEval Gate	Product	Bellevue, WA +5 · Remote	Feb 18	7
Senior Software Engineer, Observability Insights Senior Software Engineer to lead development of agentic interfaces and product experiences for AI system observability, focusing on multi-tenant APIs, Grafana, and tool servers. Requires experience in backend systems, distributed APIs, reliability engineering, and agentic applications/LLM features.	AgentServe	Engineering	New York, NY +1	Feb 2	7
Solutions Architect - HPC/AI/ML Solutions Architect role focused on supporting customers running AI/ML workloads on CoreWeave's HPC cloud infrastructure, with an emphasis on AI/ML inference. Responsibilities include technical customer contact, solution design, proof of concept development, and workload optimization. Requires expertise in cloud computing, distributed systems, AI/ML inference, NVIDIA GPUs, and Kubernetes.	Serve	Engineering	Singapore	Jan 29	7
Staff Software Engineer, Applied Training CoreWeave is seeking a Staff Software Engineer to join their Applied Training team. This role will focus on building and improving their Kubernetes-native research cluster platform and sandbox client for agentic training and evaluation. The goal is to provide AI researchers with the infrastructure needed to train models efficiently, abstracting away operational complexities. Responsibilities include contributing to the roadmap, designing and building cluster experiences, owning the Python SDK for agentic workflows, and documenting training frameworks. The ideal candidate has extensive experience in distributed systems, ML infrastructure, or developer platforms, with strong Kubernetes expertise and familiarity with AI training and agentic workflows.	ServeAgent	Engineering	Bellevue, WA +2	Jan 23	7
Senior Software Engineer I, Inference CoreWeave is seeking a Senior Software Engineer to own and improve their Kubernetes-native inference platform, focusing on latency, throughput, and reliability. The role involves leading design, implementing optimizations, strengthening incident posture, and mentoring junior engineers. Requires experience with distributed systems, Kubernetes, and inference internals.	Serve	Engineering	Bellevue, WA +1	Jan 23	7
Sr. Software Engineer - Perf and Benchmarking Senior Software Engineer focused on performance and benchmarking of AI infrastructure, including Kubernetes-native services, MLPerf runs, and model-serving stacks. The role involves building and improving services to measure latency, throughput, and cost, and ensuring reproducible benchmarking processes.	ServeEval Gate	Engineering	Bellevue, WA +1	Dec '25	7
Software Engineer, Inference AI/ML Software Engineer focused on improving the latency, reliability, and cost of model serving on a GPU platform, working with services like Triton, vLLM, and TensorRT-LLM.	Serve	Engineering	Bellevue, WA +1	Oct '25	7
Senior Software Engineer II, Inference Senior Software Engineer II focused on owning and optimizing CoreWeave's Kubernetes-native inference platform to meet strict P99 SLAs at scale. Responsibilities include leading design reviews, implementing advanced optimizations for latency and throughput, strengthening incident posture, and mentoring junior engineers. Requires strong experience in distributed systems, Python/Go, networked systems performance, Kubernetes, and ML inference internals.	Serve	Engineering	Bellevue, WA +1	Sep '25	7
Solutions Architect - HPC/AI/ML Solutions Architect role focused on AI/ML inference workloads on high-performance compute (HPC) infrastructure, primarily using Kubernetes and NVIDIA GPUs. The role involves customer technical contact, solution design, proof of concept, workload optimization, and providing feedback to product teams.	Serve	Engineering	Bellevue, WA +4	Oct '24	7
Senior Systems Engineer, OS Automation Senior Systems Engineer focused on automating and scaling Linux OS and Kernel build pipelines, with a strong emphasis on integrating AI/ML technologies like LLMs, RAG, and predictive modeling to create AI-native infrastructure, smart CI/CD, auto-remediation, and predictive regression detection.	ServeAgent	Engineering	Bellevue, WA +3	Aug '24	7

Frequently asked questions

What AI roles is Weights & Biases hiring for?
Weights & Biases currently has 39 active AI-related roles in our index. The most common open titles are: Account Solution Architect (5), Solutions Architect - HPC/AI/ML (2), Account Solution Architect - Financial Services, Principal Engineer - Perf and Benchmarking, Principal Engineer, Cluster Orchestration. Most positions are in Engineering and Product.
What stage of AI development does Weights & Biases focus on?
Weights & Biases's active AI hiring is concentrated in: serving infrastructure (49%), agents (38%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Where is Weights & Biases hiring AI talent?
Weights & Biases is hiring AI talent in: United States (36 roles), United Kingdom (1 role), Singapore (1 role), Canada (1 role).
What technologies does Weights & Biases's AI team work with?
Job postings at Weights & Biases most frequently reference: model serving, inference infra, llm observability, agent orchestration, evals.
How many AI roles has Weights & Biases posted recently?
In the past 30 days, Weights & Biases has posted 9 new AI-related roles.

Title

Stage

Function

Location

First seen

AI score

VP of Product, Research and Training Infrastructure

VP of Product for Research and Training Infrastructure at an AI cloud provider. This role owns the product strategy and engineering execution for services powering AI research labs, focusing on specialized orchestration, evaluation, and iteration tools for massive-scale pre-training and post-training. Key responsibilities include evolving orchestration tools (SUNK), developing automated training-based evaluation frameworks, and building infrastructure for RL/RLHF pipelines. Requires deep knowledge of HPC, distributed training, and supporting frontier model research.

PretrainPost-train

Product

Bellevue, WA +4

Mar 23

Staff Specialist Field Engineer, Robotics

Staff Specialist Field Engineer for Robotics at CoreWeave, focusing on deploying AI/ML solutions for robotics customers. The role involves establishing the robotics vertical, defining engagement strategies, leading customer engagements from scoping to deployment, and building/iterating on customer-facing applications using CoreWeave's AI platform. Requires deep understanding of robotics systems, ML capabilities, and the ability to translate field observations into product signals.

ShipAgent

Engineering

San Francisco, CA

6d ago

Account Solution Architect

This role is for an Account Solutions Architect at CoreWeave, an AI-focused cloud provider. The architect will be the technical partner for prospective customers, designing demos, leading PoCs, and advising on best practices for AI workloads, including training, fine-tuning, evaluating, and deploying deep learning models and LLM-powered applications. The role requires strong Python skills, experience with major cloud platforms, and the ability to solve complex technical problems.

AgentPost-train

Engineering

New York, NY

2w ago

Principal Solution Specialist, Infrastructure

This role focuses on bringing CoreWeave's AI developer services, such as MLOps platforms and LLM observability tools, to market. It involves defining commercial and technical strategies, driving adoption with early customers, and translating field insights into product roadmap requirements. The role requires deep expertise in the ML development lifecycle, LLM application patterns, and MLOps ecosystem, with a focus on experiment tracking, model lifecycle governance, and observability.

AgentEval Gate

Product

Bellevue, WA +5

2w ago

Staff AI Security Engineer

Staff AI Security Engineer to define and operationalize security across CoreWeave's AI ecosystem, focusing on secure-by-default foundations for AI development, agentic workflows, and enterprise AI adoption. The role involves building secure infrastructure, developing AI security policies, implementing guardrails for agentic systems, leading secure adoption of AI tools, and conducting adversarial testing.

AgentServe

Engineering

Bellevue, WA +4

8w ago

Principal Engineer - Perf and Benchmarking

Principal Engineer role focused on leading the Benchmarking & Performance team at CoreWeave, a cloud provider for AI. The role involves defining strategy, leading end-to-end MLPerf submissions (Training & Inference), designing and implementing a Kubernetes-native benchmarking service for latency and throughput, and building CI/CD pipelines for scale. It requires deep expertise in distributed systems, GPU performance, model-serving stacks, and Kubernetes, with a focus on achieving industry-leading performance data and publications.

ServeEval Gate

Engineering

Bellevue, WA +1

Dec '25

Senior Software Engineer, Applied AI

Senior Software Engineer to design and build production-grade, full-stack AI-native analytics platforms and first-party applications that embed governed data directly into operational workflows. This role involves developing AI-enabled user experiences, scalable backend services, and intuitive interfaces, integrating AI/LLM capabilities into real-world applications, and working across the stack from React frontends to backend services on Kubernetes.

ShipServe

Engineering

Bellevue, WA +2

2w ago

Account Solution Architect

Account Solutions Architect for a financial services customer portfolio, focusing on AI/ML and LLM workloads on CoreWeave's cloud platform. Responsibilities include deepening platform adoption, designing end-to-end solutions, guiding customers on AI lifecycle, and resolving technical challenges. Requires Python proficiency, experience with deep learning models, LLM applications, and financial services customer engagement.

AgentPost-train

Engineering

Toronto, ON

2w ago

Account Solution Architect - Financial Services

Account Solutions Architect for Financial Services at CoreWeave, a cloud provider focused on AI workloads. This role involves being a technical partner to existing financial services customers, helping them deepen platform adoption, identify expansion opportunities, and scale their AI workloads in production. The role requires understanding customer needs around model development, deployment, governance, and infrastructure efficiency, and working with sales, product, and engineering teams to ensure customer success.

AgentServe

Engineering

New York, NY

2w ago

Account Solution Architect

Account Solutions Architect for financial services customers, focusing on scaling AI/ML workloads on CoreWeave's cloud platform. Responsibilities include technical partnership, platform adoption, identifying expansion opportunities, and advising on model development, deployment, and infrastructure efficiency for AI/ML-intensive organizations.

AgentServe

Engineering

San Francisco, CA

3w ago

Account Solution Architect

Account Solutions Architect for financial services customers, focusing on deepening platform adoption for AI/ML workloads, identifying expansion opportunities, and serving as a trusted advisor for production AI scaling. Requires hands-on experience with training, fine-tuning, evaluating, and deploying deep learning models and LLM-powered applications.

AgentServe

Engineering

Sunnyvale, CA

3w ago

Account Solution Architect

Account Solution Architect for CoreWeave, focusing on AI/ML infrastructure and MLOps solutions for customers in Northern EMEA. The role involves technical discovery, solution design, proof-of-concept engagements, and acting as a customer advocate to internal teams. Requires strong knowledge of ML training/inference, MLOps platforms, and underlying infrastructure like GPUs, networking, and Kubernetes.

ServeData

Engineering

London, United Kingdom

3w ago

Staff Software Engineer, Inference

Staff Software Engineer on the Inference Platform Team at CoreWeave, focusing on building and operating a Kubernetes-native inference platform for AI workloads. The role involves technical leadership in architecture, performance optimization (latency, throughput, GPU utilization), and system reliability for low-latency, high-throughput systems at massive scale, with deep work in distributed systems and Kubernetes infrastructure.

Serve

Engineering

Bellevue, WA +1

7w ago

Staff Technical Program Manager - Cluster Orchestration & Applied Training

Staff Technical Program Manager to lead cross-functional programs for AI/ML Platform Services, focusing on Cluster Orchestration (scheduling, launching, managing AI workloads) and Applied Training (enabling researchers to use infrastructure for pre-training, fine-tuning, RL, evaluations). The role involves partnering with engineering, product, and research teams to improve workload execution and user interaction with training platforms, driving delivery across various AI training workflows and ensuring successful launches and operational ownership.

ServePost-train

Engineering

Bellevue, WA

7w ago

Principal Engineer, Cluster Orchestration

CoreWeave is seeking a Principal Engineer to lead the design and evolution of their AI infrastructure's cluster orchestration systems, including Slurm, Kubernetes, and SUNK. This role involves defining long-term architecture, solving scaling problems, and ensuring the reliability and efficiency of GPU resource utilization for AI training and inference workloads.

Serve

Engineering

Bellevue, WA +1

Feb 27

Staff Product Manager, Insights

Staff Product Manager for CoreWeave's Insights team, focusing on developing AI-powered observability experiences for AI workloads. The role involves defining strategy, roadmaps, and metrics for dashboards, alerts, and AI-driven insights to help customers understand performance, reliability, and cost in their cloud environments. Key responsibilities include translating telemetry into actionable insights and driving proactive surfacing of information, particularly for cost optimization and workload efficiency.

AgentEval Gate

Product

Bellevue, WA +5 · Remote

Feb 18

Senior Software Engineer, Observability Insights

Senior Software Engineer to lead development of agentic interfaces and product experiences for AI system observability, focusing on multi-tenant APIs, Grafana, and tool servers. Requires experience in backend systems, distributed APIs, reliability engineering, and agentic applications/LLM features.

AgentServe

Engineering

New York, NY +1

Feb 2

Solutions Architect - HPC/AI/ML

Solutions Architect role focused on supporting customers running AI/ML workloads on CoreWeave's HPC cloud infrastructure, with an emphasis on AI/ML inference. Responsibilities include technical customer contact, solution design, proof of concept development, and workload optimization. Requires expertise in cloud computing, distributed systems, AI/ML inference, NVIDIA GPUs, and Kubernetes.

Serve

Engineering

Singapore

Jan 29

Staff Software Engineer, Applied Training

CoreWeave is seeking a Staff Software Engineer to join their Applied Training team. This role will focus on building and improving their Kubernetes-native research cluster platform and sandbox client for agentic training and evaluation. The goal is to provide AI researchers with the infrastructure needed to train models efficiently, abstracting away operational complexities. Responsibilities include contributing to the roadmap, designing and building cluster experiences, owning the Python SDK for agentic workflows, and documenting training frameworks. The ideal candidate has extensive experience in distributed systems, ML infrastructure, or developer platforms, with strong Kubernetes expertise and familiarity with AI training and agentic workflows.

ServeAgent

Engineering

Bellevue, WA +2

Jan 23

Senior Software Engineer I, Inference

CoreWeave is seeking a Senior Software Engineer to own and improve their Kubernetes-native inference platform, focusing on latency, throughput, and reliability. The role involves leading design, implementing optimizations, strengthening incident posture, and mentoring junior engineers. Requires experience with distributed systems, Kubernetes, and inference internals.

Serve

Engineering

Bellevue, WA +1

Jan 23

Sr. Software Engineer - Perf and Benchmarking

Senior Software Engineer focused on performance and benchmarking of AI infrastructure, including Kubernetes-native services, MLPerf runs, and model-serving stacks. The role involves building and improving services to measure latency, throughput, and cost, and ensuring reproducible benchmarking processes.

ServeEval Gate

Engineering

Bellevue, WA +1

Dec '25

Software Engineer, Inference AI/ML

Software Engineer focused on improving the latency, reliability, and cost of model serving on a GPU platform, working with services like Triton, vLLM, and TensorRT-LLM.

Serve

Engineering

Bellevue, WA +1

Oct '25

Senior Software Engineer II, Inference

Senior Software Engineer II focused on owning and optimizing CoreWeave's Kubernetes-native inference platform to meet strict P99 SLAs at scale. Responsibilities include leading design reviews, implementing advanced optimizations for latency and throughput, strengthening incident posture, and mentoring junior engineers. Requires strong experience in distributed systems, Python/Go, networked systems performance, Kubernetes, and ML inference internals.

Serve

Engineering

Bellevue, WA +1

Sep '25

Solutions Architect - HPC/AI/ML

Solutions Architect role focused on AI/ML inference workloads on high-performance compute (HPC) infrastructure, primarily using Kubernetes and NVIDIA GPUs. The role involves customer technical contact, solution design, proof of concept, workload optimization, and providing feedback to product teams.

Serve

Engineering

Bellevue, WA +4

Oct '24

Senior Systems Engineer, OS Automation

Senior Systems Engineer focused on automating and scaling Linux OS and Kernel build pipelines, with a strong emphasis on integrating AI/ML technologies like LLMs, RAG, and predictive modeling to create AI-native infrastructure, smart CI/CD, auto-remediation, and predictive regression detection.

ServeAgent

Engineering

Bellevue, WA +3

Aug '24