What AI roles is Baseten hiring for?

Baseten currently has 26 active AI-related roles in our index. The most common open titles are: AI Solutions Engineer, Applied AI Inference Engineer, Data Engineer, Engineering Manager - Forward Deployed Engineering (LLM), Engineering Manager - Model Performance. Most positions are in Engineering and Research.

What stage of AI development does Baseten focus on?

Baseten's active AI hiring is concentrated in: serving infrastructure (73%), post-training (12%), agents (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

Where is Baseten hiring AI talent?

Baseten is hiring AI talent in: United States (26 roles).

What technologies does Baseten's AI team work with?

Job postings at Baseten most frequently reference: model serving, inference infra, llm observability, agent orchestration, tool use.

How many AI roles has Baseten posted recently?

In the past 30 days, Baseten has posted 2 new AI-related roles.

Baseten — AI hiring signals

Baseten currently has 26 active AI-related job listings. The majority of these roles, 69%, are focused on serving infrastructure. The dominant function is Engineering, with 24 roles, and hiring is concentrated in the United States. Frequent tech tags include model serving, inference infrastructure, and LLM observability, suggesting a focus on the operational aspects of AI deployment. In the last 30 days, Baseten posted 3 new AI roles, a 25% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 22 active AI roles, up 13% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $300k.

Hiring

22 / 22

Momentum (4w)

↑+2 +13%

18 opens last 4w · 16 prior 4w

Salary range

$300k

USD · disclosed roles only

Tracked since

Mar '24

last role 6w ago

Hiring velocityscroll left for older weeks

2 new roles

Mar 25

1 new role

Jul 8

1 new role

Sep 9

1 new role

Feb 3

1 new role

Mar 3

1 new role

Apr 7

1 new role

Jul 14

1 new role

Aug 18

2 new roles

1 new role

Sep 1

2 new roles

Oct 6

1 new role

Dec 8

1 new role

Jan 5

1 new role

7 new roles

Feb 23

5 new roles

Mar 2

1 new role

6 new roles

1 new role

3 new roles

2 new roles

Apr 6

2 new roles

4 new roles

3 new roles

May 4

6 new roles

3 new roles

6 new roles

Jun 1

7 new roles

1 new role

4 new roles

Jobs (80)

23 AI · 63 total active

Title	Stage	Function	Location	First seen	AI score
Post-Training Research Engineer Baseten is seeking a Post-Training Research Engineer to build in-house tooling for post-training AI models at scale. This role involves deep technical dives into ML techniques, distributed computing, and systems-level concepts to support customer custom models, which are critical for Baseten's inference platform.	Post-train	Engineering	San Francisco, CA	Mar 23	9
Post-Training Applied Researcher Post-training researcher focused on fine-tuning open-source LLMs for specific customer tasks using RL and reward engineering. Involves building training pipelines, environments, and evals, and working with customer data to improve models that reach millions of users.	Post-trainAgent	Research	San Francisco, CA	Mar 17	9
Post-Training Research Scientist Research Scientist focused on post-training methodology and performant inference, with a significant portion dedicated to pure research and the remainder to applied research informing the company's platform and customer needs. The role involves designing and executing experiments, publishing at top venues, and collaborating with engineering teams to translate research into production systems.	Post-trainServe	Research	San Francisco, CA	Mar 17	9
Software Engineer - GPU Kernels Software Engineer focused on optimizing GPU kernels for ML inference, including matrix multiplications, attention mechanisms, and quantization, using CUDA and PTX assembly.	Serve	Engineering	San Francisco, CA	Jul '25	9
Engineering Manager - Forward Deployed Engineering (LLM) Engineering Manager for Forward Deployed Engineering team focused on building, scaling, and optimizing LLM inference workloads for Baseten customers. This role involves hands-on technical ownership, team leadership, and collaboration with product and infrastructure teams to ensure best-in-class performance, reliability, and cost efficiency of AI applications on Baseten's platform. The role contributes to the core codebase and drives feature roadmap, acting as a player-coach.	ServeAgent	Engineering	San Francisco, CA	7w ago	8
Manager, Solutions Architect Manager for a Solutions Architect team focused on enabling customers to deploy and optimize AI/ML models, particularly LLMs, on Baseten's inference platform. The role involves leadership, technical guidance, customer discovery, and ensuring high performance, reliability, and cost efficiency of AI applications in production.	Serve	Engineering	San Francisco, CA	7w ago	8
Software Engineer - Voice AI (Inference Runtime) Software Engineer focused on building and optimizing the inference runtime for Voice AI models, including state-of-the-art open-source models. The role involves developing large-scale, real-time infrastructure for multi-model voice agents, reducing latency, increasing throughput, and improving GPU efficiency. It also includes designing iteration loops for voice model customization and customization.	ServeAgent	Engineering	San Francisco, CA	Apr 23	8
Software Engineer - Model APIs Software Engineer role focused on optimizing and operating Model APIs for AI inference, involving distributed systems, model serving, and developer experience. The role emphasizes performance improvements, structured outputs, tool/function calling, and multi-modal serving.	ServeAgent	Engineering	San Francisco, CA	Oct '25	8
Engineering Manager - Model Performance Engineering Manager for Model Performance at Baseten, a company providing inference infrastructure for AI companies. The role involves leading a team of engineers to optimize ML model inference and performance, focusing on production-level AI/ML solutions and scaling large models. Requires a strong engineering background, leadership experience, and expertise in ML performance optimization, with hands-on work in areas like TensorRT, PyTorch, and CUDA.	Serve	Engineering	San Francisco, CA	Sep '24	8
Software Engineer - Model Performance Software Engineer focused on ML performance for LLM inference, optimizing techniques like quantization and speculative decoding, and debugging ML performance issues in libraries like TensorRT and PyTorch.	Serve	Engineering	San Francisco, CA	Mar '24	8
Product Manager, Developer Experience Product Manager for Developer Experience at Baseten, a company providing AI inference infrastructure. The role focuses on owning the end-to-end developer journey for deploying and iterating on models, including CLI/SDKs, console, onboarding, deployment lifecycle, and multi-model composition for agents and applications. The goal is to make Baseten synonymous with great developer experience and effortless model deployment.	ServeAgent	Product	San Francisco, CA	2w ago	7
Software Engineer- BIS (Baseten Inference Stack) Software Engineer for Baseten's Inference Stack team, focusing on building and operating the distributed runtime for large-scale LLM inference. The role involves working across the stack from developer experience to low-level infrastructure, ensuring performance, scalability, and reliability of AI model deployments.	Serve	Engineering	San Francisco, CA	4w ago	7
Solution Architect (AI/LLM Inference) Solution Architect role focused on AI/LLM inference, partnering with Sales and customers to design and deploy technical solutions. Responsibilities include customer discovery, technical scoping, leading demos, managing deployments, and driving POC execution. Requires an AI/ML background and customer-facing communication skills, with the ability to script and prototype.	Serve	Engineering	San Francisco, CA	7w ago	7
Applied AI Inference Engineer This role focuses on partnering with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. It involves owning the customer journey from exploration to deployment, translating business goals into reliable, observable services with clear quality, latency, and cost outcomes. The role blends engineering, product management, technical customer success, and pre-sales solution engineering.	ServeAgent	Engineering	San Francisco, CA	Apr 21	7
AI Solutions Engineer AI Solutions Engineer role focused on partnering with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. This involves owning the customer journey from exploration to production, translating business goals into reliable, observable services with clear quality, latency, and cost outcomes. The role blends engineering, product management, technical customer success, and pre-sales solution engineering.	ShipServe	Engineering	San Francisco, CA	Apr 21	7
Solution Architect Solution Architect role at Baseten, a company providing AI inference infrastructure. The role involves partnering with Sales and customers to understand business needs, design technical solutions, run technical discovery, and guide deployments and proofs of value. Responsibilities include customer discovery calls, technical scoping, leading demos, owning benchmarking and repeatable deployments across various AI modalities, advising on infrastructure tradeoffs, and driving POC execution. Requires an AI/ML background, strong customer-facing communication, and technical depth to scope solutions.	Serve	Engineering	San Francisco, CA	Feb 25	7
Software Engineer - AI Enablement Baseten is seeking an AI Enablement Engineer to own and develop AI-powered tooling and agent infrastructure for internal productivity. This role involves evaluating, customizing, and deploying AI coding agents and building custom internal agents for tasks like incident triage and codebase Q&A. The engineer will also track usage, measure impact, and stay updated on AI tooling advancements to enhance the engineering organization's effectiveness across the SDLC.	Agent	Engineering	San Francisco, CA	Feb 24	7
Software Engineer — GPU Networking & Distributed Systems Software Engineer focused on GPU Networking and Distributed Systems to optimize AI inference infrastructure, specifically for LLMs and multi-modal models. The role involves integrating RDMA, optimizing networking layers for disaggregated KV cache and WideEP, enabling fast startup speeds, and building observability tools for bleeding-edge hardware.	Serve	Engineering	San Francisco, CA	Feb 23	7
Software Engineer - Training Product Software Engineer focused on building and shipping training products for AI companies, working across the full stack from API to infrastructure, including fine-tuning models and partnering with research engineers. The role involves developing features like multi-node training and serverless RL, with a focus on developer experience and reliability.	Post-trainServe	Engineering	San Francisco, CA	Jan 22	7
Software Engineer, Model Performance Systems Software Engineer role focused on building and optimizing the performance of AI inference infrastructure, including benchmarking, hardware profiling, and developing automated testing and monitoring tools for LLMs.	Serve	Engineering	San Francisco, CA	Jan 7	7
Software Engineer - Training Infrastructure Software Engineer on the Training Infrastructure team responsible for architecting and leading development of the ML training platform, focusing on scheduling, storage, networking, reliability, and observability for research engineers and model developers.	Data	Engineering	San Francisco, CA	Aug '25	7
Software Engineer - Infrastructure Software Engineer focused on building and maintaining the ML inference platform, enabling high-performance deployment, scaling, and monitoring of AI models for production applications.	Serve	Engineering	San Francisco, CA	Mar '25	7
Software Engineer - Core Product Software Engineer on the Core Product team at Baseten, building and maintaining the core Baseten product that enables users to deploy and get value from ML models. The role involves working across the stack, including CLI tools, REST APIs, and the web application, with a focus on new feature development, API design, and bug fixing. Example initiatives include chains for multi-component workflows, asynchronous inference, model APIs, and model training for production inference.	ServeAgent	Engineering	San Francisco, CA	Jul '24	7
Forward Deployed Engineer The Forward Deployed Engineer partners with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. This role involves owning the customer journey from exploration to production, translating business goals into reliable services with clear quality, latency, and cost outcomes. It blends engineering, product management, technical customer success, and pre-sales solution engineering.	ServeAgent	Engineering	San Francisco, CA	Mar '24	7
Capacity Strategy & Operations Lead This role focuses on capacity strategy and operations for an AI inference platform. The lead will manage the end-to-end capacity planning process, translating customer demand and growth forecasts into supply requirements, coordinating fulfillment, and building scalable systems. They will also lead cross-functional efforts during constrained supply situations and provide strategic briefings to leadership. The role requires strong quantitative and modeling skills, experience in strategy/operations, and the ability to manage complex, cross-functional processes in a fast-moving environment.	—	Product	San Francisco, CA	2w ago	5
Software Engineer - Capacity Software Engineer on the Capacity team at Baseten, a company that provides inference infrastructure for AI companies. This role focuses on owning and developing the internal operating system for managing customer lifecycle, supply, and demand, translating operational requirements into product features, and building full-stack features for the Capacity toolchain. The role requires strong full-stack proficiency, experience with internal tooling/developer infrastructure, and an interest in AI/ML infrastructure.	—	Engineering	San Francisco, CA	2w ago	5
Technical Program Manager, Infrastructure Technical Program Manager to drive complex, cross-cutting AI infrastructure programs, focusing on execution, process, and managing migrations and dependencies across multiple engineering teams.	—	Engineering	San Francisco, CA	2w ago	5
Engineering Manager, Cloud Platform Engineering Manager for Baseten's Cloud Platform team, responsible for building scalable, reliable, and efficient infrastructure for AI inference. The role involves people management, technical direction, and ensuring operational excellence in a cloud environment. Prior ML experience is not required, but familiarity with ML infrastructure is a plus.	—	Engineering	San Francisco, CA	2w ago	5
Engineering Manager, Internal Platform Engineering Manager for Baseten's Platform Team, responsible for building internal systems to improve engineering productivity, collaboration, and quality through tooling, workflows, AI enablement, and development environments. Focuses on people leadership and technical direction for platform infrastructure.	—	Engineering	San Francisco, CA	2w ago	5
Assistant General Counsel, Infrastructure & Compute This role is for an Assistant General Counsel focused on the legal strategy and execution for Baseten's compute and infrastructure supply chain, which is critical for their AI inference services. The role involves negotiating agreements for GPU compute, cloud capacity, and infrastructure services, managing colocation and network contracting, and assessing concentration risk. It requires strong legal expertise in technology transactions, particularly in compute and hardware supply chains, and the ability to partner with engineering and finance teams. While the company is in the AI space and the role supports AI companies, the core function is legal and contractual, not direct AI/ML development.	—	Engineering	San Francisco, CA	3w ago	5
Head of Legal Operations This role is for a Head of Legal Operations at an AI infrastructure company. The primary focus is on building and running the legal team's operating system, including contract lifecycle management, intake and triage, knowledge management, and budget. A key aspect is leveraging AI-assisted and agentic workflows to improve efficiency and scale the legal function. The role emphasizes building over buying AI solutions for legal operations.	—	Product	San Francisco, CA	3w ago	5
Engineering Manager, Cloud Platform Engineering Manager for Baseten's Cloud Platform team, responsible for building scalable, reliable, and efficient infrastructure for AI inference. The role involves people management, technical direction, and ensuring operational excellence in a cloud environment. Prior ML experience is not required, but familiarity with ML infrastructure is a plus.	—	Engineering	San Francisco, CA	6w ago	5
Senior Manager, Cloud Platform & Site Reliability Senior Manager role leading Cloud Platform and Site Reliability Engineering for an AI infrastructure company. Focuses on managing teams, setting technical direction for infrastructure, reliability, and platform engineering, and ensuring the health of the cloud infrastructure and SRE practice. Requires expertise in Kubernetes, cloud infrastructure, distributed systems, IaC, CI/CD, and observability. Bonus for experience with AI/ML workloads, GPU infrastructure, and AI-assisted incident tooling.	—	Engineering	San Francisco, CA	6w ago	5
Capacity and Infrastructure Lead This role focuses on building the analytics foundation for tracking infrastructure usage, capacity, and cloud spend across Baseten's AI inference platform. The lead will create data models to unify cloud billing, usage, capacity, and telemetry data, working with various teams to optimize cost and utilization. Responsibilities include building dashboards, modeling data from multiple providers, defining core metrics, supporting forecasting, developing anomaly alerting, and ensuring data reliability.	—	Engineering	San Francisco, CA	7w ago	5
SRE Site Reliability Engineer to define and codify gold standards for day 2 operations of an ML infrastructure platform, focusing on robust systems, processes, automations, and observability to ensure reliability at scale and empower the organization. The role involves incident response, building observability tooling, and diagnosing runtime issues related to ML model deployment.	—	Engineering	San Francisco, CA	7w ago	5
OS / K8s Systems Engineer Baseten is seeking an OS / K8s Systems Engineer to build and automate the infrastructure that turns raw GPU hardware into production-ready compute for AI companies. This role focuses on the software layer for reproducible, scalable, and reliable infrastructure across data centers, including OS images, provisioning pipelines, and cluster orchestration.	—	Engineering	San Francisco, CA	8w ago	5
Engineering Manager, Internal Platform Engineering Manager for Baseten's Platform Team, responsible for building internal systems to improve engineering productivity, collaboration, and quality through tooling, workflows, AI enablement, and development environments. Focuses on people leadership and technical direction for platform infrastructure.	—	Engineering	San Francisco, CA	8w ago	5
Strategic Finance, GTM Strategic Finance, GTM lead to partner with Sales and Marketing leaders, owning revenue forecasting, capacity planning, compensation design, and deal-desk for a consumption-based AI infrastructure company.	—	Product	San Francisco, CA	8w ago	5
GTM Engineer GTM Engineer to design, build, and ship AI-powered workflows for sales, marketing, and support functions. This role involves auditing the existing stack, identifying gaps, and building custom AI solutions using tools like Claude Code, integrating third-party APIs, and thinking in systems for stack consolidation.	Agent	Engineering	San Francisco, CA	Apr 22	5
Integrated Marketing Manager This role is for an Integrated Marketing Manager at Baseten, a company that provides inference infrastructure for AI companies. The manager will be responsible for planning and executing multi-channel marketing campaigns to drive pipeline and accelerate go-to-market momentum. The role requires experience in campaign management, familiarity with marketing tech stacks, strong analytical skills, and an automation- and AI-native mindset.	—	Product	San Francisco, CA	Apr 17	5
Content Engineer Baseten is seeking a Content Engineer to join their team for a 3-month contract-to-hire position. The role focuses on creating written content for developers in the AI space, identifying high-leverage channels, and defining content discoverability playbooks. Responsibilities include shipping technical content, designing automated content production workflows using AI tools, and identifying relevant topics for the Baseten ICP. Requires hands-on developer experience (Python, Javascript, SQL) and comfort with GTM automation tools. Nice-to-haves include experience with AI/ML infrastructure and LLM tooling.	—	Product	San Francisco, CA	Apr 9	5
Product Manager - Core Product Product Manager for Baseten's core product, focusing on developer experience for building, deploying, and managing AI applications. The role involves shaping APIs, SDKs, UI workflows, and integration surfaces to simplify ML infrastructure for users.	—	Product	San Francisco, CA	Apr 2	5
Security Engineer Baseten is seeking an experienced Security Engineer to build and maintain the security posture of their ML infrastructure platform, which serves AI companies. The role involves security architecture, vulnerability management, incident response, IAM, compliance, employee training, and DevSecOps integration, with a focus on cloud and container security.	—	Engineering	San Francisco, CA	Apr 1	5
Software Engineer - Model Developer Ecosystem Software Engineer focused on the model developer ecosystem, revamping the model library to help developers discover, evaluate, and select models. This role involves creating guides, evaluations, and educational content to navigate the specialized AI model landscape, operating at the intersection of technical depth, community building, and product thinking.	—	Product	San Francisco, CA	Mar 20	5
Account Executive - Industries Enterprise Account Executive role at Baseten, a company providing AI inference infrastructure. The role focuses on selling Baseten's platform to complex, regulated industries like financial services and healthcare. Responsibilities include owning the sales cycle, driving new business, acting as a trusted advisor, and collaborating with engineering and product teams. Requires 8+ years of enterprise B2B sales experience in technology, with a track record of closing large deals and experience in regulated industries. Technical acumen in areas like model serving and GPU infra is important. Nice to have direct experience selling AI/ML infrastructure.	—	Engineering	San Francisco, CA	Mar 20	5
Account Executive - AI Native: Startups Account Executive role at an AI inference platform company. Focuses on sales, prospecting, and closing new business with AI-native startups. Requires SaaS sales experience and ability to sell to technical audiences, with a preference for experience in developer/ML tooling.	—	Product	New York, NY	Mar 19	5
Data Engineer Data Engineer to build and scale Baseten's internal data platform, transforming raw product and business data into reliable datasets that power decision-making. This role will design data models, pipelines, and analytics infrastructure, working with AI inference, infrastructure, and observability data to generate insights.	Serve	Engineering	San Francisco, CA	Mar 18	5
Infrastructure Ops Engineer This role is for an Infrastructure Ops Engineer at Baseten, a company that provides inference infrastructure for AI companies. The engineer will manage the operational aspects of global infrastructure, focusing on hardware lifecycles, Kubernetes, and cloud-native tools. Key responsibilities include fleet maintenance, fulfilling customer capacity requests, improving system observability, orchestrating maintenance, documenting GPU-specific issues, and building automation to reduce manual intervention. The role acts as a bridge between customers, SRE, and infrastructure teams to ensure platform reliability and readiness for AI deployments.	Serve	Engineering	San Francisco, CA	Mar 10	5
Onboarding Program Manager Baseten is seeking an Onboarding Program Manager to build and lead their onboarding program for new hires. This role involves designing and delivering curriculum, enabling managers, and ensuring new hires are set up for success within the first 90 days in a fast-growing AI startup environment. The focus is on accelerating ramp time and reinforcing company culture and product knowledge.	—	Product	San Francisco, CA	Mar 4	5
Performance Marketing Manager Baseten is seeking a Performance Marketing Manager to own and scale their paid acquisition engine, focusing on converting ML engineers and AI builders into users, pipeline, and revenue. This role requires end-to-end ownership of paid acquisition strategy, rigorous funnel analytics, structured experimentation, and scaling high-performing campaigns to grow qualified pipeline while reducing CAC. The ideal candidate understands how to market to a technical audience, thinks in terms of funnels and metrics, and has experience in B2B SaaS or developer tooling with an interest in AI.	—	Product	San Francisco, CA	Feb 26	5

Frequently asked questions

What AI roles is Baseten hiring for?
Baseten currently has 26 active AI-related roles in our index. The most common open titles are: AI Solutions Engineer, Applied AI Inference Engineer, Data Engineer, Engineering Manager - Forward Deployed Engineering (LLM), Engineering Manager - Model Performance. Most positions are in Engineering and Research.
What stage of AI development does Baseten focus on?
Baseten's active AI hiring is concentrated in: serving infrastructure (73%), post-training (12%), agents (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Where is Baseten hiring AI talent?
Baseten is hiring AI talent in: United States (26 roles).
What technologies does Baseten's AI team work with?
Job postings at Baseten most frequently reference: model serving, inference infra, llm observability, agent orchestration, tool use.
How many AI roles has Baseten posted recently?
In the past 30 days, Baseten has posted 2 new AI-related roles.

Title

Stage

Function

Location

First seen

AI score

Post-Training Research Engineer

Baseten is seeking a Post-Training Research Engineer to build in-house tooling for post-training AI models at scale. This role involves deep technical dives into ML techniques, distributed computing, and systems-level concepts to support customer custom models, which are critical for Baseten's inference platform.

Post-train

Engineering

San Francisco, CA

Mar 23

Post-Training Applied Researcher

Post-training researcher focused on fine-tuning open-source LLMs for specific customer tasks using RL and reward engineering. Involves building training pipelines, environments, and evals, and working with customer data to improve models that reach millions of users.

Post-trainAgent

Research

San Francisco, CA

Mar 17

Post-Training Research Scientist

Research Scientist focused on post-training methodology and performant inference, with a significant portion dedicated to pure research and the remainder to applied research informing the company's platform and customer needs. The role involves designing and executing experiments, publishing at top venues, and collaborating with engineering teams to translate research into production systems.

Post-trainServe

Research

San Francisco, CA

Mar 17

Software Engineer - GPU Kernels

Software Engineer focused on optimizing GPU kernels for ML inference, including matrix multiplications, attention mechanisms, and quantization, using CUDA and PTX assembly.

Serve

Engineering

San Francisco, CA

Jul '25

Engineering Manager - Forward Deployed Engineering (LLM)

Engineering Manager for Forward Deployed Engineering team focused on building, scaling, and optimizing LLM inference workloads for Baseten customers. This role involves hands-on technical ownership, team leadership, and collaboration with product and infrastructure teams to ensure best-in-class performance, reliability, and cost efficiency of AI applications on Baseten's platform. The role contributes to the core codebase and drives feature roadmap, acting as a player-coach.

ServeAgent

Engineering

San Francisco, CA

7w ago

Manager, Solutions Architect

Manager for a Solutions Architect team focused on enabling customers to deploy and optimize AI/ML models, particularly LLMs, on Baseten's inference platform. The role involves leadership, technical guidance, customer discovery, and ensuring high performance, reliability, and cost efficiency of AI applications in production.

Serve

Engineering

San Francisco, CA

7w ago

Software Engineer - Voice AI (Inference Runtime)

Software Engineer focused on building and optimizing the inference runtime for Voice AI models, including state-of-the-art open-source models. The role involves developing large-scale, real-time infrastructure for multi-model voice agents, reducing latency, increasing throughput, and improving GPU efficiency. It also includes designing iteration loops for voice model customization and customization.

ServeAgent

Engineering

San Francisco, CA

Apr 23

Software Engineer - Model APIs

Software Engineer role focused on optimizing and operating Model APIs for AI inference, involving distributed systems, model serving, and developer experience. The role emphasizes performance improvements, structured outputs, tool/function calling, and multi-modal serving.

ServeAgent

Engineering

San Francisco, CA

Oct '25

Engineering Manager - Model Performance

Engineering Manager for Model Performance at Baseten, a company providing inference infrastructure for AI companies. The role involves leading a team of engineers to optimize ML model inference and performance, focusing on production-level AI/ML solutions and scaling large models. Requires a strong engineering background, leadership experience, and expertise in ML performance optimization, with hands-on work in areas like TensorRT, PyTorch, and CUDA.

Serve

Engineering

San Francisco, CA

Sep '24

Software Engineer - Model Performance

Software Engineer focused on ML performance for LLM inference, optimizing techniques like quantization and speculative decoding, and debugging ML performance issues in libraries like TensorRT and PyTorch.

Serve

Engineering

San Francisco, CA

Mar '24

Product Manager, Developer Experience

Product Manager for Developer Experience at Baseten, a company providing AI inference infrastructure. The role focuses on owning the end-to-end developer journey for deploying and iterating on models, including CLI/SDKs, console, onboarding, deployment lifecycle, and multi-model composition for agents and applications. The goal is to make Baseten synonymous with great developer experience and effortless model deployment.

ServeAgent

Product

San Francisco, CA

2w ago

Software Engineer- BIS (Baseten Inference Stack)

Software Engineer for Baseten's Inference Stack team, focusing on building and operating the distributed runtime for large-scale LLM inference. The role involves working across the stack from developer experience to low-level infrastructure, ensuring performance, scalability, and reliability of AI model deployments.

Serve

Engineering

San Francisco, CA

4w ago

Solution Architect (AI/LLM Inference)

Solution Architect role focused on AI/LLM inference, partnering with Sales and customers to design and deploy technical solutions. Responsibilities include customer discovery, technical scoping, leading demos, managing deployments, and driving POC execution. Requires an AI/ML background and customer-facing communication skills, with the ability to script and prototype.

Serve

Engineering

San Francisco, CA

7w ago

Applied AI Inference Engineer

This role focuses on partnering with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. It involves owning the customer journey from exploration to deployment, translating business goals into reliable, observable services with clear quality, latency, and cost outcomes. The role blends engineering, product management, technical customer success, and pre-sales solution engineering.

ServeAgent

Engineering

San Francisco, CA

Apr 21

AI Solutions Engineer

AI Solutions Engineer role focused on partnering with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. This involves owning the customer journey from exploration to production, translating business goals into reliable, observable services with clear quality, latency, and cost outcomes. The role blends engineering, product management, technical customer success, and pre-sales solution engineering.

ShipServe

Engineering

San Francisco, CA

Apr 21

Solution Architect

Solution Architect role at Baseten, a company providing AI inference infrastructure. The role involves partnering with Sales and customers to understand business needs, design technical solutions, run technical discovery, and guide deployments and proofs of value. Responsibilities include customer discovery calls, technical scoping, leading demos, owning benchmarking and repeatable deployments across various AI modalities, advising on infrastructure tradeoffs, and driving POC execution. Requires an AI/ML background, strong customer-facing communication, and technical depth to scope solutions.

Serve

Engineering

San Francisco, CA

Feb 25

Software Engineer - AI Enablement

Baseten is seeking an AI Enablement Engineer to own and develop AI-powered tooling and agent infrastructure for internal productivity. This role involves evaluating, customizing, and deploying AI coding agents and building custom internal agents for tasks like incident triage and codebase Q&A. The engineer will also track usage, measure impact, and stay updated on AI tooling advancements to enhance the engineering organization's effectiveness across the SDLC.

Agent

Engineering

San Francisco, CA

Feb 24

Software Engineer — GPU Networking & Distributed Systems

Software Engineer focused on GPU Networking and Distributed Systems to optimize AI inference infrastructure, specifically for LLMs and multi-modal models. The role involves integrating RDMA, optimizing networking layers for disaggregated KV cache and WideEP, enabling fast startup speeds, and building observability tools for bleeding-edge hardware.

Serve

Engineering

San Francisco, CA

Feb 23

Software Engineer - Training Product

Software Engineer focused on building and shipping training products for AI companies, working across the full stack from API to infrastructure, including fine-tuning models and partnering with research engineers. The role involves developing features like multi-node training and serverless RL, with a focus on developer experience and reliability.

Post-trainServe

Engineering

San Francisco, CA

Jan 22

Software Engineer, Model Performance Systems

Software Engineer role focused on building and optimizing the performance of AI inference infrastructure, including benchmarking, hardware profiling, and developing automated testing and monitoring tools for LLMs.

Serve

Engineering

San Francisco, CA

Jan 7

Software Engineer - Training Infrastructure

Software Engineer on the Training Infrastructure team responsible for architecting and leading development of the ML training platform, focusing on scheduling, storage, networking, reliability, and observability for research engineers and model developers.

Data

Engineering

San Francisco, CA

Aug '25

Software Engineer - Infrastructure

Software Engineer focused on building and maintaining the ML inference platform, enabling high-performance deployment, scaling, and monitoring of AI models for production applications.

Serve

Engineering

San Francisco, CA

Mar '25

Software Engineer - Core Product

Software Engineer on the Core Product team at Baseten, building and maintaining the core Baseten product that enables users to deploy and get value from ML models. The role involves working across the stack, including CLI tools, REST APIs, and the web application, with a focus on new feature development, API design, and bug fixing. Example initiatives include chains for multi-component workflows, asynchronous inference, model APIs, and model training for production inference.

ServeAgent

Engineering

San Francisco, CA

Jul '24

Forward Deployed Engineer

The Forward Deployed Engineer partners with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. This role involves owning the customer journey from exploration to production, translating business goals into reliable services with clear quality, latency, and cost outcomes. It blends engineering, product management, technical customer success, and pre-sales solution engineering.

ServeAgent

Engineering

San Francisco, CA

Mar '24

Capacity Strategy & Operations Lead

This role focuses on capacity strategy and operations for an AI inference platform. The lead will manage the end-to-end capacity planning process, translating customer demand and growth forecasts into supply requirements, coordinating fulfillment, and building scalable systems. They will also lead cross-functional efforts during constrained supply situations and provide strategic briefings to leadership. The role requires strong quantitative and modeling skills, experience in strategy/operations, and the ability to manage complex, cross-functional processes in a fast-moving environment.

—

Product

San Francisco, CA

2w ago

Software Engineer - Capacity

Software Engineer on the Capacity team at Baseten, a company that provides inference infrastructure for AI companies. This role focuses on owning and developing the internal operating system for managing customer lifecycle, supply, and demand, translating operational requirements into product features, and building full-stack features for the Capacity toolchain. The role requires strong full-stack proficiency, experience with internal tooling/developer infrastructure, and an interest in AI/ML infrastructure.

—

Engineering

San Francisco, CA

2w ago

Technical Program Manager, Infrastructure

Technical Program Manager to drive complex, cross-cutting AI infrastructure programs, focusing on execution, process, and managing migrations and dependencies across multiple engineering teams.

—

Engineering

San Francisco, CA

2w ago

Engineering Manager, Cloud Platform

Engineering Manager for Baseten's Cloud Platform team, responsible for building scalable, reliable, and efficient infrastructure for AI inference. The role involves people management, technical direction, and ensuring operational excellence in a cloud environment. Prior ML experience is not required, but familiarity with ML infrastructure is a plus.

—

Engineering

San Francisco, CA

2w ago

Engineering Manager, Internal Platform

Engineering Manager for Baseten's Platform Team, responsible for building internal systems to improve engineering productivity, collaboration, and quality through tooling, workflows, AI enablement, and development environments. Focuses on people leadership and technical direction for platform infrastructure.

—

Engineering

San Francisco, CA

2w ago

Assistant General Counsel, Infrastructure & Compute

This role is for an Assistant General Counsel focused on the legal strategy and execution for Baseten's compute and infrastructure supply chain, which is critical for their AI inference services. The role involves negotiating agreements for GPU compute, cloud capacity, and infrastructure services, managing colocation and network contracting, and assessing concentration risk. It requires strong legal expertise in technology transactions, particularly in compute and hardware supply chains, and the ability to partner with engineering and finance teams. While the company is in the AI space and the role supports AI companies, the core function is legal and contractual, not direct AI/ML development.

—

Engineering

San Francisco, CA

3w ago

Head of Legal Operations

This role is for a Head of Legal Operations at an AI infrastructure company. The primary focus is on building and running the legal team's operating system, including contract lifecycle management, intake and triage, knowledge management, and budget. A key aspect is leveraging AI-assisted and agentic workflows to improve efficiency and scale the legal function. The role emphasizes building over buying AI solutions for legal operations.

—

Product

San Francisco, CA

3w ago

Engineering Manager, Cloud Platform

—

Engineering

San Francisco, CA

6w ago

Senior Manager, Cloud Platform & Site Reliability

Senior Manager role leading Cloud Platform and Site Reliability Engineering for an AI infrastructure company. Focuses on managing teams, setting technical direction for infrastructure, reliability, and platform engineering, and ensuring the health of the cloud infrastructure and SRE practice. Requires expertise in Kubernetes, cloud infrastructure, distributed systems, IaC, CI/CD, and observability. Bonus for experience with AI/ML workloads, GPU infrastructure, and AI-assisted incident tooling.

—

Engineering

San Francisco, CA

6w ago

Capacity and Infrastructure Lead

This role focuses on building the analytics foundation for tracking infrastructure usage, capacity, and cloud spend across Baseten's AI inference platform. The lead will create data models to unify cloud billing, usage, capacity, and telemetry data, working with various teams to optimize cost and utilization. Responsibilities include building dashboards, modeling data from multiple providers, defining core metrics, supporting forecasting, developing anomaly alerting, and ensuring data reliability.

—

Engineering

San Francisco, CA

7w ago

SRE

Site Reliability Engineer to define and codify gold standards for day 2 operations of an ML infrastructure platform, focusing on robust systems, processes, automations, and observability to ensure reliability at scale and empower the organization. The role involves incident response, building observability tooling, and diagnosing runtime issues related to ML model deployment.

—

Engineering

San Francisco, CA

7w ago

OS / K8s Systems Engineer

Baseten is seeking an OS / K8s Systems Engineer to build and automate the infrastructure that turns raw GPU hardware into production-ready compute for AI companies. This role focuses on the software layer for reproducible, scalable, and reliable infrastructure across data centers, including OS images, provisioning pipelines, and cluster orchestration.

—

Engineering

San Francisco, CA

8w ago

Engineering Manager, Internal Platform

—

Engineering

San Francisco, CA

8w ago

Strategic Finance, GTM

Strategic Finance, GTM lead to partner with Sales and Marketing leaders, owning revenue forecasting, capacity planning, compensation design, and deal-desk for a consumption-based AI infrastructure company.

—

Product

San Francisco, CA

8w ago

GTM Engineer

GTM Engineer to design, build, and ship AI-powered workflows for sales, marketing, and support functions. This role involves auditing the existing stack, identifying gaps, and building custom AI solutions using tools like Claude Code, integrating third-party APIs, and thinking in systems for stack consolidation.

Agent

Engineering

San Francisco, CA

Apr 22

Integrated Marketing Manager

This role is for an Integrated Marketing Manager at Baseten, a company that provides inference infrastructure for AI companies. The manager will be responsible for planning and executing multi-channel marketing campaigns to drive pipeline and accelerate go-to-market momentum. The role requires experience in campaign management, familiarity with marketing tech stacks, strong analytical skills, and an automation- and AI-native mindset.

—

Product

San Francisco, CA

Apr 17

Content Engineer

Baseten is seeking a Content Engineer to join their team for a 3-month contract-to-hire position. The role focuses on creating written content for developers in the AI space, identifying high-leverage channels, and defining content discoverability playbooks. Responsibilities include shipping technical content, designing automated content production workflows using AI tools, and identifying relevant topics for the Baseten ICP. Requires hands-on developer experience (Python, Javascript, SQL) and comfort with GTM automation tools. Nice-to-haves include experience with AI/ML infrastructure and LLM tooling.

—

Product

San Francisco, CA

Apr 9

Product Manager - Core Product

Product Manager for Baseten's core product, focusing on developer experience for building, deploying, and managing AI applications. The role involves shaping APIs, SDKs, UI workflows, and integration surfaces to simplify ML infrastructure for users.

—

Product

San Francisco, CA

Apr 2

Security Engineer

Baseten is seeking an experienced Security Engineer to build and maintain the security posture of their ML infrastructure platform, which serves AI companies. The role involves security architecture, vulnerability management, incident response, IAM, compliance, employee training, and DevSecOps integration, with a focus on cloud and container security.

—

Engineering

San Francisco, CA

Apr 1

Software Engineer - Model Developer Ecosystem

Software Engineer focused on the model developer ecosystem, revamping the model library to help developers discover, evaluate, and select models. This role involves creating guides, evaluations, and educational content to navigate the specialized AI model landscape, operating at the intersection of technical depth, community building, and product thinking.

—

Product

San Francisco, CA

Mar 20

Account Executive - Industries

Enterprise Account Executive role at Baseten, a company providing AI inference infrastructure. The role focuses on selling Baseten's platform to complex, regulated industries like financial services and healthcare. Responsibilities include owning the sales cycle, driving new business, acting as a trusted advisor, and collaborating with engineering and product teams. Requires 8+ years of enterprise B2B sales experience in technology, with a track record of closing large deals and experience in regulated industries. Technical acumen in areas like model serving and GPU infra is important. Nice to have direct experience selling AI/ML infrastructure.

—

Engineering

San Francisco, CA

Mar 20

Account Executive - AI Native: Startups

Account Executive role at an AI inference platform company. Focuses on sales, prospecting, and closing new business with AI-native startups. Requires SaaS sales experience and ability to sell to technical audiences, with a preference for experience in developer/ML tooling.

—

Product

New York, NY

Mar 19

Data Engineer

Data Engineer to build and scale Baseten's internal data platform, transforming raw product and business data into reliable datasets that power decision-making. This role will design data models, pipelines, and analytics infrastructure, working with AI inference, infrastructure, and observability data to generate insights.

Serve

Engineering

San Francisco, CA

Mar 18

Infrastructure Ops Engineer

This role is for an Infrastructure Ops Engineer at Baseten, a company that provides inference infrastructure for AI companies. The engineer will manage the operational aspects of global infrastructure, focusing on hardware lifecycles, Kubernetes, and cloud-native tools. Key responsibilities include fleet maintenance, fulfilling customer capacity requests, improving system observability, orchestrating maintenance, documenting GPU-specific issues, and building automation to reduce manual intervention. The role acts as a bridge between customers, SRE, and infrastructure teams to ensure platform reliability and readiness for AI deployments.

Serve

Engineering

San Francisco, CA

Mar 10

Onboarding Program Manager

Baseten is seeking an Onboarding Program Manager to build and lead their onboarding program for new hires. This role involves designing and delivering curriculum, enabling managers, and ensuring new hires are set up for success within the first 90 days in a fast-growing AI startup environment. The focus is on accelerating ramp time and reinforcing company culture and product knowledge.

—

Product

San Francisco, CA

Mar 4

Performance Marketing Manager

Baseten is seeking a Performance Marketing Manager to own and scale their paid acquisition engine, focusing on converting ML engineers and AI builders into users, pipeline, and revenue. This role requires end-to-end ownership of paid acquisition strategy, rigorous funnel analytics, structured experimentation, and scaling high-performing campaigns to grow qualified pipeline while reducing CAC. The ideal candidate understands how to market to a technical audience, thinks in terms of funnels and metrics, and has experience in B2B SaaS or developer tooling with an interest in AI.

—

Product

San Francisco, CA

Feb 26