AI Hire Signal
JobsCompaniesTrendsInsightsWeekly
JobsStrategy timeline
AI Hire Signal

Tracking AI hiring across 200+ US tech companies. Stage, salary, and stack signals on every role — refreshed weekly.

Contact

Browse

JobsCompaniesTrendsInsightsWeekly

Resources

AboutSitemapRobots

Legal

PrivacyTerms
© 2026 AI Hire Signal·Not affiliated with companies shown

Baseten currently has 26 active AI-related job listings. The majority of these roles, 69%, are focused on serving infrastructure. The dominant function is Engineering, with 24 roles, and hiring is concentrated in the United States. Frequent tech tags include model serving, inference infrastructure, and LLM observability, suggesting a focus on the operational aspects of AI deployment. In the last 30 days, Baseten posted 3 new AI roles, a 25% decrease compared to the previous 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 22 active AI roles, up 13% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $300k.

Hiring
22 / 22
Momentum (4w)
↑+2 +13%
18 opens last 4w · 16 prior 4w
Salary range
$300k
USD · disclosed roles only
Tracked since
Mar '24
last role 6w ago
Hiring velocityscroll left for older weeks
2 new roles
Mar 25
1 new role
Jul 8
1 new role
Sep 9
1 new role
Feb 3
1 new role
Mar 3
1 new role
24
1 new role
Apr 7
1 new role
Jul 14
1 new role
Aug 18
2 new roles
25
1 new role
Sep 1
2 new roles
Oct 6
1 new role
Dec 8
1 new role
Jan 5
1 new role
19
1 new role
26
7 new roles
Feb 23
5 new roles
Mar 2
1 new role
9
6 new roles
16
1 new role
23
3 new roles
30
2 new roles
Apr 6
2 new roles
13
4 new roles
20
4 new roles
27
3 new roles
May 4
6 new roles
11
3 new roles
18
6 new roles
Jun 1
7 new roles
8
1 new role
15
4 new roles
22

Frequently asked questions

  • What AI roles is Baseten hiring for?

    Baseten currently has 26 active AI-related roles in our index. The most common open titles are: AI Solutions Engineer, Applied AI Inference Engineer, Data Engineer, Engineering Manager - Forward Deployed Engineering (LLM), Engineering Manager - Model Performance. Most positions are in Engineering and Research.

  • What stage of AI development does Baseten focus on?

    Baseten's active AI hiring is concentrated in: serving infrastructure (73%), post-training (12%), agents (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

  • Where is Baseten hiring AI talent?

    Baseten is hiring AI talent in: United States (26 roles).

  • What technologies does Baseten's AI team work with?

    Job postings at Baseten most frequently reference: model serving, inference infra, llm observability, agent orchestration, tool use.

  • How many AI roles has Baseten posted recently?

    In the past 30 days, Baseten has posted 2 new AI-related roles.

Jobs (42)

23 AI · 63 total active
FilteredFunctionEngineering×
Show
Active onlyAI only (≥ 7)
Stage
AllData · 1Post-train · 3Serve · 19Agent · 2Ship · 1
Function
AllEngineering · 42Product · 19Research · 1
Sort
AI scoreRecentTitle
TitleStageFunctionLocationFirst seenAI score
Post-Training Research Engineer
Baseten is seeking a Post-Training Research Engineer to build in-house tooling for post-training AI models at scale. This role involves deep technical dives into ML techniques, distributed computing, and systems-level concepts to support customer custom models, which are critical for Baseten's inference platform.
Post-trainEngineeringSan Francisco, CAMar 239
Software Engineer - GPU Kernels
Software Engineer focused on optimizing GPU kernels for ML inference, including matrix multiplications, attention mechanisms, and quantization, using CUDA and PTX assembly.
ServeEngineeringSan Francisco, CAJul '259
Engineering Manager - Forward Deployed Engineering (LLM)
Engineering Manager for Forward Deployed Engineering team focused on building, scaling, and optimizing LLM inference workloads for Baseten customers. This role involves hands-on technical ownership, team leadership, and collaboration with product and infrastructure teams to ensure best-in-class performance, reliability, and cost efficiency of AI applications on Baseten's platform. The role contributes to the core codebase and drives feature roadmap, acting as a player-coach.
ServeAgentEngineeringSan Francisco, CA7w ago8
Manager, Solutions Architect
Manager for a Solutions Architect team focused on enabling customers to deploy and optimize AI/ML models, particularly LLMs, on Baseten's inference platform. The role involves leadership, technical guidance, customer discovery, and ensuring high performance, reliability, and cost efficiency of AI applications in production.
ServeEngineeringSan Francisco, CA7w ago8
Software Engineer - Voice AI (Inference Runtime)
Software Engineer focused on building and optimizing the inference runtime for Voice AI models, including state-of-the-art open-source models. The role involves developing large-scale, real-time infrastructure for multi-model voice agents, reducing latency, increasing throughput, and improving GPU efficiency. It also includes designing iteration loops for voice model customization and customization.
ServeAgentEngineeringSan Francisco, CAApr 238
Software Engineer - Model APIs
Software Engineer role focused on optimizing and operating Model APIs for AI inference, involving distributed systems, model serving, and developer experience. The role emphasizes performance improvements, structured outputs, tool/function calling, and multi-modal serving.
ServeAgentEngineeringSan Francisco, CAOct '258
Engineering Manager - Model Performance
Engineering Manager for Model Performance at Baseten, a company providing inference infrastructure for AI companies. The role involves leading a team of engineers to optimize ML model inference and performance, focusing on production-level AI/ML solutions and scaling large models. Requires a strong engineering background, leadership experience, and expertise in ML performance optimization, with hands-on work in areas like TensorRT, PyTorch, and CUDA.
ServeEngineeringSan Francisco, CASep '248
Software Engineer - Model Performance
Software Engineer focused on ML performance for LLM inference, optimizing techniques like quantization and speculative decoding, and debugging ML performance issues in libraries like TensorRT and PyTorch.
ServeEngineeringSan Francisco, CAMar '248
Software Engineer- BIS (Baseten Inference Stack)
Software Engineer for Baseten's Inference Stack team, focusing on building and operating the distributed runtime for large-scale LLM inference. The role involves working across the stack from developer experience to low-level infrastructure, ensuring performance, scalability, and reliability of AI model deployments.
ServeEngineeringSan Francisco, CA4w ago7
Solution Architect (AI/LLM Inference)
Solution Architect role focused on AI/LLM inference, partnering with Sales and customers to design and deploy technical solutions. Responsibilities include customer discovery, technical scoping, leading demos, managing deployments, and driving POC execution. Requires an AI/ML background and customer-facing communication skills, with the ability to script and prototype.
ServeEngineeringSan Francisco, CA7w ago7
Applied AI Inference Engineer
This role focuses on partnering with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. It involves owning the customer journey from exploration to deployment, translating business goals into reliable, observable services with clear quality, latency, and cost outcomes. The role blends engineering, product management, technical customer success, and pre-sales solution engineering.
ServeAgentEngineeringSan Francisco, CAApr 217
AI Solutions Engineer
AI Solutions Engineer role focused on partnering with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. This involves owning the customer journey from exploration to production, translating business goals into reliable, observable services with clear quality, latency, and cost outcomes. The role blends engineering, product management, technical customer success, and pre-sales solution engineering.
ShipServeEngineeringSan Francisco, CAApr 217
Solution Architect
Solution Architect role at Baseten, a company providing AI inference infrastructure. The role involves partnering with Sales and customers to understand business needs, design technical solutions, run technical discovery, and guide deployments and proofs of value. Responsibilities include customer discovery calls, technical scoping, leading demos, owning benchmarking and repeatable deployments across various AI modalities, advising on infrastructure tradeoffs, and driving POC execution. Requires an AI/ML background, strong customer-facing communication, and technical depth to scope solutions.
ServeEngineeringSan Francisco, CAFeb 257
Software Engineer - AI Enablement
Baseten is seeking an AI Enablement Engineer to own and develop AI-powered tooling and agent infrastructure for internal productivity. This role involves evaluating, customizing, and deploying AI coding agents and building custom internal agents for tasks like incident triage and codebase Q&A. The engineer will also track usage, measure impact, and stay updated on AI tooling advancements to enhance the engineering organization's effectiveness across the SDLC.
AgentEngineeringSan Francisco, CAFeb 247
Software Engineer — GPU Networking & Distributed Systems
Software Engineer focused on GPU Networking and Distributed Systems to optimize AI inference infrastructure, specifically for LLMs and multi-modal models. The role involves integrating RDMA, optimizing networking layers for disaggregated KV cache and WideEP, enabling fast startup speeds, and building observability tools for bleeding-edge hardware.
ServeEngineeringSan Francisco, CAFeb 237
Software Engineer - Training Product
Software Engineer focused on building and shipping training products for AI companies, working across the full stack from API to infrastructure, including fine-tuning models and partnering with research engineers. The role involves developing features like multi-node training and serverless RL, with a focus on developer experience and reliability.
Post-trainServeEngineeringSan Francisco, CAJan 227
Software Engineer, Model Performance Systems
Software Engineer role focused on building and optimizing the performance of AI inference infrastructure, including benchmarking, hardware profiling, and developing automated testing and monitoring tools for LLMs.
ServeEngineeringSan Francisco, CAJan 77
Software Engineer - Training Infrastructure
Software Engineer on the Training Infrastructure team responsible for architecting and leading development of the ML training platform, focusing on scheduling, storage, networking, reliability, and observability for research engineers and model developers.
DataEngineeringSan Francisco, CAAug '257
Software Engineer - Infrastructure
Software Engineer focused on building and maintaining the ML inference platform, enabling high-performance deployment, scaling, and monitoring of AI models for production applications.
ServeEngineeringSan Francisco, CAMar '257
Software Engineer - Core Product
Software Engineer on the Core Product team at Baseten, building and maintaining the core Baseten product that enables users to deploy and get value from ML models. The role involves working across the stack, including CLI tools, REST APIs, and the web application, with a focus on new feature development, API design, and bug fixing. Example initiatives include chains for multi-component workflows, asynchronous inference, model APIs, and model training for production inference.
ServeAgentEngineeringSan Francisco, CAJul '247
Forward Deployed Engineer
The Forward Deployed Engineer partners with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. This role involves owning the customer journey from exploration to production, translating business goals into reliable services with clear quality, latency, and cost outcomes. It blends engineering, product management, technical customer success, and pre-sales solution engineering.
ServeAgentEngineeringSan Francisco, CAMar '247
Software Engineer - Capacity
Software Engineer on the Capacity team at Baseten, a company that provides inference infrastructure for AI companies. This role focuses on owning and developing the internal operating system for managing customer lifecycle, supply, and demand, translating operational requirements into product features, and building full-stack features for the Capacity toolchain. The role requires strong full-stack proficiency, experience with internal tooling/developer infrastructure, and an interest in AI/ML infrastructure.
—EngineeringSan Francisco, CA2w ago5
Technical Program Manager, Infrastructure
Technical Program Manager to drive complex, cross-cutting AI infrastructure programs, focusing on execution, process, and managing migrations and dependencies across multiple engineering teams.
—EngineeringSan Francisco, CA2w ago5
Engineering Manager, Cloud Platform
Engineering Manager for Baseten's Cloud Platform team, responsible for building scalable, reliable, and efficient infrastructure for AI inference. The role involves people management, technical direction, and ensuring operational excellence in a cloud environment. Prior ML experience is not required, but familiarity with ML infrastructure is a plus.
—EngineeringSan Francisco, CA2w ago5
Engineering Manager, Internal Platform
Engineering Manager for Baseten's Platform Team, responsible for building internal systems to improve engineering productivity, collaboration, and quality through tooling, workflows, AI enablement, and development environments. Focuses on people leadership and technical direction for platform infrastructure.
—EngineeringSan Francisco, CA2w ago5
Assistant General Counsel, Infrastructure & Compute
This role is for an Assistant General Counsel focused on the legal strategy and execution for Baseten's compute and infrastructure supply chain, which is critical for their AI inference services. The role involves negotiating agreements for GPU compute, cloud capacity, and infrastructure services, managing colocation and network contracting, and assessing concentration risk. It requires strong legal expertise in technology transactions, particularly in compute and hardware supply chains, and the ability to partner with engineering and finance teams. While the company is in the AI space and the role supports AI companies, the core function is legal and contractual, not direct AI/ML development.
—EngineeringSan Francisco, CA3w ago5
Senior Manager, Cloud Platform & Site Reliability
Senior Manager role leading Cloud Platform and Site Reliability Engineering for an AI infrastructure company. Focuses on managing teams, setting technical direction for infrastructure, reliability, and platform engineering, and ensuring the health of the cloud infrastructure and SRE practice. Requires expertise in Kubernetes, cloud infrastructure, distributed systems, IaC, CI/CD, and observability. Bonus for experience with AI/ML workloads, GPU infrastructure, and AI-assisted incident tooling.
—EngineeringSan Francisco, CA6w ago5
Capacity and Infrastructure Lead
This role focuses on building the analytics foundation for tracking infrastructure usage, capacity, and cloud spend across Baseten's AI inference platform. The lead will create data models to unify cloud billing, usage, capacity, and telemetry data, working with various teams to optimize cost and utilization. Responsibilities include building dashboards, modeling data from multiple providers, defining core metrics, supporting forecasting, developing anomaly alerting, and ensuring data reliability.
—EngineeringSan Francisco, CA7w ago5
SRE
Site Reliability Engineer to define and codify gold standards for day 2 operations of an ML infrastructure platform, focusing on robust systems, processes, automations, and observability to ensure reliability at scale and empower the organization. The role involves incident response, building observability tooling, and diagnosing runtime issues related to ML model deployment.
—EngineeringSan Francisco, CA7w ago5
OS / K8s Systems Engineer
Baseten is seeking an OS / K8s Systems Engineer to build and automate the infrastructure that turns raw GPU hardware into production-ready compute for AI companies. This role focuses on the software layer for reproducible, scalable, and reliable infrastructure across data centers, including OS images, provisioning pipelines, and cluster orchestration.
—EngineeringSan Francisco, CA8w ago5
GTM Engineer
GTM Engineer to design, build, and ship AI-powered workflows for sales, marketing, and support functions. This role involves auditing the existing stack, identifying gaps, and building custom AI solutions using tools like Claude Code, integrating third-party APIs, and thinking in systems for stack consolidation.
AgentEngineeringSan Francisco, CAApr 225
Security Engineer
Baseten is seeking an experienced Security Engineer to build and maintain the security posture of their ML infrastructure platform, which serves AI companies. The role involves security architecture, vulnerability management, incident response, IAM, compliance, employee training, and DevSecOps integration, with a focus on cloud and container security.
—EngineeringSan Francisco, CAApr 15
Account Executive - Industries
Enterprise Account Executive role at Baseten, a company providing AI inference infrastructure. The role focuses on selling Baseten's platform to complex, regulated industries like financial services and healthcare. Responsibilities include owning the sales cycle, driving new business, acting as a trusted advisor, and collaborating with engineering and product teams. Requires 8+ years of enterprise B2B sales experience in technology, with a track record of closing large deals and experience in regulated industries. Technical acumen in areas like model serving and GPU infra is important. Nice to have direct experience selling AI/ML infrastructure.
—EngineeringSan Francisco, CAMar 205
Data Engineer
Data Engineer to build and scale Baseten's internal data platform, transforming raw product and business data into reliable datasets that power decision-making. This role will design data models, pipelines, and analytics infrastructure, working with AI inference, infrastructure, and observability data to generate insights.
ServeEngineeringSan Francisco, CAMar 185
Infrastructure Ops Engineer
This role is for an Infrastructure Ops Engineer at Baseten, a company that provides inference infrastructure for AI companies. The engineer will manage the operational aspects of global infrastructure, focusing on hardware lifecycles, Kubernetes, and cloud-native tools. Key responsibilities include fleet maintenance, fulfilling customer capacity requests, improving system observability, orchestrating maintenance, documenting GPU-specific issues, and building automation to reduce manual intervention. The role acts as a bridge between customers, SRE, and infrastructure teams to ensure platform reliability and readiness for AI deployments.
ServeEngineeringSan Francisco, CAMar 105
Cloud Platform Engineer
Baseten is seeking a Cloud Platform Engineer to build and maintain scalable infrastructure for AI model deployment and operation. The role focuses on ensuring infrastructure is scalable, reliable, and efficient, automating processes, and collaborating with AI companies to support their inference needs. No prior ML experience is required, but openness to learning is expected.
—EngineeringSan Francisco, CAOct '255
Software Engineer - Enterprise Platform
This role focuses on building and architecting infrastructure and platform features for enterprise customers, including self-hosted clusters, private connectivity, and compliance solutions. It involves advancing self-hosted offerings, managing capacity, and implementing observability tooling within a cloud-agnostic environment.
—EngineeringSan Francisco, CAAug '255
Software Engineer - Internal Platform
Software Engineer on the Internal Platform team at Baseten, a company that provides inference infrastructure for AI companies. This role focuses on building internal systems to improve engineering productivity, collaboration, and quality through tooling, workflows, and development environments. Responsibilities include creating diverse tooling, enhancing monorepos, designing shared libraries for observability, improving CI pipelines, managing Terraform modules, and supporting engineering teams.
—EngineeringSan Francisco, CAMar '255
Engineering Manager, Runtime Fabric
Engineering Manager for the Runtime Fabrics team, responsible for purpose-building container runtime and storage layers for AI inference workloads. This involves leading a team of systems engineers, setting technical direction, and contributing to open-source container projects. The role focuses on optimizing container runtimes for AI inference, addressing issues like GPU memory, image pulling times, and multi-tenant isolation.
—EngineeringSan Francisco, CA2w ago0
Data Center Network Engineer
Designs and owns high-performance network infrastructure for GPU clusters, focusing on cluster fabric design, cabling architecture, and performance validation to support AI model training and inference.
—EngineeringSan Francisco, CA5w ago0
GTM Recruiter
Recruiter for an AI infrastructure company, focusing on hiring for the Sales team. This role involves full-cycle recruiting, sourcing, candidate experience, process improvements, and data analysis. Requires 3+ years of recruiting experience, preferably in a startup, with proven success in hiring for Sales teams.
—EngineeringSan Francisco, CA7w ago0
Software Engineer - Billing and Internal Tooling
Software Engineer role focused on owning and evolving Baseten's end-to-end billing and revenue infrastructure, including pricing, invoicing, metering, and reporting. This involves building and maintaining the billing platform, integrating with systems like Orb, and partnering with Finance, Sales, and GTM teams to create reliable internal tooling and automation. The role also emphasizes driving reliability, operational excellence, and leading high-impact projects within revenue-critical workflows.
—EngineeringSan Francisco, CAFeb 270