What AI roles is Together AI hiring for?

Together AI currently has 24 active AI-related roles in our index. The most common open titles are: Solutions Architect (2); AI Infrastructure Engineer; AI Researcher, Core ML (Turbo); Backend Software Engineer — Data Platform & AI Data Products; and Customer Support Engineer (Inference). Most positions are in Engineering and Research.

What stage of AI development does Together AI focus on?

Together AI's active AI hiring is concentrated in serving infrastructure (96%) and post-training (4%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

Where is Together AI hiring AI talent?

Together AI is hiring AI talent in: United States (19 roles), Netherlands (2 roles), United Kingdom (1 role).

What technologies does Together AI's AI team work with?

Job postings at Together AI most frequently reference inference infrastructure, model serving, fine-tuning, LLM observability, and audio/speech.

How many AI roles has Together AI posted recently?

In the past 30 days, Together AI has posted 6 new AI-related roles.

Together AI — AI hiring signals

Together AI currently has 22 active AI-related job listings. The majority of these roles, 95%, are focused on serving infrastructure, with one role in post-training. The company is primarily hiring for Engineering positions, with 20 roles available, and is seeking candidates in the United States and the Netherlands. Frequent technical tags include model serving, inference infrastructure, and fine-tuning, suggesting a focus on deployment and optimization of AI models. In the last 30 days, Together AI added 5 new AI roles, a 150% increase from the previous 30-day period.
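The 150% figure above is consistent with a simple period-over-period percentage change. The sketch below assumes that is how the index derives it. The function name `pct_change` is hypothetical, and the prior-period count of 2 is inferred from "5 new roles" and "150% increase" rather than stated in the source.

```python
def pct_change(current: int, previous: int) -> float:
    """Period-over-period change, in percent."""
    if previous == 0:
        # No baseline to compare against, so the ratio is undefined.
        raise ValueError("previous period has no postings; change is undefined")
    return (current - previous) / previous * 100.0


# Assumed counts: 5 roles in the last 30 days, 2 in the prior 30 days.
print(pct_change(5, 2))    # 150.0

# Same formula applied to the 4-week momentum card: 14 opens vs. 14 prior.
print(pct_change(14, 14))  # 0.0
```

Under this reading, a flat 4-week window (14 opens against 14 prior) yields the 0% momentum shown in the stat cards.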

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 20 active AI roles, with 14 new openings in the last 4 weeks. Primary focus: Serve · Engineering. Salary range $160k–$300k (avg $226k).

Hiring: 20 / 23

Momentum (4w): 0 (0%), with 14 opens in the last 4 weeks and 14 in the prior 4 weeks

Salary range: $160k–$300k (avg $226k), USD, disclosed roles only

Tracked since: Jan '24 (last role 2w ago)

Hiring velocity: chart of new roles per week (weekly counts range from 1 to 7 new roles).

Jobs (70)

21 AI · 56 total active

Title	Stage	Function	Location	First seen	AI score
Research Engineer, Core ML Research Engineer role focused on improving inference efficiency and unifying it with RL/post-training systems for production-grade AI APIs. The role involves end-to-end ownership of critical systems, translating frontier ideas into robust infrastructure, and shipping measurable improvements in latency, throughput, cost, and model quality at scale.	Serve, Post-train	Research	San Francisco, CA	Feb 18	10
Staff Machine Learning Engineer, Voice AI Staff ML Engineer focused on optimizing the model serving layer for voice AI applications, including speech-to-text and text-to-speech models, with a focus on latency, throughput, and GPU utilization using inference engines like TRT-LLM and SGLang. The role involves building evaluation frameworks, supporting model partners, and shaping the architecture for next-generation voice models.	Serve	Engineering	San Francisco, CA	3w ago	9
Forward Deployed Engineer (Inference & Post-Training) Forward Deployed Engineer focused on optimizing inference engines and fine-tuning pipelines for production AI teams, acting as a technical partner to strategic customers. Responsibilities include inference engine optimization, performance tuning, post-training/fine-tuning (LoRA, SFT, DPO, RLHF, GRPO), customer alignment, onboarding, and providing product feedback.	Serve, Post-train	Engineering	San Francisco, CA	5w ago	9
Senior Machine Learning Engineer, Voice AI Senior ML Engineer focused on optimizing the model serving layer for voice AI workloads, including speech-to-text and text-to-speech models. The role involves hands-on work with inference engines, GPU optimization, batching strategies, and ensuring new model architectures can be productionized efficiently. The goal is to achieve best-in-class latency and reliability for real-time voice applications.	Serve	Engineering	San Francisco, CA	Mar 30	9
Research Engineer, Frontier Speculative Decoding Research Engineer focused on translating internal model training research into production-ready deployments by fine-tuning general-purpose models into specialized tools. This involves designing novel speculative algorithms, data curation, hyperparameter tuning, and checkpoint evaluation, with a focus on accuracy-efficiency tradeoffs for generative AI models.	Post-train, Serve	Research	San Francisco, CA	Nov '25	9
Systems Research Engineer, GPU Programming This role focuses on optimizing and developing GPU-accelerated kernels and algorithms for ML/AI applications, requiring expertise in GPU programming (CUDA, Triton) and performance profiling. The engineer will collaborate with modeling, hardware, and software teams to enhance AI system efficiency and co-design GPU architectures.	Serve	Engineering	San Francisco, CA	Jan '24	9
AI Researcher, Core ML (Turbo) AI Researcher focused on the intersection of efficient inference algorithms, architectures, engines, and post-training/RL systems for production-scale API services. The role involves advancing inference efficiency, unifying inference with RL/post-training, and owning critical systems.	Serve, Post-train	Engineering	San Francisco, CA	Jan '24	9
Staff Engineer, Distributed Storage and HPC & AI Infrastructure Staff Engineer focused on designing and delivering multi-petabyte storage systems optimized for AI training and inference workloads. Responsibilities include architecting high-performance parallel filesystems and object stores, building Kubernetes-native storage operators, optimizing data paths for high throughput, and implementing intelligent caching and data distribution strategies. The role requires deep expertise in distributed storage systems, Kubernetes, and programming in Go and Python.	Serve	Engineering	San Francisco, CA	5d ago	8
Forward Deployed Engineer (GPU Clusters) The Forward Deployed Engineer (FDE) will be a technical partner to customers building large-scale AI models, focusing on GPU cluster infrastructure, networking, storage, and orchestration to ensure stability, optimize performance, and facilitate platform adoption. This role involves hardening clusters, tuning orchestration layers (Kubernetes/SLURM), debugging low-level bottlenecks, building reference designs, and leading benchmarking exercises.	Serve	Engineering	San Francisco, CA	6w ago	8
Engineering Manager, Model Serving Engineering Manager for Together AI's Model Serving platform, focusing on delivering world-class inference and fine-tuning in public APIs and customer deployments. Responsibilities include owning SLAs, improving testing/deployment/monitoring, building self-serve tooling, defining configuration best practices for inference engines, leading incident response, and mentoring team members. Requires 5+ years operating production ML inference or training systems at scale and 2+ years in senior IC or tech lead roles, with deep expertise in Kubernetes, multi-cluster orchestration, and ML serving frameworks.	Serve, Post-train	Engineering	San Francisco, CA	Mar 5	8
LLM Inference Frameworks and Optimization Engineer Seeking an Inference Frameworks and Optimization Engineer to design, develop, and optimize distributed inference engines for multimodal and language models. Focus on low-latency, high-throughput inference, GPU/accelerator optimizations, and software-hardware co-design for efficient large-scale AI deployment.	Serve	Engineering	Remote	Mar '25	8
Machine Learning Engineer Machine Learning Engineer at Together AI focused on developing and scaling production systems for LLM inference and fine-tuning APIs. Requires strong experience in high-performance, distributed systems and the LLM inference ecosystem.	Serve, Post-train	Engineering	San Francisco, CA	Jan '25	8
Machine Learning Engineer - Inference Machine Learning Engineer focused on optimizing and enhancing the performance of AI inference systems, working with state-of-the-art large language models to ensure efficient and effective operation at scale. Responsibilities include designing and building production systems, optimizing runtime inference services, and creating supporting tools and documentation.	Serve	Engineering	San Francisco, CA	Jun '24	8
Lead/Manager Together Cloud Infrastructure Lead/Manager for Together Cloud Infrastructure in Amsterdam, focusing on building and managing a team to develop and operate a global, high-performance cloud platform for AI workloads, including GPU scheduling, management plane, and customer-facing services.	Serve	Engineering	Amsterdam, Netherlands	1w ago	7
Staff Platform Engineer, Voice AI Staff Platform Engineer for Together AI's Voice AI platform, focusing on the architecture and reliability of real-time API layers, autoscaling for latency-sensitive workloads, and building the observability platform for voice infrastructure. The role requires deep expertise in distributed systems, real-time streaming, and Kubernetes, with a strong product intuition for developer platforms.	Serve	Engineering	San Francisco, CA	3w ago	7
AI Infrastructure Engineer AI Infrastructure Engineer responsible for keeping user-facing services and production systems running smoothly, specializing in systems, availability, reliability, and scalability, with interests in algorithms and distributed systems. Builds and runs infrastructure using Ansible, Terraform, and Kubernetes, and develops monitoring systems.	Serve	Engineering	San Francisco, CA	4w ago	7
Senior Platform Engineer, Voice AI Senior Platform Engineer for Together AI's Voice AI platform, focusing on the API and infrastructure layer for real-time speech-to-text and text-to-speech models. The role involves building WebSocket and HTTP APIs, designing autoscaling for latency-sensitive streaming, and ensuring platform reliability for production voice agents.	Serve	Engineering	San Francisco, CA	Mar 30	7
Backend Engineer Senior Backend/Distributed Systems Engineer to build and maintain the Together AI Sandbox service, focusing on API platform performance, reliability, and scalability. Responsibilities include designing core backend components, performing research for AI workloads, and ensuring code quality through design and code reviews.	Serve	Engineering	Amsterdam, Netherlands	Mar 10	7
Together Cloud Infrastructure Engineer This role focuses on building and maintaining the AI cloud infrastructure, including services for hardware management, IaaS software layer for GPU data centers, high-performance object storage for pretraining, and advanced observability stacks. The engineer will work on the core Together AI platform, create services and tools, and develop testing frameworks for robustness and fault-tolerance.	Serve, Data	Engineering	Amsterdam, Netherlands	Jan 20	7
Staff Engineer, Distributed Storage, HPC & AI Infrastructure Staff Engineer focused on designing and delivering multi-petabyte distributed storage systems optimized for AI training and inference workloads. Responsibilities include architecting high-performance parallel filesystems and object stores, integrating cutting-edge technologies, driving cost optimization, and building Kubernetes-native storage operators and self-service platforms. The role requires deep expertise in distributed storage, Kubernetes, and performance optimization for GPU/HPC clusters, with strong coding skills in Go and Python.	Serve	Engineering	Amsterdam, Netherlands	Jan 20	7
Solutions Architect Solutions Architect at Together AI, a research-driven AI company focused on lowering the cost of AI systems. This role involves working with customers to build Generative AI applications using open-source models, acting as a technical advisor, running demos and POCs, and collaborating with sales. Requires strong technical background in AI/ML, GPU technologies, Python/JavaScript, and familiarity with infrastructure services. The role contributes to product feedback and educational content creation.	Serve	Engineering	London, United Kingdom	Oct '25	7
Senior Backend Engineer, Inference Platform Senior Backend Engineer focused on building and optimizing the inference platform for advanced generative AI models, including LLMs and multimodal models, at scale. The role involves optimizing latency, throughput, and resource allocation across tens of thousands of GPUs, collaborating with researchers to productionize frontier models, and contributing to open-source inference projects.	Serve	Engineering	San Francisco, CA	Aug '25	7
Machine Learning, Platform Engineer Machine Learning Platform Engineer at Together AI, focusing on building a container platform, optimizing autoscaling, minimizing cold starts, and improving end-to-end model performance for custom models and dedicated inference. The role involves optimizing inference across the stack, including CUDA kernels, PyTorch, inference engines, and container orchestration.	Serve	Engineering	San Francisco, CA	Aug '25	7
AI Infrastructure Engineer AI Infrastructure Engineer responsible for keeping user-facing services and production systems running smoothly, applying engineering principles and automation to operating environments. Focuses on systems, availability, reliability, and scalability, with interests in algorithms and distributed systems. Builds and runs infrastructure using Ansible, Terraform, and Kubernetes, and designs monitoring systems.	Serve	Engineering	San Francisco, CA	Jun '25	7
Senior Software Engineer - Together Cloud Infrastructure Senior Software Engineer focused on building and operating a high-performance, global AI cloud infrastructure platform. This includes designing and maintaining backend services for hardware management, IaaS software layer for GPU data centers, high-performance object storage for pretraining datasets, and advanced observability stacks for distributed pretraining. The role also involves architecture and research for decentralized AI workloads and contributing to the open-source platform.	Serve, Data	Engineering	San Francisco, CA	Jun '25	7
Solutions Architect Solutions Architect at Together AI to work with customers and prospects to create business value through Generative AI applications. This role involves acting as a technical advisor, running demonstrations and POCs, collaborating with sales, building relationships with customer leadership, delivering feedback to product/engineering/research, and building educational content. Requires 5+ years in a customer-facing technical role with 2+ years in pre-sales, strong technical background in AI/ML/GPU, understanding of LLM training/fine-tuning/inference, Python/JavaScript proficiency, and familiarity with infrastructure services.	Serve	Engineering	San Francisco, CA	Jan '25	7
Manager, Infrastructure Strategy & Operations This role focuses on the strategy, operations, and analytical backbone for scaling compute infrastructure at an AI-native cloud company. It involves research, benchmarking, and decision frameworks for sourcing, evaluating, and deploying compute, with a focus on market intelligence, site comparisons, and operational analysis. Responsibilities include building dashboards for visibility into costs and utilization, developing comparison frameworks for sourcing decisions, and evaluating data center sites and energy options. The role requires strong quantitative skills, experience in high-growth startups or AI companies, and familiarity with AI productivity tools.	—	Engineering	San Francisco, CA	1w ago	5
Customer Support Engineer (Inference) Customer Support Engineer role focused on supporting customers with Together AI's inference and fine-tuning services, GPU clusters, and Gen AI solutions. The role involves resolving complex technical challenges, acting as a product expert, collaborating with engineering and product teams, and transforming customer insights into product roadmap improvements. Requires strong technical background in AI, ML, GPU technologies, HPC environments, and familiarity with infrastructure services and Python.	Serve, Post-train	Engineering	San Francisco, CA	2w ago	5
Senior Technical Recruiter, AI/ML Research Senior Technical Recruiter for Together AI, a company building an AI Native Cloud. The role focuses on scaling world-class AI research and engineering teams by partnering with leadership, leading full-cycle recruiting for specialized AI talent, and providing market intelligence.	—	Engineering	San Francisco, CA	2w ago	5
Engineering Manager, Site Reliability Engineering Engineering Manager for Site Reliability Engineering (SRE) to lead a team of ~10 engineers responsible for Together AI's production infrastructure, including bare-metal GPU compute, public-cloud Kubernetes for inference, and Kubernetes with virtualization for virtual clusters. The role involves a mix of management (50-60%) and hands-on technical work (40-50%), focusing on shifting the team from reactive, manual operations to systemic, automation-first work, improving incident response, and developing engineers.	—	Engineering	San Francisco, CA	2w ago	5
Junior Technical Program Manager — Infrastructure Operations This role focuses on the operational management of a large GPU fleet, ensuring nodes are online, GPUs are performing, and datacenter transitions are smooth. It involves owning the end-to-end node lifecycle, driving remediation, managing project timelines for new datacenter bring-ups, diagnosing utilization loss, and building dashboards for visibility and accountability. The environment is fast-paced and requires figuring things out alongside engineers building at the frontier.	—	Engineering	San Francisco, CA	3w ago	5
Staff Engineer, Customer Insights Staff Engineer to build and scale the customer-facing visibility layer for Together's AI Cloud, focusing on historical analytics, activity history, audit logs, event timelines, notifications, and investigation workflows. The role will evolve these foundations into AI-first investigation and insight workflows that summarize activity, explain anomalies, and provide trustworthy context for human operators and autonomous agents. This is a hands-on role designing event, query, delivery, and governance systems, and building user-facing workflows for enterprise customers.	—	Engineering	San Francisco, CA	5w ago	5
Technical Account Manager (TAM), AI Factory This role is a Technical Account Manager focused on the infrastructure supporting large-scale AI GPU deployments for a strategic enterprise customer. The TAM will be the primary technical point of contact, responsible for the end-to-end technical relationship across compute, networking, storage, and facilities, ensuring smooth delivery and operational health. Responsibilities include issue lifecycle management, hardware lifecycle management, advising on infrastructure stack best practices, owning the observability strategy, coordinating operations, and managing capacity expansions. The role requires deep expertise in GPU infrastructure, large-scale networking, enterprise storage, and DC operations, with experience in customer-facing technical roles and AI/HPC infrastructure.	—	Engineering	San Francisco, CA	6w ago	5
Director, Support Engineering This role leads and scales the customer support function for Together AI, focusing on both API support (serverless/dedicated inference, billing) and GPU support (large-scale training infrastructure). It's a player-coach position requiring hands-on involvement in complex escalations, managing support engineers, defining KPIs, and improving support workflows and tooling. The role requires strong technical depth in AI infrastructure, distributed systems, and experience with SLA-driven operations.	—	Engineering	San Francisco, CA	6w ago	5
Customer Support Engineer (GPU Cluster) Customer Support Engineer role focused on supporting customers using Together AI's GPU clusters for training, fine-tuning, and inference. The role involves resolving complex technical challenges, acting as a product expert, and collaborating with Engineering and Product teams. Requires experience in customer-facing technical roles, familiarity with AI/ML, GPU technologies, and infrastructure services like Kubernetes.	—	Engineering	San Francisco, CA	Apr 7	5
Sr. Partnerships Manager, Model Ecosystem This role is responsible for building and managing the model ecosystem for Together AI, focusing on negotiating deals with model builders to bring proprietary and open-source models onto the platform. It involves working closely with Product, Finance, and Marketing to ensure the model roadmap is technically superior, commercially viable, and market-facing. The role requires strong deal-making, technical curiosity, and experience in business development or strategic partnerships within developer platforms.	—	Product	San Francisco, CA	Apr 7	5
Backend Software Engineer — Data Platform & AI Data Products Backend Software Engineer focused on building data platform infrastructure and LLM-adjacent data products. The role involves designing and developing backend services for event streams, access layers, and APIs, as well as creating services for prompt categorization, enrichment, and metadata. The engineer will apply AI augmentation mindset to their own development and the systems they build, with a focus on production backend systems, distributed systems, and data modeling.	Serve	Engineering	San Francisco, CA	Mar 11	5
Customer Support Engineer (Inference), India Customer Support Engineer role at Together AI, focusing on supporting customers with their training, fine-tuning, and inference solutions. The role involves deep technical problem-solving on GPU clusters and AI services, acting as a product expert and a liaison between customers and internal engineering/product teams. Requires strong technical background in AI, ML, and HPC, with experience in customer-facing technical support.	Serve, Post-train	Engineering	Remote	Mar 10	5
Engineering Manager / Tech Lead Engineering Manager / Tech Lead for the Sandbox team, responsible for building and operating isolated, secure compute environments for AI code execution, including reinforcement learning workflows, LLM code interpreters, and AI agents. This role involves technical leadership, people management, hiring, and collaborating with product and other engineering teams. The team builds sandbox infrastructure, SDKs, platform integrations, and developer tooling.	—	Engineering	Amsterdam, Netherlands	Feb 27	5
Lead Product Designer Lead Product Designer to craft user experiences for technical AI development tools, shape AI development, and establish design standards for a growing organization. This role involves leading UX initiatives, elevating design quality, and collaborating with Engineering, Product, and Marketing.	—	Product	San Francisco, CA	Feb 26	5
Product Marketing Director Product Marketing Director at Together AI, a frontier AI cloud company. This role will own platform and product value propositions, GTM strategy, product launches, and messaging. The role involves leading and scaling the PMM function, partnering with Product Management, Sales, and Engineering. Requires 10+ years of PMM experience in enterprise software (preferably AI/Cloud) and 5+ years in team leadership. The company has seen significant growth and is research-driven, contributing to open-source AI advancements.	—	Product	San Francisco, CA	Nov '25	5
Customer Support Engineer (GPU Cluster), India Customer Support Engineer for GPU Clusters at Together AI, focusing on resolving technical challenges for customers building training, fine-tuning, and inference solutions. The role involves being a product expert, collaborating with engineering and product teams, and transforming customer insights into product improvements. Requires experience in customer-facing technical roles, AI/ML/GPU technologies, and infrastructure services like Kubernetes.	—	Engineering	Remote	Aug '25	5
Senior Software Engineer - Together Cloud Platform Senior Backend Engineer role focused on building and scaling the AI Acceleration Cloud platform, which virtualizes ML hardware and provides self-serve AI cloud services for ML practitioners. Responsibilities include developing distributed GPU scheduling systems, global management planes, and customer-facing cloud platform services, ensuring high availability and performance.	—	Engineering	San Francisco, CA	Jun '25	5
AI infrastructure Engineer (SRE) Amsterdam AI infrastructure Engineer (SRE) responsible for keeping user-facing services and production systems running smoothly, specializing in systems, availability, reliability, and scalability. The role involves building and running infrastructure with Ansible, Terraform, and Kubernetes, implementing monitoring and observability, and debugging production issues.	—	Engineering	EUROPE	Apr '25	5
Sr. Technical Program Manager (TPM) This role is for a Senior Technical Program Manager (TPM) at an AI infrastructure company. The TPM will focus on building, optimizing, and scaling global GPU resources, ensuring efficient and reliable operation of the AI model backbone. Responsibilities include product development for AI researchers and customers, owning the product roadmap, stakeholder engagement, and cross-functional execution across Research, Engineering, DevOps, SRE, and Go-to-Market teams. Requires 5+ years of experience in AI/ML product or infrastructure, with a technical background.	—	Product	San Francisco, CA	Feb '25	5
Strategic Finance Senior Manager Strategic Finance Senior Manager at Together AI, a research-driven AI infrastructure company. This role focuses on providing financial insights, driving strategic decision-making, and optimizing business performance, with a significant emphasis on guiding the optimization and scaling of the company's compute infrastructure. The position requires strong financial modeling, business judgment, and the ability to partner with various functions like Engineering, Product, and GTM.	—	Product	San Francisco, CA	Jan '25	5
Infrastructure Vendor Ops Manager This role focuses on managing vendor operations for GPU infrastructure, including SLA tracking, credit recovery, invoice auditing, and financial accountability. It requires strong attention to detail, technical fluency to understand incident reports for credit claims, and negotiation skills with providers. The role also involves process development and financial forecasting for infrastructure budgets.	—	Engineering	San Francisco, CA	4w ago	0
Infrastructure Design Engineer This role focuses on the physical infrastructure design of data centers that house AI GPU clusters. Responsibilities include designing whitespace layouts, power distribution, cooling, and structured cabling to support high-density AI hardware. The role requires expertise in data center design, critical facilities engineering, and collaboration with various engineering and operational teams.	—	Engineering	San Francisco, CA	4w ago	0
Sr. Revenue Accountant This role is for a Sr. Revenue Accountant responsible for end-to-end accounts receivable, invoicing, collections, and revenue recognition in compliance with ASC 606. The candidate will partner with various teams to ensure accurate revenue recording and compliance with US GAAP. Responsibilities include journal entries, reconciliations, AR process ownership, billing dispute resolution, and process improvement. Requirements include a CPA, 5+ years of accounting experience with a focus on revenue accounting/AR/O2C, and strong technical knowledge of US GAAP.	—	Product	San Francisco, CA	4w ago	0
Infrastructure Accounting Manager This role focuses on building and managing accounting processes for AI infrastructure assets, including fixed assets, CIP, and leases. It requires strong technical accounting expertise and collaboration with engineering and operations teams to ensure accurate financial reporting and compliance.	—	Product	San Francisco, CA	4w ago	0


Title

Stage

Function

Location

First seen

AI score

Research Engineer, Core ML

Research Engineer role focused on improving inference efficiency and unifying it with RL/post-training systems for production-grade AI APIs. The role involves end-to-end ownership of critical systems, translating frontier ideas into robust infrastructure, and shipping measurable improvements in latency, throughput, cost, and model quality at scale.

ServePost-train

Research

San Francisco, CA

Feb 18

Staff Machine Learning Engineer, Voice AI

Staff ML Engineer focused on optimizing the model serving layer for voice AI applications, including speech-to-text and text-to-speech models, with a focus on latency, throughput, and GPU utilization using inference engines like TRT-LLM and SGLang. The role involves building evaluation frameworks, supporting model partners, and shaping the architecture for next-generation voice models.

Serve

Engineering

San Francisco, CA

3w ago

Forward Deployed Engineer (Inference & Post-Training)

Forward Deployed Engineer focused on optimizing inference engines and fine-tuning pipelines for production AI teams, acting as a technical partner to strategic customers. Responsibilities include inference engine optimization, performance tuning, post-training/fine-tuning (LoRA, SFT, DPO, RLHF, GRPO), customer alignment, onboarding, and providing product feedback.

ServePost-train

Engineering

San Francisco, CA

5w ago

Senior Machine Learning Engineer, Voice AI

Senior ML Engineer focused on optimizing the model serving layer for voice AI workloads, including speech-to-text and text-to-speech models. The role involves hands-on work with inference engines, GPU optimization, batching strategies, and ensuring new model architectures can be productionized efficiently. The goal is to achieve best-in-class latency and reliability for real-time voice applications.

Serve

Engineering

San Francisco, CA

Mar 30

Research Engineer, Frontier Speculative Decoding

Research Engineer focused on translating internal model training research into production-ready deployments by fine-tuning general-purpose models into specialized tools. This involves designing novel speculative algorithms, data curation, hyperparameter tuning, and checkpoint evaluation, with a focus on accuracy-efficiency tradeoffs for generative AI models.

Post-train · Serve

Research

San Francisco, CA

Nov '25

Systems Research Engineer, GPU Programming

This role focuses on optimizing and developing GPU-accelerated kernels and algorithms for ML/AI applications, requiring expertise in GPU programming (CUDA, Triton) and performance profiling. The engineer will collaborate with modeling, hardware, and software teams to enhance AI system efficiency and co-design GPU architectures.

Serve

Engineering

San Francisco, CA

Jan '24

AI Researcher, Core ML (Turbo)

AI Researcher focused on the intersection of efficient inference algorithms, architectures, engines, and post-training/RL systems for production-scale API services. The role involves advancing inference efficiency, unifying inference with RL/post-training, and owning critical systems.

Serve · Post-train

Engineering

San Francisco, CA

Jan '24

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

Staff Engineer focused on designing and delivering multi-petabyte storage systems optimized for AI training and inference workloads. Responsibilities include architecting high-performance parallel filesystems and object stores, building Kubernetes-native storage operators, optimizing data paths for high throughput, and implementing intelligent caching and data distribution strategies. The role requires deep expertise in distributed storage systems, Kubernetes, and programming in Go and Python.

Serve

Engineering

San Francisco, CA

5d ago

Forward Deployed Engineer (GPU Clusters)

The Forward Deployed Engineer (FDE) will be a technical partner to customers building large-scale AI models, focusing on GPU cluster infrastructure, networking, storage, and orchestration to ensure stability, optimize performance, and facilitate platform adoption. This role involves hardening clusters, tuning orchestration layers (Kubernetes/SLURM), debugging low-level bottlenecks, building reference designs, and leading benchmarking exercises.

Serve

Engineering

San Francisco, CA

6w ago

Engineering Manager, Model Serving

Engineering Manager for Together AI's Model Serving platform, focusing on delivering world-class inference and fine-tuning in public APIs and customer deployments. Responsibilities include owning SLAs, improving testing/deployment/monitoring, building self-serve tooling, defining configuration best practices for inference engines, leading incident response, and mentoring team members. Requires 5+ years operating production ML inference or training systems at scale and 2+ years in senior IC or tech lead roles, with deep expertise in Kubernetes, multi-cluster orchestration, and ML serving frameworks.

Serve · Post-train

Engineering

San Francisco, CA

Mar 5

LLM Inference Frameworks and Optimization Engineer

Inference Frameworks and Optimization Engineer role to design, develop, and optimize distributed inference engines for multimodal and language models. The focus is on low-latency, high-throughput inference, GPU/accelerator optimizations, and software-hardware co-design for efficient large-scale AI deployment.

Serve

Engineering

Remote

Mar '25

Machine Learning Engineer

Machine Learning Engineer at Together AI focused on developing and scaling production systems for LLM inference and fine-tuning APIs. Requires strong experience in high-performance, distributed systems and the LLM inference ecosystem.

Serve · Post-train

Engineering

San Francisco, CA

Jan '25

Machine Learning Engineer - Inference

Machine Learning Engineer focused on optimizing and enhancing the performance of AI inference systems, working with state-of-the-art large language models to ensure efficient and effective operation at scale. Responsibilities include designing and building production systems, optimizing runtime inference services, and creating supporting tools and documentation.

Serve

Engineering

San Francisco, CA

Jun '24

Lead/Manager Together Cloud Infrastructure

Lead/Manager for Together Cloud Infrastructure in Amsterdam, focusing on building and managing a team to develop and operate a global, high-performance cloud platform for AI workloads, including GPU scheduling, management plane, and customer-facing services.

Serve

Engineering

Amsterdam, Netherlands

1w ago

Staff Platform Engineer, Voice AI

Staff Platform Engineer for Together AI's Voice AI platform, focusing on the architecture and reliability of real-time API layers, autoscaling for latency-sensitive workloads, and building the observability platform for voice infrastructure. The role requires deep expertise in distributed systems, real-time streaming, and Kubernetes, with a strong product intuition for developer platforms.

Serve

Engineering

San Francisco, CA

3w ago

AI Infrastructure Engineer

AI Infrastructure Engineer responsible for keeping user-facing services and production systems running smoothly, specializing in systems, availability, reliability, and scalability, with interests in algorithms and distributed systems. Builds and runs infrastructure using Ansible, Terraform, and Kubernetes, and develops monitoring systems.

Serve

Engineering

San Francisco, CA

4w ago

Senior Platform Engineer, Voice AI

Senior Platform Engineer for Together AI's Voice AI platform, focusing on the API and infrastructure layer for real-time speech-to-text and text-to-speech models. The role involves building WebSocket and HTTP APIs, designing autoscaling for latency-sensitive streaming, and ensuring platform reliability for production voice agents.

Serve

Engineering

San Francisco, CA

Mar 30

Backend Engineer

Senior Backend/Distributed Systems Engineer to build and maintain the Together AI Sandbox service, focusing on API platform performance, reliability, and scalability. Responsibilities include designing core backend components, performing research for AI workloads, and ensuring code quality through design and code reviews.

Serve

Engineering

Amsterdam, Netherlands

Mar 10

Together Cloud Infrastructure Engineer

This role focuses on building and maintaining the AI cloud infrastructure, including services for hardware management, an IaaS software layer for GPU data centers, high-performance object storage for pretraining, and advanced observability stacks. The engineer will work on the core Together AI platform, create services and tools, and develop testing frameworks for robustness and fault tolerance.

Serve · Data

Engineering

Amsterdam, Netherlands

Jan 20

Staff Engineer, Distributed Storage, HPC & AI Infrastructure

Staff Engineer focused on designing and delivering multi-petabyte distributed storage systems optimized for AI training and inference workloads. Responsibilities include architecting high-performance parallel filesystems and object stores, integrating cutting-edge technologies, driving cost optimization, and building Kubernetes-native storage operators and self-service platforms. The role requires deep expertise in distributed storage, Kubernetes, and performance optimization for GPU/HPC clusters, with strong coding skills in Go and Python.

Serve

Engineering

Amsterdam, Netherlands

Jan 20

Solutions Architect

Solutions Architect at Together AI, a research-driven AI company focused on lowering the cost of AI systems. This role involves working with customers to build Generative AI applications using open-source models, acting as a technical advisor, running demos and POCs, and collaborating with sales. Requires strong technical background in AI/ML, GPU technologies, Python/JavaScript, and familiarity with infrastructure services. The role contributes to product feedback and educational content creation.

Serve

Engineering

London, United Kingdom

Oct '25

Senior Backend Engineer, Inference Platform

Senior Backend Engineer focused on building and optimizing the inference platform for advanced generative AI models, including LLMs and multimodal models, at scale. The role involves optimizing latency, throughput, and resource allocation across tens of thousands of GPUs, collaborating with researchers to productionize frontier models, and contributing to open-source inference projects.

Serve

Engineering

San Francisco, CA

Aug '25

Machine Learning, Platform Engineer

Machine Learning Platform Engineer at Together AI, focusing on building a container platform, optimizing autoscaling, minimizing cold starts, and improving end-to-end model performance for custom models and dedicated inference. The role involves optimizing inference across the stack, including CUDA kernels, PyTorch, inference engines, and container orchestration.

Serve

Engineering

San Francisco, CA

Aug '25

AI Infrastructure Engineer

AI Infrastructure Engineer responsible for keeping user-facing services and production systems running smoothly, applying engineering principles and automation to operating environments. Focuses on systems, availability, reliability, and scalability, with interests in algorithms and distributed systems. Builds and runs infrastructure using Ansible, Terraform, and Kubernetes, and designs monitoring systems.

Serve

Engineering

San Francisco, CA

Jun '25

Senior Software Engineer - Together Cloud Infrastructure

Senior Software Engineer focused on building and operating a high-performance, global AI cloud infrastructure platform. This includes designing and maintaining backend services for hardware management, an IaaS software layer for GPU data centers, high-performance object storage for pretraining datasets, and advanced observability stacks for distributed pretraining. The role also involves architecture and research for decentralized AI workloads and contributing to the open-source platform.

Serve · Data

Engineering

San Francisco, CA

Jun '25

Solutions Architect

Solutions Architect at Together AI to work with customers and prospects to create business value through Generative AI applications. This role involves acting as a technical advisor, running demonstrations and POCs, collaborating with sales, building relationships with customer leadership, delivering feedback to product/engineering/research, and building educational content. Requires 5+ years in a customer-facing technical role with 2+ years in pre-sales, strong technical background in AI/ML/GPU, understanding of LLM training/fine-tuning/inference, Python/JavaScript proficiency, and familiarity with infrastructure services.

Serve

Engineering

San Francisco, CA

Jan '25

Manager, Infrastructure Strategy & Operations

This role focuses on the strategy, operations, and analytical backbone for scaling compute infrastructure at an AI-native cloud company. It involves research, benchmarking, and decision frameworks for sourcing, evaluating, and deploying compute, with a focus on market intelligence, site comparisons, and operational analysis. Responsibilities include building dashboards for visibility into costs and utilization, developing comparison frameworks for sourcing decisions, and evaluating data center sites and energy options. The role requires strong quantitative skills, experience in high-growth startups or AI companies, and familiarity with AI productivity tools.

—

Engineering

San Francisco, CA

1w ago

Customer Support Engineer (Inference)

Customer Support Engineer role focused on supporting customers with Together AI's inference and fine-tuning services, GPU clusters, and Gen AI solutions. The role involves resolving complex technical challenges, acting as a product expert, collaborating with engineering and product teams, and transforming customer insights into product roadmap improvements. Requires strong technical background in AI, ML, GPU technologies, HPC environments, and familiarity with infrastructure services and Python.

Serve · Post-train

Engineering

San Francisco, CA

2w ago

Senior Technical Recruiter, AI/ML Research

Senior Technical Recruiter for Together AI, a company building an AI Native Cloud. The role focuses on scaling world-class AI research and engineering teams by partnering with leadership, leading full-cycle recruiting for specialized AI talent, and providing market intelligence.

—

Engineering

San Francisco, CA

2w ago

Engineering Manager, Site Reliability Engineering

Engineering Manager for Site Reliability Engineering (SRE) to lead a team of ~10 engineers responsible for Together AI's production infrastructure, including bare-metal GPU compute, public-cloud Kubernetes for inference, and Kubernetes with virtualization for virtual clusters. The role involves a mix of management (50-60%) and hands-on technical work (40-50%), focusing on shifting the team from reactive, manual operations to systemic, automation-first work, improving incident response, and developing engineers.

—

Engineering

San Francisco, CA

2w ago

Junior Technical Program Manager — Infrastructure Operations

This role focuses on the operational management of a large GPU fleet, ensuring nodes are online, GPUs are performing, and datacenter transitions are smooth. It involves owning the end-to-end node lifecycle, driving remediation, managing project timelines for new datacenter bring-ups, diagnosing utilization loss, and building dashboards for visibility and accountability. The environment is fast-paced and requires figuring things out alongside engineers building at the frontier.

—

Engineering

San Francisco, CA

3w ago

Staff Engineer, Customer Insights

Staff Engineer to build and scale the customer-facing visibility layer for Together's AI Cloud, focusing on historical analytics, activity history, audit logs, event timelines, notifications, and investigation workflows. The role will evolve these foundations into AI-first investigation and insight workflows that summarize activity, explain anomalies, and provide trustworthy context for human operators and autonomous agents. This is a hands-on role designing event, query, delivery, and governance systems, and building user-facing workflows for enterprise customers.

—

Engineering

San Francisco, CA

5w ago

Technical Account Manager (TAM), AI Factory

This role is a Technical Account Manager focused on the infrastructure supporting large-scale AI GPU deployments for a strategic enterprise customer. The TAM will be the primary technical point of contact, responsible for the end-to-end technical relationship across compute, networking, storage, and facilities, ensuring smooth delivery and operational health. Responsibilities include issue lifecycle management, hardware lifecycle management, advising on infrastructure stack best practices, owning the observability strategy, coordinating operations, and managing capacity expansions. The role requires deep expertise in GPU infrastructure, large-scale networking, enterprise storage, and DC operations, with experience in customer-facing technical roles and AI/HPC infrastructure.

—

Engineering

San Francisco, CA

6w ago

Director, Support Engineering

This role leads and scales the customer support function for Together AI, focusing on both API support (serverless/dedicated inference, billing) and GPU support (large-scale training infrastructure). It's a player-coach position requiring hands-on involvement in complex escalations, managing support engineers, defining KPIs, and improving support workflows and tooling. The role requires strong technical depth in AI infrastructure, distributed systems, and experience with SLA-driven operations.

—

Engineering

San Francisco, CA

6w ago

Customer Support Engineer (GPU Cluster)

Customer Support Engineer role focused on supporting customers using Together AI's GPU clusters for training, fine-tuning, and inference. The role involves resolving complex technical challenges, acting as a product expert, and collaborating with Engineering and Product teams. Requires experience in customer-facing technical roles, familiarity with AI/ML, GPU technologies, and infrastructure services like Kubernetes.

—

Engineering

San Francisco, CA

Apr 7

Sr. Partnerships Manager, Model Ecosystem

This role is responsible for building and managing the model ecosystem for Together AI, focusing on negotiating deals with model builders to bring proprietary and open-source models onto the platform. It involves working closely with Product, Finance, and Marketing to ensure the model roadmap is technically superior, commercially viable, and market-facing. The role requires strong deal-making, technical curiosity, and experience in business development or strategic partnerships within developer platforms.

—

Product

San Francisco, CA

Apr 7

Backend Software Engineer — Data Platform & AI Data Products

Backend Software Engineer focused on building data platform infrastructure and LLM-adjacent data products. The role involves designing and developing backend services for event streams, access layers, and APIs, as well as creating services for prompt categorization, enrichment, and metadata. The engineer will apply an AI-augmentation mindset to their own development and to the systems they build, with a focus on production backend systems, distributed systems, and data modeling.

Serve

Engineering

San Francisco, CA

Mar 11

Customer Support Engineer (Inference), India

Customer Support Engineer role at Together AI, focusing on supporting customers with their training, fine-tuning, and inference solutions. The role involves deep technical problem-solving on GPU clusters and AI services, acting as a product expert and a liaison between customers and internal engineering/product teams. Requires strong technical background in AI, ML, and HPC, with experience in customer-facing technical support.

Serve · Post-train

Engineering

Remote

Mar 10

Engineering Manager / Tech Lead

Engineering Manager / Tech Lead for the Sandbox team, responsible for building and operating isolated, secure compute environments for AI code execution, including reinforcement learning workflows, LLM code interpreters, and AI agents. This role involves technical leadership, people management, hiring, and collaborating with product and other engineering teams. The team builds sandbox infrastructure, SDKs, platform integrations, and developer tooling.

—

Engineering

Amsterdam, Netherlands

Feb 27

Lead Product Designer

Lead Product Designer to craft user experiences for technical AI development tools, shape AI development, and establish design standards for a growing organization. This role involves leading UX initiatives, elevating design quality, and collaborating with Engineering, Product, and Marketing.

—

Product

San Francisco, CA

Feb 26

Product Marketing Director

Product Marketing Director at Together AI, a frontier AI cloud company. This role will own platform and product value propositions, GTM strategy, product launches, and messaging. The role involves leading and scaling the PMM function, partnering with Product Management, Sales, and Engineering. Requires 10+ years of PMM experience in enterprise software (preferably AI/Cloud) and 5+ years in team leadership. The company has seen significant growth and is research-driven, contributing to open-source AI advancements.

—

Product

San Francisco, CA

Nov '25

Customer Support Engineer (GPU Cluster), India

Customer Support Engineer for GPU Clusters at Together AI, focusing on resolving technical challenges for customers building training, fine-tuning, and inference solutions. The role involves being a product expert, collaborating with engineering and product teams, and transforming customer insights into product improvements. Requires experience in customer-facing technical roles, AI/ML/GPU technologies, and infrastructure services like Kubernetes.

—

Engineering

Remote

Aug '25

Senior Software Engineer - Together Cloud Platform

Senior Backend Engineer role focused on building and scaling the AI Acceleration Cloud platform, which virtualizes ML hardware and provides self-serve AI cloud services for ML practitioners. Responsibilities include developing distributed GPU scheduling systems, global management planes, and customer-facing cloud platform services, ensuring high availability and performance.

—

Engineering

San Francisco, CA

Jun '25

AI infrastructure Engineer (SRE) Amsterdam

AI infrastructure Engineer (SRE) responsible for keeping user-facing services and production systems running smoothly, specializing in systems, availability, reliability, and scalability. The role involves building and running infrastructure with Ansible, Terraform, and Kubernetes, implementing monitoring and observability, and debugging production issues.

—

Engineering

Europe

Apr '25

Sr. Technical Program Manager (TPM)

This role is for a Senior Technical Program Manager (TPM) at an AI infrastructure company. The TPM will focus on building, optimizing, and scaling global GPU resources, ensuring efficient and reliable operation of the AI model backbone. Responsibilities include product development for AI researchers and customers, owning the product roadmap, stakeholder engagement, and cross-functional execution across Research, Engineering, DevOps, SRE, and Go-to-Market teams. Requires 5+ years of experience in AI/ML product or infrastructure, with a technical background.

—

Product

San Francisco, CA

Feb '25

Strategic Finance Senior Manager

Strategic Finance Senior Manager at Together AI, a research-driven AI infrastructure company. This role focuses on providing financial insights, driving strategic decision-making, and optimizing business performance, with a significant emphasis on guiding the optimization and scaling of the company's compute infrastructure. The position requires strong financial modeling, business judgment, and the ability to partner with various functions like Engineering, Product, and GTM.

—

Product

San Francisco, CA

Jan '25

Infrastructure Vendor Ops Manager

This role focuses on managing vendor operations for GPU infrastructure, including SLA tracking, credit recovery, invoice auditing, and financial accountability. It requires strong attention to detail, technical fluency to understand incident reports for credit claims, and negotiation skills with providers. The role also involves process development and financial forecasting for infrastructure budgets.

—

Engineering

San Francisco, CA

4w ago

Infrastructure Design Engineer

This role focuses on the physical infrastructure design of data centers that house AI GPU clusters. Responsibilities include designing whitespace layouts, power distribution, cooling, and structured cabling to support high-density AI hardware. The role requires expertise in data center design, critical facilities engineering, and collaboration with various engineering and operational teams.

—

Engineering

San Francisco, CA

4w ago

Sr. Revenue Accountant

This role is for a Sr. Revenue Accountant responsible for end-to-end accounts receivable, invoicing, collections, and revenue recognition in compliance with ASC 606. The candidate will partner with various teams to ensure accurate revenue recording and compliance with US GAAP. Responsibilities include journal entries, reconciliations, AR process ownership, billing dispute resolution, and process improvement. Requirements include a CPA, 5+ years of accounting experience with a focus on revenue accounting/AR/O2C, and strong technical knowledge of US GAAP.

—

Product

San Francisco, CA

4w ago

Infrastructure Accounting Manager

This role focuses on building and managing accounting processes for AI infrastructure assets, including fixed assets, CIP, and leases. It requires strong technical accounting expertise and collaboration with engineering and operations teams to ensure accurate financial reporting and compliance.

—

Product

San Francisco, CA

4w ago