Currently tracking 20 active AI roles, with 14 new openings in the last 4 weeks. Primary focus: Serve · Engineering. Salary range $160k–$300k (avg $226k).
Together AI currently has 24 active AI-related roles in our index. The most common open titles are: Solutions Architect (2), AI Infrastructure Engineer, AI Researcher, Core ML (Turbo), Backend Software Engineer — Data Platform & AI Data Products, Customer Support Engineer (Inference). Most positions are in Engineering and Research.
Together AI's active AI hiring is concentrated in: serving infrastructure (96%), post-training (4%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Together AI is hiring AI talent in: United States (19 roles), Netherlands (2 roles), United Kingdom (1 role).
Job postings at Together AI most frequently reference: inference infra, model serving, fine tuning, llm observability, audio speech.
In the past 30 days, Together AI has posted 6 new AI-related roles.
Together AI currently has 22 active AI-related job listings. The majority of these roles, 95%, are focused on serving infrastructure, with one role in post-training. The company is primarily hiring for Engineering positions, with 20 roles available, and is seeking candidates in the United States and the Netherlands. Frequent technical tags include model serving, inference infrastructure, and fine-tuning, suggesting a focus on deployment and optimization of AI models. In the last 30 days, Together AI added 5 new AI roles, a 150% increase from the previous 30-day period.
| Title | Stage | AI score |
|---|---|---|
| Research Engineer, Core ML Research Engineer role focused on improving inference efficiency and unifying it with RL/post-training systems for production-grade AI APIs. The role involves end-to-end ownership of critical systems, translating frontier ideas into robust infrastructure, and shipping measurable improvements in latency, throughput, cost, and model quality at scale. | ServePost-train | 10 |
| Staff Machine Learning Engineer, Voice AI Staff ML Engineer focused on optimizing the model serving layer for voice AI applications, including speech-to-text and text-to-speech models, with a focus on latency, throughput, and GPU utilization using inference engines like TRT-LLM and SGLang. The role involves building evaluation frameworks, supporting model partners, and shaping the architecture for next-generation voice models. | Serve | 9 |
| Forward Deployed Engineer (Inference & Post-Training) Forward Deployed Engineer focused on optimizing inference engines and fine-tuning pipelines for production AI teams, acting as a technical partner to strategic customers. Responsibilities include inference engine optimization, performance tuning, post-training/fine-tuning (LoRA, SFT, DPO, RLHF, GRPO), customer alignment, onboarding, and providing product feedback. | ServePost-train | 9 |
| Senior Machine Learning Engineer, Voice AI Senior ML Engineer focused on optimizing the model serving layer for voice AI workloads, including speech-to-text and text-to-speech models. The role involves hands-on work with inference engines, GPU optimization, batching strategies, and ensuring new model architectures can be productionized efficiently. The goal is to achieve best-in-class latency and reliability for real-time voice applications. | Serve | 9 |
| Research Engineer, Frontier Speculative Decoding Research Engineer focused on translating internal model training research into production-ready deployments by fine-tuning general-purpose models into specialized tools. This involves designing novel speculative algorithms, data curation, hyperparameter tuning, and checkpoint evaluation, with a focus on accuracy-efficiency tradeoffs for generative AI models. | Post-trainServe | 9 |
| Systems Research Engineer, GPU Programming This role focuses on optimizing and developing GPU-accelerated kernels and algorithms for ML/AI applications, requiring expertise in GPU programming (CUDA, Triton) and performance profiling. The engineer will collaborate with modeling, hardware, and software teams to enhance AI system efficiency and co-design GPU architectures. | Serve | 9 |
| AI Researcher, Core ML (Turbo) AI Researcher focused on the intersection of efficient inference algorithms, architectures, engines, and post-training/RL systems for production-scale API services. The role involves advancing inference efficiency, unifying inference with RL/post-training, and owning critical systems. | ServePost-train | 9 |
| Staff Engineer, Distributed Storage and HPC & AI Infrastructure Staff Engineer focused on designing and delivering multi-petabyte storage systems optimized for AI training and inference workloads. Responsibilities include architecting high-performance parallel filesystems and object stores, building Kubernetes-native storage operators, optimizing data paths for high throughput, and implementing intelligent caching and data distribution strategies. The role requires deep expertise in distributed storage systems, Kubernetes, and programming in Go and Python. | Serve | 8 |
| Forward Deployed Engineer (GPU Clusters) The Forward Deployed Engineer (FDE) will be a technical partner to customers building large-scale AI models, focusing on GPU cluster infrastructure, networking, storage, and orchestration to ensure stability, optimize performance, and facilitate platform adoption. This role involves hardening clusters, tuning orchestration layers (Kubernetes/SLURM), debugging low-level bottlenecks, building reference designs, and leading benchmarking exercises. | Serve | 8 |
| Engineering Manager, Model Serving Engineering Manager for Together AI's Model Serving platform, focusing on delivering world-class inference and fine-tuning in public APIs and customer deployments. Responsibilities include owning SLAs, improving testing/deployment/monitoring, building self-serve tooling, defining configuration best practices for inference engines, leading incident response, and mentoring team members. Requires 5+ years operating production ML inference or training systems at scale and 2+ years in senior IC or tech lead roles, with deep expertise in Kubernetes, multi-cluster orchestration, and ML serving frameworks. | ServePost-train | 8 |
| LLM Inference Frameworks and Optimization Engineer Seeking an Inference Frameworks and Optimization Engineer to design, develop, and optimize distributed inference engines for multimodal and language models. Focus on low-latency, high-throughput inference, GPU/accelerator optimizations, and software-hardware co-design for efficient large-scale AI deployment. | Serve | 8 |
| Machine Learning Engineer Machine Learning Engineer at Together AI focused on developing and scaling production systems for LLM inference and fine-tuning APIs. Requires strong experience in high-performance, distributed systems and the LLM inference ecosystem. | ServePost-train | 8 |
| Machine Learning Engineer - Inference Machine Learning Engineer focused on optimizing and enhancing the performance of AI inference systems, working with state-of-the-art large language models to ensure efficient and effective operation at scale. Responsibilities include designing and building production systems, optimizing runtime inference services, and creating supporting tools and documentation. | Serve | 8 |
| Lead/Manager Together Cloud Infrastructure Lead/Manager for Together Cloud Infrastructure in Amsterdam, focusing on building and managing a team to develop and operate a global, high-performance cloud platform for AI workloads, including GPU scheduling, management plane, and customer-facing services. | Serve | 7 |
| Staff Platform Engineer, Voice AI Staff Platform Engineer for Together AI's Voice AI platform, focusing on the architecture and reliability of real-time API layers, autoscaling for latency-sensitive workloads, and building the observability platform for voice infrastructure. The role requires deep expertise in distributed systems, real-time streaming, and Kubernetes, with a strong product intuition for developer platforms. | Serve | 7 |
| AI Infrastructure Engineer AI Infrastructure Engineer responsible for keeping user-facing services and production systems running smoothly, specializing in systems, availability, reliability, and scalability, with interests in algorithms and distributed systems. Builds and runs infrastructure using Ansible, Terraform, and Kubernetes, and develops monitoring systems. | Serve | 7 |
| Senior Platform Engineer, Voice AI Senior Platform Engineer for Together AI's Voice AI platform, focusing on the API and infrastructure layer for real-time speech-to-text and text-to-speech models. The role involves building WebSocket and HTTP APIs, designing autoscaling for latency-sensitive streaming, and ensuring platform reliability for production voice agents. | Serve | 7 |
| Backend Engineer Senior Backend/Distributed Systems Engineer to build and maintain the Together AI Sandbox service, focusing on API platform performance, reliability, and scalability. Responsibilities include designing core backend components, performing research for AI workloads, and ensuring code quality through design and code reviews. | Serve | 7 |
| Together Cloud Infrastructure Engineer This role focuses on building and maintaining the AI cloud infrastructure, including services for hardware management, IaaS software layer for GPU data centers, high-performance object storage for pretraining, and advanced observability stacks. The engineer will work on the core Together AI platform, create services and tools, and develop testing frameworks for robustness and fault-tolerance. | ServeData | 7 |
| Staff Engineer, Distributed Storage,HPC & AI Infrastructure Staff Engineer focused on designing and delivering multi-petabyte distributed storage systems optimized for AI training and inference workloads. Responsibilities include architecting high-performance parallel filesystems and object stores, integrating cutting-edge technologies, driving cost optimization, and building Kubernetes-native storage operators and self-service platforms. The role requires deep expertise in distributed storage, Kubernetes, and performance optimization for GPU/HPC clusters, with strong coding skills in Go and Python. | Serve | 7 |
| Solutions Architect Solutions Architect at Together AI, a research-driven AI company focused on lowering the cost of AI systems. This role involves working with customers to build Generative AI applications using open-source models, acting as a technical advisor, running demos and POCs, and collaborating with sales. Requires strong technical background in AI/ML, GPU technologies, Python/JavaScript, and familiarity with infrastructure services. The role contributes to product feedback and educational content creation. | Serve | 7 |
| Senior Backend Engineer, Inference Platform Senior Backend Engineer focused on building and optimizing the inference platform for advanced generative AI models, including LLMs and multimodal models, at scale. The role involves optimizing latency, throughput, and resource allocation across tens of thousands of GPUs, collaborating with researchers to productionize frontier models, and contributing to open-source inference projects. | Serve | 7 |
| Machine Learning, Platform Engineer Machine Learning Platform Engineer at Together AI, focusing on building a container platform, optimizing autoscaling, minimizing cold starts, and improving end-to-end model performance for custom models and dedicated inference. The role involves optimizing inference across the stack, including CUDA kernels, PyTorch, inference engines, and container orchestration. | Serve | 7 |
| AI Infrastructure Engineer AI Infrastructure Engineer responsible for keeping user-facing services and production systems running smoothly, applying engineering principles and automation to operating environments. Focuses on systems, availability, reliability, and scalability, with interests in algorithms and distributed systems. Builds and runs infrastructure using Ansible, Terraform, and Kubernetes, and designs monitoring systems. | Serve | 7 |
| Senior Software Engineer - Together Cloud Infrastructure Senior Software Engineer focused on building and operating a high-performance, global AI cloud infrastructure platform. This includes designing and maintaining backend services for hardware management, IaaS software layer for GPU data centers, high-performance object storage for pretraining datasets, and advanced observability stacks for distributed pretraining. The role also involves architecture and research for decentralized AI workloads and contributing to the open-source platform. | ServeData | 7 |
| Solutions Architect Solutions Architect at Together AI to work with customers and prospects to create business value through Generative AI applications. This role involves acting as a technical advisor, running demonstrations and POCs, collaborating with sales, building relationships with customer leadership, delivering feedback to product/engineering/research, and building educational content. Requires 5+ years in a customer-facing technical role with 2+ years in pre-sales, strong technical background in AI/ML/GPU, understanding of LLM training/fine-tuning/inference, Python/JavaScript proficiency, and familiarity with infrastructure services. | Serve | 7 |
| Manager, Infrastructure Strategy & Operations This role focuses on the strategy, operations, and analytical backbone for scaling compute infrastructure at an AI-native cloud company. It involves research, benchmarking, and decision frameworks for sourcing, evaluating, and deploying compute, with a focus on market intelligence, site comparisons, and operational analysis. Responsibilities include building dashboards for visibility into costs and utilization, developing comparison frameworks for sourcing decisions, and evaluating data center sites and energy options. The role requires strong quantitative skills, experience in high-growth startups or AI companies, and familiarity with AI productivity tools. | — | 5 |
| Customer Support Engineer (Inference) Customer Support Engineer role focused on supporting customers with Together AI's inference and fine-tuning services, GPU clusters, and Gen AI solutions. The role involves resolving complex technical challenges, acting as a product expert, collaborating with engineering and product teams, and transforming customer insights into product roadmap improvements. Requires strong technical background in AI, ML, GPU technologies, HPC environments, and familiarity with infrastructure services and Python. | ServePost-train | 5 |
| Senior Technical Recruiter, AI/ML Research Senior Technical Recruiter for Together AI, a company building an AI Native Cloud. The role focuses on scaling world-class AI research and engineering teams by partnering with leadership, leading full-cycle recruiting for specialized AI talent, and providing market intelligence. | — | 5 |
| Engineering Manager, Site Reliability Engineering Engineering Manager for Site Reliability Engineering (SRE) to lead a team of ~10 engineers responsible for Together AI's production infrastructure, including bare-metal GPU compute, public-cloud Kubernetes for inference, and Kubernetes with virtualization for virtual clusters. The role involves a mix of management (50-60%) and hands-on technical work (40-50%), focusing on shifting the team from reactive, manual operations to systemic, automation-first work, improving incident response, and developing engineers. | — | 5 |
| Junior Technical Program Manager — Infrastructure Operations This role focuses on the operational management of a large GPU fleet, ensuring nodes are online, GPUs are performing, and datacenter transitions are smooth. It involves owning the end-to-end node lifecycle, driving remediation, managing project timelines for new datacenter bring-ups, diagnosing utilization loss, and building dashboards for visibility and accountability. The environment is fast-paced and requires figuring things out alongside engineers building at the frontier. | — | 5 |
| Staff Engineer, Customer Insights Staff Engineer to build and scale the customer-facing visibility layer for Together's AI Cloud, focusing on historical analytics, activity history, audit logs, event timelines, notifications, and investigation workflows. The role will evolve these foundations into AI-first investigation and insight workflows that summarize activity, explain anomalies, and provide trustworthy context for human operators and autonomous agents. This is a hands-on role designing event, query, delivery, and governance systems, and building user-facing workflows for enterprise customers. | — | 5 |
| Technical Account Manager (TAM), AI Factory This role is a Technical Account Manager focused on the infrastructure supporting large-scale AI GPU deployments for a strategic enterprise customer. The TAM will be the primary technical point of contact, responsible for the end-to-end technical relationship across compute, networking, storage, and facilities, ensuring smooth delivery and operational health. Responsibilities include issue lifecycle management, hardware lifecycle management, advising on infrastructure stack best practices, owning the observability strategy, coordinating operations, and managing capacity expansions. The role requires deep expertise in GPU infrastructure, large-scale networking, enterprise storage, and DC operations, with experience in customer-facing technical roles and AI/HPC infrastructure. | — | 5 |
| Director, Support Engineering This role leads and scales the customer support function for Together AI, focusing on both API support (serverless/dedicated inference, billing) and GPU support (large-scale training infrastructure). It's a player-coach position requiring hands-on involvement in complex escalations, managing support engineers, defining KPIs, and improving support workflows and tooling. The role requires strong technical depth in AI infrastructure, distributed systems, and experience with SLA-driven operations. | — | 5 |
| Customer Support Engineer (GPU Cluster) Customer Support Engineer role focused on supporting customers using Together AI's GPU clusters for training, fine-tuning, and inference. The role involves resolving complex technical challenges, acting as a product expert, and collaborating with Engineering and Product teams. Requires experience in customer-facing technical roles, familiarity with AI/ML, GPU technologies, and infrastructure services like Kubernetes. | — | 5 |
| Sr. Partnerships Manager, Model Ecosystem This role is responsible for building and managing the model ecosystem for Together AI, focusing on negotiating deals with model builders to bring proprietary and open-source models onto the platform. It involves working closely with Product, Finance, and Marketing to ensure the model roadmap is technically superior, commercially viable, and market-facing. The role requires strong deal-making, technical curiosity, and experience in business development or strategic partnerships within developer platforms. | — | 5 |
| Backend Software Engineer — Data Platform & AI Data Products Backend Software Engineer focused on building data platform infrastructure and LLM-adjacent data products. The role involves designing and developing backend services for event streams, access layers, and APIs, as well as creating services for prompt categorization, enrichment, and metadata. The engineer will apply AI augmentation mindset to their own development and the systems they build, with a focus on production backend systems, distributed systems, and data modeling. | Serve | 5 |
| Customer Support Engineer (Inference), India Customer Support Engineer role at Together AI, focusing on supporting customers with their training, fine-tuning, and inference solutions. The role involves deep technical problem-solving on GPU clusters and AI services, acting as a product expert and a liaison between customers and internal engineering/product teams. Requires strong technical background in AI, ML, and HPC, with experience in customer-facing technical support. | ServePost-train | 5 |
| Engineering Manager / Tech Lead Engineering Manager / Tech Lead for the Sandbox team, responsible for building and operating isolated, secure compute environments for AI code execution, including reinforcement learning workflows, LLM code interpreters, and AI agents. This role involves technical leadership, people management, hiring, and collaborating with product and other engineering teams. The team builds sandbox infrastructure, SDKs, platform integrations, and developer tooling. | — | 5 |
| Lead Product Designer Lead Product Designer to craft user experiences for technical AI development tools, shape AI development, and establish design standards for a growing organization. This role involves leading UX initiatives, elevating design quality, and collaborating with Engineering, Product, and Marketing. | — | 5 |
| Product Marketing Director Product Marketing Director at Together AI, a frontier AI cloud company. This role will own platform and product value propositions, GTM strategy, product launches, and messaging. The role involves leading and scaling the PMM function, partnering with Product Management, Sales, and Engineering. Requires 10+ years of PMM experience in enterprise software (preferably AI/Cloud) and 5+ years in team leadership. The company has seen significant growth and is research-driven, contributing to open-source AI advancements. | — | 5 |
| Customer Support Engineer (GPU Cluster), India Customer Support Engineer for GPU Clusters at Together AI, focusing on resolving technical challenges for customers building training, fine-tuning, and inference solutions. The role involves being a product expert, collaborating with engineering and product teams, and transforming customer insights into product improvements. Requires experience in customer-facing technical roles, AI/ML/GPU technologies, and infrastructure services like Kubernetes. | — | 5 |
| Senior Software Engineer - Together Cloud Platform Senior Backend Engineer role focused on building and scaling the AI Acceleration Cloud platform, which virtualizes ML hardware and provides self-serve AI cloud services for ML practitioners. Responsibilities include developing distributed GPU scheduling systems, global management planes, and customer-facing cloud platform services, ensuring high availability and performance. | — | 5 |
| AI infrastructure Engineer (SRE) Amsterdam AI infrastructure Engineer (SRE) responsible for keeping user-facing services and production systems running smoothly, specializing in systems, availability, reliability, and scalability. The role involves building and running infrastructure with Ansible, Terraform, and Kubernetes, implementing monitoring and observability, and debugging production issues. | — | 5 |
| Sr. Technical Program Manager (TPM) This role is for a Senior Technical Program Manager (TPM) at an AI infrastructure company. The TPM will focus on building, optimizing, and scaling global GPU resources, ensuring efficient and reliable operation of the AI model backbone. Responsibilities include product development for AI researchers and customers, owning the product roadmap, stakeholder engagement, and cross-functional execution across Research, Engineering, DevOps, SRE, and Go-to-Market teams. Requires 5+ years of experience in AI/ML product or infrastructure, with a technical background. | — | 5 |
| Strategic Finance Senior Manager Strategic Finance Senior Manager at Together AI, a research-driven AI infrastructure company. This role focuses on providing financial insights, driving strategic decision-making, and optimizing business performance, with a significant emphasis on guiding the optimization and scaling of the company's compute infrastructure. The position requires strong financial modeling, business judgment, and the ability to partner with various functions like Engineering, Product, and GTM. | — | 5 |
| Infrastructure Vendor Ops Manager This role focuses on managing vendor operations for GPU infrastructure, including SLA tracking, credit recovery, invoice auditing, and financial accountability. It requires strong attention to detail, technical fluency to understand incident reports for credit claims, and negotiation skills with providers. The role also involves process development and financial forecasting for infrastructure budgets. | — | 0 |
| Infrastructure Design Engineer This role focuses on the physical infrastructure design of data centers that house AI GPU clusters. Responsibilities include designing whitespace layouts, power distribution, cooling, and structured cabling to support high-density AI hardware. The role requires expertise in data center design, critical facilities engineering, and collaboration with various engineering and operational teams. | — | 0 |
| Sr. Revenue Accountant This role is for a Sr. Revenue Accountant responsible for end-to-end accounts receivable, invoicing, collections, and revenue recognition in compliance with ASC 606. The candidate will partner with various teams to ensure accurate revenue recording and compliance with US GAAP. Responsibilities include journal entries, reconciliations, AR process ownership, billing dispute resolution, and process improvement. Requirements include a CPA, 5+ years of accounting experience with a focus on revenue accounting/AR/O2C, and strong technical knowledge of US GAAP. | — | 0 |
| Infrastructure Accounting Manager This role focuses on building and managing accounting processes for AI infrastructure assets, including fixed assets, CIP, and leases. It requires strong technical accounting expertise and collaboration with engineering and operations teams to ensure accurate financial reporting and compliance. | — | 0 |