Data AI · ML experiment tracking
Currently tracking 20 active AI roles, up 25% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $92k–$341k (avg $209k).
| Title | Stage | AI score |
|---|---|---|
| VP of Product, Research and Training Infrastructure VP of Product for Research and Training Infrastructure at an AI cloud provider. This role owns the product strategy and engineering execution for services powering AI research labs, focusing on specialized orchestration, evaluation, and iteration tools for massive-scale pre-training and post-training. Key responsibilities include evolving orchestration tools (SUNK), developing automated training-based evaluation frameworks, and building infrastructure for RL/RLHF pipelines. Requires deep knowledge of HPC, distributed training, and supporting frontier model research. | Pretrain · Post-train | 9 |
| Staff AI Security Engineer Staff AI Security Engineer to define and operationalize security across CoreWeave's AI ecosystem, focusing on secure-by-default foundations for AI development, agentic workflows, and enterprise AI adoption. The role involves building secure infrastructure, developing AI security policies, implementing guardrails for agentic systems, leading secure adoption of AI tools, and conducting adversarial testing. | Agent · Serve | 8 |
| AI Solutions Engineer, Pre-Sales - W&B AI Solutions Engineer focused on helping customers design, deploy, and scale ML and GenAI systems using Weights & Biases and CoreWeave's AI cloud. This role involves technical depth, customer engagement, architecting solutions for distributed training, RAG, agents, fine-tuning, and inference. | Agent · Serve | 8 |
| Principal Engineer - Perf and Benchmarking Principal Engineer role focused on leading the Benchmarking & Performance team at CoreWeave, a cloud provider for AI. The role involves defining strategy, leading end-to-end MLPerf submissions (Training & Inference), designing and implementing a Kubernetes-native benchmarking service for latency and throughput, and building CI/CD pipelines for scale. It requires deep expertise in distributed systems, GPU performance, model-serving stacks, and Kubernetes, with a focus on achieving industry-leading performance data and publications. | Serve · Eval Gate | 8 |
| Staff Software Engineer, Inference Staff Software Engineer on the Inference Platform Team at CoreWeave, focusing on building and operating a Kubernetes-native inference platform for AI workloads. The role involves technical leadership in architecture, performance optimization (latency, throughput, GPU utilization), and system reliability for low-latency, high-throughput systems at massive scale, with deep work in distributed systems and Kubernetes infrastructure. | Serve | 7 |
| Staff Technical Program Manager - Cluster Orchestration & Applied Training Staff Technical Program Manager to lead cross-functional programs for AI/ML Platform Services, focusing on Cluster Orchestration (scheduling, launching, managing AI workloads) and Applied Training (enabling researchers to use infrastructure for pre-training, fine-tuning, RL, evaluations). The role involves partnering with engineering, product, and research teams to improve workload execution and user interaction with training platforms, driving delivery across various AI training workflows and ensuring successful launches and operational ownership. | Serve · Post-train | 7 |
| Senior Software Engineer, Applied AI Senior Software Engineer to design and build production-grade, full-stack AI-native analytics platforms and first-party applications that embed governed data directly into operational workflows. The role involves developing AI-enabled user experiences, scalable backend services, and intuitive interfaces, integrating AI/LLM capabilities into real-world applications. | Ship | 7 |
| Principal Engineer, Cluster Orchestration CoreWeave is seeking a Principal Engineer to lead the design and evolution of their AI infrastructure's cluster orchestration systems, including Slurm, Kubernetes, and SUNK. This role involves defining long-term architecture, solving scaling problems, and ensuring the reliability and efficiency of GPU resource utilization for AI training and inference workloads. | Serve | 7 |
| Staff Product Manager, Insights Staff Product Manager for CoreWeave's Insights team, focusing on developing AI-powered observability experiences for AI workloads. The role involves defining strategy, roadmaps, and metrics for dashboards, alerts, and AI-driven insights to help customers understand performance, reliability, and cost in their cloud environments. Key responsibilities include translating telemetry into actionable insights and driving proactive surfacing of information, particularly for cost optimization and workload efficiency. | Agent · Eval Gate | 7 |
| Senior Software Engineer, Observability Insights Senior Software Engineer to lead development of agentic interfaces and product experiences for AI system observability, focusing on multi-tenant APIs, Grafana, and tool servers. Requires experience in backend systems, distributed APIs, reliability engineering, and agentic applications/LLM features. | Agent · Serve | 7 |
| Solutions Architect - HPC/AI/ML Solutions Architect role focused on supporting customers running AI/ML workloads on CoreWeave's HPC cloud infrastructure, with an emphasis on AI/ML inference. Responsibilities include technical customer contact, solution design, proof of concept development, and workload optimization. Requires expertise in cloud computing, distributed systems, AI/ML inference, NVIDIA GPUs, and Kubernetes. | Serve | 7 |
| Senior Software Engineer II, Applied Training Senior Software Engineer II, Applied Training at CoreWeave, focusing on building and scaling Kubernetes-native research cluster platforms and sandbox client infrastructure for agentic training and evaluation. The role aims to provide AI labs with advanced research infrastructure, enabling them to focus on model training rather than operations. Responsibilities include contributing to the roadmap, designing cluster experiences, owning SDKs for agent rollouts and benchmarks, writing documentation, and working closely with large AI labs. | Serve · Agent | 7 |
| Staff Software Engineer, Applied Training CoreWeave is seeking a Staff Software Engineer to join their Applied Training team. This role will focus on building and improving their Kubernetes-native research cluster platform and sandbox client for agentic training and evaluation. The goal is to provide AI researchers with the infrastructure needed to train models efficiently, abstracting away operational complexities. Responsibilities include contributing to the roadmap, designing and building cluster experiences, owning the Python SDK for agentic workflows, and documenting training frameworks. The ideal candidate has extensive experience in distributed systems, ML infrastructure, or developer platforms, with strong Kubernetes expertise and familiarity with AI training and agentic workflows. | Serve · Agent | 7 |
| Senior Software Engineer I, Inference CoreWeave is seeking a Senior Software Engineer to own and improve their Kubernetes-native inference platform, focusing on latency, throughput, and reliability. The role involves leading design, implementing optimizations, strengthening incident posture, and mentoring junior engineers. Requires experience with distributed systems, Kubernetes, and inference internals. | Serve | 7 |
| Sr. Software Engineer - Perf and Benchmarking Senior Software Engineer focused on performance and benchmarking of AI infrastructure, including Kubernetes-native services, MLPerf runs, and model-serving stacks. The role involves building and improving services to measure latency, throughput, and cost, and ensuring reproducible benchmarking processes. | Serve · Eval Gate | 7 |
| Senior Software Engineer (Full-Stack + Agentic AI) Senior Software Engineer role focused on developing AI agents and full-stack applications for internal enterprise systems. The role involves using frameworks like LangChain and LangGraph, building backend services, and integrating with various enterprise systems to automate tasks in Finance, Billing, and Supply Chain. | Agent | 7 |
| Software Engineer, Inference AI/ML Software Engineer focused on improving the latency, reliability, and cost of model serving on a GPU platform, working with services like Triton, vLLM, and TensorRT-LLM. | Serve | 7 |
| Senior Software Engineer II, Inference Senior Software Engineer II focused on owning and optimizing CoreWeave's Kubernetes-native inference platform to meet strict P99 SLAs at scale. Responsibilities include leading design reviews, implementing advanced optimizations for latency and throughput, strengthening incident posture, and mentoring junior engineers. Requires strong experience in distributed systems, Python/Go, networked systems performance, Kubernetes, and ML inference internals. | Serve | 7 |
| Solutions Architect - HPC/AI/ML Solutions Architect role focused on AI/ML inference workloads on high-performance compute (HPC) infrastructure, primarily using Kubernetes and NVIDIA GPUs. The role involves customer technical contact, solution design, proof of concept, workload optimization, and providing feedback to product teams. | Serve | 7 |
| Senior Systems Engineer, OS Automation Senior Systems Engineer focused on automating and scaling Linux OS and Kernel build pipelines, with a strong emphasis on integrating AI/ML technologies like LLMs, RAG, and predictive modeling to create AI-native infrastructure, smart CI/CD, auto-remediation, and predictive regression detection. | Serve · Agent | 7 |
| Product Growth Strategist - AI & Engineering This role focuses on translating high-level AI ambitions into executable deployment programs for enterprise clients in the physical AI space. The Product Strategist will partner with sales and engineering teams to shape AI opportunities, engage stakeholders to define roadmaps, and convert client conversations into qualified sales opportunities while gathering product feedback. | — | 5 |
| Senior Security Engineer I, Advanced Response This role focuses on leading critical cybersecurity incidents, hunting adversaries, and building AI-powered tooling to enhance CoreWeave's defense capabilities at scale. The Senior Security Engineer will architect and build AI tools to accelerate threat detection and response, conduct deep technical investigations, and run a structured threat hunting program. | Agent | 5 |
| Product Manager, Finance Product Manager for Finance Systems at CoreWeave, focusing on shaping the Accounting and Finance technology landscape. The role owns the vision, roadmap, and execution for Finance systems, partnering with Finance stakeholders, IT Engineering, Data, and Security to deliver scalable, AI-enabled solutions. Key responsibilities include defining roadmaps, driving prioritization, leading discovery and design, and identifying opportunities to improve Finance processes through AI-enabled workflows and automation. The role also ensures auditability, observability, and SOX compliance. | — | 5 |
| Staff People Operations Partner Staff People Operations Partner role at CoreWeave, focusing on scaling core People Operations programs, managing employee lifecycle operations, leaves of absence, accommodations, and service delivery. The role involves driving and refining processes, ensuring consistent execution across a global organization, and mentoring a team. Requires deep expertise in People Operations, employee lifecycle management, LOA, accommodations, HRIS platforms, and service delivery models. Experience with AI-enabled tools for operational improvement is a plus. | — | 5 |
| Financial Analyst This role is for a Financial Analyst at CoreWeave, an AI infrastructure company. The analyst will focus on the financial coverage and support for the company's serverless product suite, including forecasting, developing cost-to-serve frameworks, and supporting deal evaluation. The role involves building and maintaining financial models, partnering with product, sales, and engineering teams, and leveraging AI tooling for automation. The position is high-visibility and requires strong analytical and modeling skills. | — | 5 |
| Software Engineer II, Developer Experience Software Engineer II, Developer Experience at CoreWeave, focusing on building and shipping features for the Developer Experience platform, including CI/CD, artifact management, and agentic AI tooling to accelerate developer workflows. | Agent | 5 |
| Senior Software Engineer, Developer Experience Senior Software Engineer focused on Developer Experience (DevEx) at CoreWeave, an AI cloud provider. The role involves designing and building platforms, services, and systems that enhance developer productivity, reliability, and the adoption of agentic AI tooling across the engineering organization. Key responsibilities include setting technical direction, architecting agentic developer platforms, and ensuring systems scale to serve thousands of instances. | Agent | 5 |
| Staff Software Engineer, Developer Experience Staff Software Engineer focused on Developer Experience, specifically owning the technical strategy and execution for the agentic developer platform. This role involves defining the architecture for CI, build infrastructure, test experience, delivery systems, and artifact management, with a strong emphasis on evolving agentic systems and autonomous workflows to improve developer productivity. | Agent | 5 |
| Manager, Technical Recruiting Manager, Technical Recruiting at CoreWeave, a cloud provider focused on AI. The role involves scaling engineering teams, including those focused on AI infrastructure, by leading a team of recruiters, developing hiring strategies, and managing full-funnel recruiting performance. Requires experience in technical recruiting for high-growth environments and leading recruiting teams. | — | 5 |
| Senior Manager, Observability Senior Manager, Observability Engineering to lead a team responsible for building, scaling, and operating observability systems across metrics, logs, traces, and telemetry pipelines. This role combines technical leadership, operational ownership, and team management to ensure observability platforms scale with business and customer needs, supporting AI infrastructure. | — | 5 |
| Senior Manager, Data Infrastructure Services This role is for a Senior Manager of Data Infrastructure Services at CoreWeave, a cloud provider focused on AI. The manager will lead a team responsible for data infrastructure, including managed databases, data ingestion, data flow, data lakes, and analytics. The role involves building and developing the engineering team, designing and implementing data services, improving performance and reliability, establishing data access guidelines, ensuring compliance with data protection regulations, and analyzing system data for improvements. The ideal candidate has experience managing data service infrastructure teams, expertise in open table formats (Iceberg, DeltaLake, Hudi), and knowledge of production operations, reliability engineering, and data governance. | — | 5 |
| Security Engineering Manager, Platform Security This role is for a Security Engineering Manager, Platform Security at CoreWeave, an AI hyperscaler. The individual will lead and scale the platform security engineering function, focusing on designing security into their Kubernetes-based platform and public cloud environments. Responsibilities include building and operating security controls, defining strategy for cloud security posture, workload isolation, platform guardrails, image integrity, and multi-cloud security. The role involves leading a team of platform security engineers and partnering with other engineering teams. The company emphasizes building and operating systems over writing policy. | — | 5 |
| Integration Engineer Enterprise Systems This role focuses on building and scaling internal systems for Finance and Supply Chain, delivering AI-first, highly automated, and deeply integrated solutions. The Integration Engineer will design and build scalable integrations using Workato/Boomi, develop APIs, and enable real-time data movement across business-critical platforms, with growing exposure to GenAI-enabled automation. | — | 5 |
| Product Manager, Data Center Product Manager for Data Center Technology at CoreWeave, focusing on defining and delivering a scalable technology ecosystem for Data Center Operations. This includes owning product strategy, roadmap, and execution for systems like DCIM, CMMS/EAM, BMS/SPoG, construction management, and workforce systems. The role involves translating business goals into solutions, making build/buy/extend decisions, and driving adoption of data, AI, and automation. The ideal candidate has 8+ years of experience in product management within data center or industrial environments, with a proven track record in building and scaling data- and AI-driven products. | — | 5 |
| Sr. Engineer, Storage CoreWeave is seeking a Sr. Engineer, Storage to design and implement distributed storage solutions for AI workloads. This role involves working with exabyte-scale object storage, distributed filesystems, and optimizing performance and reliability using technologies like RDMA, GPU Direct Storage, NFS, and FUSE. The engineer will also lead efforts in reliability, security, observability, and collaborate with various teams to ensure seamless storage capabilities. Experience with AI tools for software development and storage observability tools is required. | — | 5 |
| Staff Product Manager, Data Services Staff Product Manager for Data Services at CoreWeave, focusing on the strategy, roadmap, and execution of the data platform powering their AI cloud. The role involves defining how data flows across the platform, managing databases, data lakes, streaming, metadata, and governance, with a focus on scale, throughput, and performance for GPU-intensive AI workloads. This includes customer engagement, cross-functional alignment, and market analysis. | — | 5 |
| Senior Site Reliability Engineer, Data Infrastructure This role focuses on the reliability, scalability, and security of a data platform that supports internal AI workloads. The Senior Site Reliability Engineer will own the reliability and performance of a Kubernetes-based data platform, designing and operating highly available, multi-region systems, and ensuring services meet strict uptime and latency targets. Responsibilities include scaling infrastructure, improving deployment pipelines, hardening security, and evolving DevSecOps practices. | — | 5 |
| Senior Software Engineer - Data Lake & BI The role is for a Senior Software Engineer focused on building and evolving a planet-scale performance data warehouse for an AI cloud provider. The engineer will own the architecture for ingesting, storing, transforming, and surfacing performance data, turning raw events into actionable insights for engineering and business decisions. Key responsibilities include data lake architecture, schema design, time-series metrics infrastructure, BI/visualization, and query optimization. The role requires strong experience in distributed systems, data platforms, Python/Go, Kubernetes, data lake architectures, columnar databases, time-series databases, and BI tools. Experience with MLPerf or benchmarking GPU fleets is preferred. | — | 5 |
| Senior Software Engineer, Cluster Orchestration CoreWeave is seeking a Senior Software Engineer to join their Cluster Orchestration team. This role will focus on advancing CoreWeave's orchestration platform, including SUNK (Slurm on Kubernetes) and their Kubernetes-native foundation, which powers AI training and inference at scale. The engineer will be responsible for ensuring workloads run seamlessly, reliably, and efficiently across massive GPU clusters, eliminating infrastructure bottlenecks and creating new orchestration capabilities to empower customers. The role involves owning multiple services, leading design/code reviews, decomposing projects, driving improvements in reliability and performance, defining SLIs/SLOs, strengthening operational practices, and mentoring junior engineers. | Serve | 5 |
| Senior Developer Relations Engineer - Marimo Senior Developer Relations Engineer for marimo, a next-gen Python notebook environment. Focuses on community growth, developer engagement, and bridging technical feedback between users and the engineering team. Requires software engineering, developer relations, and AI/ML/data science technical background. | — | 5 |
| Enterprise GTM Leader This role focuses on defining and executing the technical go-to-market strategy for enterprise customers adopting CoreWeave's GPU infrastructure for AI workloads. It involves guiding complex AI infrastructure deployments from proof-of-concept to production, shaping deal strategy, building repeatable deployment frameworks, and translating enterprise requirements into product and platform innovation. The role requires deep expertise in enterprise cloud infrastructure, AI/ML platforms, and GPU environments, with a focus on scaling AI deployments. | Serve | 5 |
| Staff Engineer, Storage Engine CoreWeave is seeking a Staff Engineer for their Storage Engine Team to design and implement distributed storage solutions for AI workloads. Responsibilities include developing exabyte-scale S3-compatible object storage, integrating dedicated storage clusters, optimizing performance using technologies like RDMA and GPU Direct Storage, and improving reliability, durability, security, and observability of the storage stack. The role involves analyzing telemetry data, collaborating with cross-functional teams, and mentoring engineers. Requires 8-10+ years of experience in storage systems engineering, proficiency in systems programming languages (Go, C, Rust), and familiarity with storage protocols (S3, NFS) and systems (Ceph, DAOS). | — | 5 |
| Senior Marketing Performance Analyst This role focuses on building and expanding a cohesive metrics framework to track the buyer's journey and optimize marketing spend. It involves defining KPIs, attribution models, forecasting, analyzing campaign performance, and ensuring data capture. The role also emphasizes driving the adoption of AI-driven insights for marketing decision-making. | — | 5 |
| Staff Security Engineer, SOAR This role focuses on designing, deploying, and maintaining Security Orchestration, Automation, and Response (SOAR) capabilities, developing automations that interact across various products and services, and enriching decision-making through integrations. The role will leverage AI tooling to provide context for security events and alerts, and requires writing production-quality code for internal security services. The ideal candidate is a detection engineer or incident responder with strong development skills in Python or GoLang, experience with Kubernetes and Git, and familiarity with SOAR platforms. | — | 5 |
| Director, Developer Relations Director of Developer Relations to build and grow a technical community around CoreWeave's SaaS offerings, including tools from Weights & Biases, OpenPipe, and Marimo. The role involves acting as a bridge between developers and internal teams, translating product capabilities into technical narratives, and channeling community feedback. The goal is to help AI researchers, developers, and builders create models and self-improving AI agents. | — | 5 |
| Staff Software Engineer, Cluster Orchestration Staff Software Engineer role focused on advancing CoreWeave's orchestration platform (SUNK, Kubernetes) for AI training and inference at scale. Responsibilities include technical leadership, architectural direction, and ensuring seamless, reliable, and efficient workload execution on massive GPU clusters. | — | 5 |
| Product Strategy Principal This role focuses on developing a portfolio-level product strategy for an AI cloud platform company. The Product Strategy Principal will conduct market research, translate insights into strategy and roadmap, and collaborate with product and engineering teams to align different product areas. The role requires a strong understanding of the AI/ML landscape and customer needs to ensure the company remains at the forefront of the AI Hyperscaler market. | — | 5 |
| Senior Product Marketing Manager, SUNK Product Marketing Manager for CoreWeave's SUNK (Slurm on Kubernetes) offering, focusing on positioning and bringing to market a research and training cluster for demanding AI workloads. Responsibilities include defining messaging, creating launch narratives, enabling sales, and demonstrating marketing impact on product adoption and deal velocity. | — | 5 |
| Senior Security Engineer, SOAR Senior Security Engineer role focused on designing, deploying, and maintaining Security Orchestration, Automation, and Response (SOAR) capabilities, developing automations that interact across multiple products and services, and leveraging AI tooling to provide context for security events. The role involves writing production-quality code for internal security services. | Agent | 5 |
| AI Solutions Engineer, Post Sales Scale - W&B Weights & Biases is seeking a Post-Sales AI Solutions Engineer to help customers implement and scale AI/ML workflows and GenAI/agentic applications. This role involves designing and delivering technical enablement and adoption programs, creating reusable assets, and using product signals to improve customer outcomes. The ideal candidate has Python proficiency, experience with deep learning frameworks, and familiarity with cloud platforms. | Ship | 5 |