Currently tracking 20 active AI roles, up 25% versus the prior 4 weeks. Primary focus: Serve · Engineering. Salary range $92k–$341k (avg $209k).
Data AI · ML experiment tracking
| Title | Stage | AI score |
|---|---|---|
| IT Operations Specialist IT Operations Specialist role focused on supporting and scaling internal IT environments, including identity management, endpoints, SaaS platforms, and office infrastructure. Requires experience in IT support, systems administration, and automation scripting. The role emphasizes reliability, automation, and operational maturity within a fast-moving, high-growth environment. | — | 0 |
| Senior Manager, Site Selection Manager The Site Selection Manager will identify, evaluate, and advance data center site opportunities across the Americas, partnering with internal teams and external stakeholders to ensure deals progress efficiently from market screening through lease execution and delivery readiness. This role requires extensive experience in data center site selection, due diligence, and commercial negotiations, with a strong understanding of utility infrastructure and regulatory frameworks. | — | 0 |
| Security Engineering Manager, Enterprise Security | — | 0 |
| Reliability Lead, Common Services CoreWeave is seeking a Reliability Lead for their Common Services organization. This role will establish and lead the Reliability Engineering and production operations practice for shared platforms, APIs, and foundational services that power their AI cloud products. Responsibilities include defining reliability strategy, processes, and standards, managing incidents, driving observability, designing for reliability, and automating operational workflows. The role requires strong experience in SRE, distributed systems, Linux, observability stacks, incident response, and infrastructure-as-code. | — | 0 |
| Data Center Energy Analyst This role focuses on energy data management, analysis, and forecasting for a data center portfolio, supporting operational and financial decision-making. It involves collecting, validating, and analyzing energy usage and cost data, partnering with external energy providers, and supporting financial processes. The role requires strong data analysis skills and experience with energy data or utility data. | — | 0 |
| Manager, Bare Metal Support Engineering Manager of Bare Metal Support Engineering role at CoreWeave, focusing on leading a team to maintain and optimize physical infrastructure (servers, GPUs, power, cooling) for AI workloads. Responsibilities include daily support operations, incident triage, escalation management, process improvement, and client communication, ensuring the stability and performance of the cloud platform for AI clients. | — | 0 |
| Principal Engineer, Storage Principal Engineer role focused on designing, building, and operating the data plane for a high-performance AI storage platform. The role involves developing scalable, high-throughput storage solutions, optimizing performance and reliability, and collaborating with infrastructure and platform teams. Experience with object storage, distributed file systems, and systems programming languages is required. | — | 0 |
| HPC Engineer This role focuses on supporting large-scale data center deployments of NVLink systems, involving hardware and software lifecycle management, building automation tooling, and troubleshooting complex network and server issues. It requires strong networking fundamentals, Linux administration, and proficiency in a scripting language like Python or Go. | — | 0 |
| Firmware Engineering Manager CoreWeave is seeking a Firmware Engineering Manager to lead a new team focused on developing and maintaining BMC and BIOS firmware for their server infrastructure. The role involves people leadership, technical guidance, execution, and cross-functional collaboration to ensure reliable and scalable firmware for data center platforms. | — | 0 |
| Infrastructure Operations Program Manager This role is for an Infrastructure Operations Program Manager at CoreWeave, a cloud provider focused on AI workloads. The role involves operationalizing and scaling the company's bare-metal support program, focusing on data analysis and reporting to improve client experience and operational processes. Responsibilities include owning data processes and tooling, driving operational insights through data, partnering cross-functionally, building data models and reporting layers, transforming ad hoc processes into standardized workflows, developing key metrics and dashboards, and enabling a metrics-driven operating model. The ideal candidate has program or operations management experience in cloud infrastructure or datacenter operations, with strong data analysis and reporting skills, and familiarity with support or service delivery environments and cloud computing concepts. | — | 0 |
| Senior Manager, Controllership Data Governance & Systems This role focuses on the design, governance, and optimization of a company's global financial systems and reporting architecture, specifically within the ERP and accounting systems. It involves managing master data like chart of accounts, ensuring scalability, integrity, and compliance with global accounting policies and reporting requirements. The role also drives enterprise-wide ERP transformation initiatives, focusing on corporate accounting, financial reporting, internal controls, and compliance, translating accounting policies into robust ERP system designs. | — | 0 |
| Staff Production Engineer This role focuses on building and operating foundational platforms and frameworks for a cloud infrastructure provider, emphasizing reliability, observability, and scalability. The engineer will design, build, and own systems that reduce operational toil, improve delivery velocity, and enhance availability and resiliency. Key responsibilities include developing automation, self-service capabilities, and paved paths for operational excellence, as well as participating in incident response and shipping production code. The role requires deep expertise in distributed systems, cloud-native platforms (especially Kubernetes), and observability practices. | — | 0 |
| Master Scheduler CoreWeave is seeking a Master Scheduler to manage raw material deliveries for data center projects. This role involves analyzing project plans, forecasting material needs, coordinating cross-functionally, tracking deliveries, and resolving schedule conflicts to ensure on-time and in-full material arrival. The goal is to maintain build velocity, cost control, and execution excellence in a fast-paced environment. | — | 0 |
| Senior Engineering Manager, Data Engineering Seeking a Senior Engineering Manager to lead the data engineering function, responsible for building and maintaining a petabyte-scale enterprise data lake. The role involves technical leadership, people management, and partnering with stakeholders to translate strategic priorities into scalable data solutions. | — | 0 |
| Senior Supply Chain Compliance Analyst (SOX) This role is for a Senior Supply Chain Compliance Analyst (SOX) at CoreWeave, a cloud provider for AI. The analyst will be responsible for all supply chain related SOX and internal control activities, mapping operational processes to financial controls, and ensuring the supply chain organization is audit-ready. This involves owning SOX controls, partnering with Finance SOX and Internal Audit, coordinating audits, developing policies, monitoring control metrics, and creating training materials. The role requires experience in SOX 404, ERP/procurement platforms, process/control documentation, data analysis, and supporting audits. Experience in capital-intensive environments and public company SOX programs is preferred. | — | 0 |
| Technical Project Manager - Afton Technical Project Manager for data center deployments, focusing on infrastructure (power, cooling, networking, servers) and project lifecycle management. Requires on-site presence and experience with high-performance compute/GPU technologies is a plus. | — | 0 |
| Operations Engineer, Fleet Reliability CoreWeave is seeking an Operations Engineer for Fleet Reliability to manage and maintain their GPU supercomputing clusters. Responsibilities include provisioning, troubleshooting hardware/software issues, monitoring system performance, and creating documentation. Requires strong Linux system administration and scripting skills, with preferred experience in data center infrastructure, observability platforms, and HPC. | — | 0 |
| Data Center Manager - Muskogee This role is for a Data Center Manager responsible for the operational excellence of a facility, leading technicians in hardware diagnostics, repairs, and installations, and ensuring the stability, security, and scalability of physical assets. It involves hands-on leadership, troubleshooting, and coordinating with cross-functional teams to maintain critical infrastructure. | — | 0 |
| Technical Program Manager, IaaS The role is for a Technical Program Manager (TPM) focused on CoreWeave's Infrastructure as a Service (IaaS) for their CPU Compute platform, which complements GPU acceleration in AI clusters. The TPM will lead cross-functional programs to convert product strategy into scalable, reliable, and observable infrastructure, focusing on performance and scalability initiatives for high-throughput CPU clusters in AI environments. This involves managing programs, defining scope, partnering with engineering and product teams, and driving process improvements for complex, high-throughput platforms. While the role operates within an AI-focused company and supports AI infrastructure, the core responsibilities are in program management for cloud infrastructure, not direct AI/ML model development or research. | — | 0 |
| Staff Security Engineer, Network Security Staff Network Security Engineer responsible for architecting the defense of global backbone, edge, and massive-scale GPU clusters. Focuses on engineering security into the network fabric through automation, telemetry, and protocol analysis, rather than just configuring firewalls. Involves developing automation frameworks for network security, integrating security into CI/CD pipelines, and providing security recommendations for network architecture. | — | 0 |
| Hardware Engineer - Liquid Cooling This role focuses on the design, development, and optimization of liquid cooling hardware infrastructure for AI data centers. Responsibilities include automating the hardware lifecycle, defining requirements, optimizing performance, and implementing monitoring systems for cooling components. The role requires experience with server hardware, data center thermal design, and automation tools like Ansible/Python. | — | 0 |
| Firmware Engineer, SPX Firmware Engineer role focused on BMC firmware development for AI server platforms, involving C programming, integration, debugging, and optimization within a large-scale data center environment. | — | 0 |
| Staff Engineer, Storage Engine Staff Engineer for the Storage Engine Team at CoreWeave, focusing on designing and implementing scalable, high-performance distributed storage solutions for AI workloads. Responsibilities include optimizing storage performance, ensuring reliability and security, and developing observability tools for exabyte-scale storage systems. | — | 0 |
| Senior Engineer, Network Observability Senior Engineer for Network Observability to design, develop, and maintain monitoring, telemetry, and observability systems for a GPU cloud network. Focus on building solutions for real-time insights into network performance, proactive issue detection, and rapid resolution. Responsibilities include developing observability platforms using Python and Golang, ingesting and unifying logs, metrics, and events, designing scalable telemetry solutions, and collaborating with network engineering, SRE, and security teams. | — | 0 |
| Production Engineer Production Engineer role focused on maintaining the reliability and stability of CoreWeave's cloud infrastructure, involving incident response, operational support, and process improvements. Requires experience in cloud operations, SRE, or related technical roles, with familiarity in monitoring tools and scripting. | — | 0 |
| Senior Engineer, Storage Control Plane Senior Engineer, Storage Control Plane at CoreWeave, focusing on designing, building, and operating a high-performance AI storage platform. The role involves developing scalable, multi-tenant control planes and optimizing storage systems for AI workloads, collaborating with infrastructure and platform teams. | — | 0 |
| Senior Platform Engineer II, Compute Services Senior Platform Engineer II, Compute Services role at CoreWeave, focusing on administering and championing reliability for multi-tenant Kubernetes platforms. Responsibilities include lifecycle management, day 2 operations, and deep dives into reliability issues. Requires 5+ years of Kubernetes administration, GitOps/DevOps experience, and proficiency in Go. | — | 0 |
| Senior Electrical Engineer Senior Electrical Engineer role focused on designing and developing servers and AI/ML hardware, including board design, signal integrity, and power design, from concept to mass production. Requires experience with high-speed interfaces and interdisciplinary collaboration. | — | 0 |
| Systems Engineer, Kernel CoreWeave is seeking a Systems Kernel Engineer to join their HAVOCK Team. The role focuses on maintaining and improving the stability, performance, and evolution of CoreWeave’s Linux-based infrastructure, with responsibilities including debugging kernel-level issues, analyzing crashes, and upstreaming fixes and features. The ideal candidate will have deep experience in low-level systems engineering and understand how modern workloads stress kernels, working across CPUs, GPUs, DPUs, networking, and storage. | — | 0 |
| Staff Engineer, Data Services Staff Engineer specializing in database and stream processing to manage and develop CoreWeave's data infrastructure, including managed databases, data ingestion, data flow, and data lakes. The role involves driving technical decisions, championing event-driven architecture, designing and implementing data platforms, developing stream processing architecture, and ensuring scalability, reliability, and security. The engineer will also establish guidelines for data access and storage, ensure compliance with data protection regulations, and contribute to the company's global datastore strategy. | — | 0 |
| Data Center Manager - Ellendale, ND This role is for a Data Center Manager responsible for the operational excellence of a facility, including leading a team of technicians, hardware diagnostics, physical repairs, new equipment installations, and ensuring the stability, security, and scalability of physical assets. It requires experience in data center operations, leadership, and infrastructure management. | — | 0 |
| Senior Business Systems Engineer – Supply Chain Systems This role is for a Senior Business Systems Engineer focused on supply chain systems, specifically NetSuite ERP customizations, integrations, and optimizations. It involves partnering with cross-functional teams to translate business requirements into scalable solutions for procurement, inventory management, manufacturing, and fulfillment. Experience with SOX compliance is required, and experience applying AI to supply chain use cases is preferred. | — | 0 |
| Data Center Technician - Ellendale, ND This role is for a Data Center Technician responsible for maintaining data center operations, performing hardware and network diagnostics and repairs, and providing technical support. It involves physical repair, script development for hardware updates, and potentially on-call rotation. The role requires hands-on experience with computer hardware, networking, and scripting languages, as well as experience in a data center environment. | — | 0 |
| Software Engineer, Kubernetes Software Engineer role focused on building, operating, and scaling Kubernetes-based production infrastructure. Responsibilities include developing automation, implementing monitoring and observability solutions, driving incident response, and engineering for resiliency. The role emphasizes experience with Kubernetes administration, container orchestration, and infrastructure-focused programming. | — | 0 |
| Staff Software Engineer, Observability Staff Software Engineer focused on building and maintaining scalable observability systems (logging, tracing, metrics) for a cloud provider specializing in AI infrastructure. This role involves leading engineers, managing production clusters, and ensuring reliability of critical infrastructure. | — | 0 |
| Software Engineer, Network Services Software Engineer to lead architecture, scaling, and operations of network services for GPU cloud services, focusing on high performance and reliability for AI workloads. | — | 0 |
| Senior Security Production Engineer CoreWeave is seeking a Senior Security Production Engineer to build, scale, and maintain the secure infrastructure for their AI cloud platform. This role involves designing and operating security systems, automating processes, enhancing observability, and responding to incidents, with a focus on reliability and performance for AI workloads. | — | 0 |
| Solutions Architect- Networking This role is for a Solutions Architect focused on networking technologies within high-performance compute (HPC) environments for AI workloads. The individual will be the primary technical point of contact for customers, helping them onboard, optimize, and succeed with CoreWeave's cloud infrastructure. Responsibilities include collaborating with customers to prototype and deploy tailored solutions, leading proof of concept initiatives, providing technical leadership, and offering insights for product enhancement. The role requires expertise in cloud computing, distributed systems, HPC/cloud services, and specifically infrastructure networking, with familiarity with NVIDIA GPUs, InfiniBand, and NCCL. | — | 0 |
| Bare Metal Support Engineer This role supports the infrastructure for AI workloads, focusing on bare-metal GPU fleet management, customer issue resolution, and operational reliability within a data center environment. It involves troubleshooting hardware, software, and networking issues to ensure seamless customer experiences for AI computations. | — | 0 |
| Senior Network Engineer, Data Center Senior Network Engineer for a data center network supporting AI and GPU cloud infrastructure. Responsibilities include designing, deploying, and managing large-scale networks, troubleshooting complex issues, and collaborating across teams. Requires extensive experience in network engineering, hyperscale fabrics, routing protocols, and automation. | — | 0 |
| Senior Software Engineer, Server Fleet Infrastructure CoreWeave is seeking a Senior Software Engineer for their Server Fleet Infrastructure team. This role involves designing and building software to manage large-scale bare metal compute infrastructure across globally distributed data centers, focusing on automation, fleet lifecycle management, and observability. The engineer will work with technologies like Go, Python, Ansible, Linux, gRPC, and Kubernetes, and will be responsible for developing provisioning services, custom controllers, and monitoring solutions to ensure reliable and efficient infrastructure for AI workloads. | — | 0 |
| Senior Engineer, Compute Services (Kubernetes, Bare Metal) Senior Engineer, Compute Services role at CoreWeave focused on building and maintaining fault-tolerant Kubernetes infrastructure on bare-metal, using Python, Golang, and Bash. Responsibilities include provisioning, lifecycle management, optimization, and automated testing of Kubernetes control planes. Requires strong DevOps and Linux troubleshooting skills, with experience in Ansible and CI/CD tools. On-call rotation is expected. | — | 0 |
| Senior Firmware Engineer, OpenBMC Senior Firmware Engineer role focused on developing and maintaining OpenBMC-based firmware for datacenter infrastructure, involving design, implementation, integration, debugging, and optimization of embedded systems. | — | 0 |