Scale AI currently has 104 active AI-related job listings. The majority of these roles are focused on agents, representing 34% of the total openings. Engineering is the top function, followed by Research. The company is actively hiring for positions related to model serving, agent orchestration, and evals. Over the last 30 days, Scale AI has added 20 new AI roles, a significant increase of 186% compared to the preceding 30-day period.
Currently tracking 83 active AI roles, down 46% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $139k–$393k (avg $255k).
Scale AI currently has 99 active AI-related roles in our index. The most common open titles are: Product Manager of AI Applications, Global Public Sector (2), Software Engineer, Robotics (2), Solutions Engineer, Enterprise (2), Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI, Senior Software Engineer, Full-Stack – Scale GP. Most positions are in Engineering and Research.
Scale AI's active AI hiring is concentrated in: agents (32%), application (23%), evaluation (15%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Scale AI is hiring AI talent in: United States (64 roles), United Kingdom (15 roles), Qatar (2 roles), Mexico (2 roles).
Job postings at Scale AI most frequently reference: model serving, agent orchestration, evals, llm observability, fine tuning.
In the past 30 days, Scale AI has posted 11 new AI-related roles. That is a -42% change versus the prior 30 days (19 → 11).
Data AI · Data labeling
| Title | Stage | AI score |
|---|---|---|
| Machine Learning Engineer, Global Public Sector Scale AI is hiring ML Research Engineers to bridge the gap between frontier research and real-world impact for global governments. The role involves leading research into Agent design, Deep Research, and AI Safety/reliability, developing novel methodologies for public sector applications and setting new standards across the organization. Responsibilities include pioneering novel architectures, leading AI safety initiatives, driving deep research capabilities, publishing, consulting, and building evaluation frontiers. | AgentPost-train | 10 |
| AI Builder Intern This internship focuses on building and deploying AI-powered tools, automating workflows, and creating agentic systems using LLM-integrated frameworks. The role involves developing internal tools, dashboards, and API-connected automations, with an emphasis on shipping functional products and measuring their impact. It requires hands-on experience with LLM APIs, Python/JavaScript, and familiarity with agentic frameworks. |
| Agent |
| 9 |
| Research Scientist, Safety Post Training Research Scientist focused on developing and applying post-training methods and interpretability techniques to enhance the safety, robustness, and alignment of frontier AI systems. The role involves designing post-training pipelines, conducting evaluations to understand model behaviors, and collaborating to translate findings into safety standards and best practices. | Post-train | 9 |
| Senior Staff Forward Deployed AI Engineer, Enterprise Senior Staff Forward Deployed AI Engineer for Enterprise team at Scale AI. Role involves being a technical bridge between Scale AI's capabilities and strategic customers, understanding challenges, architecting custom AI solutions, and ensuring successful deployment. This hands-on role combines deep engineering expertise with customer-facing problem solving, integrating AI into critical workflows. | AgentServe | 9 |
| Staff Forward Deployed AI Engineer, Enterprise Staff Forward Deployed AI Engineer role focused on bridging Scale AI's capabilities with enterprise clients, involving custom AI solution architecture, integration, deployment, AI agent development (including multi-agent systems and RAG), prompt engineering, and technical leadership. Requires strong software engineering, Python, ML/AI frameworks, and cloud platform experience. | AgentServe | 9 |
| Technical Lead Manager, Physical AI Scale AI is seeking a Technical Lead Manager for their Physical AI team to lead research engineers in developing and evaluating Large-Scale Foundation Models for robots and AVs. The role involves hands-on contributions to model scaling, VLA/world model development, and data strategy, alongside team mentorship and translating research into production-ready features. | PretrainAgent | 9 |
| Director, Forward Deployed Engineering Director of Forward Deployed Engineering at Scale AI, leading a team to deliver and integrate AI agents for large enterprise customers. The role involves owning end-to-end delivery, technical oversight of agents, models, evaluations, and infrastructure, and partnering with Product teams to translate field lessons into reusable platform capabilities. Requires strong leadership, hands-on AI stack fluency, and experience deploying AI in complex enterprise environments. | AgentEval Gate | 9 |
| Research Scientist, Frontier Risk Evaluations Research Scientist role focused on designing and building evaluation measures, harnesses, and datasets for frontier AI systems, with a focus on identifying and mitigating risks. The role involves collaboration with external agencies and publishing findings, bridging AI research and policy. | Eval GateAgent | 9 |
| Research Scientist, Agent Robustness Research Scientist focused on agent robustness, AI safety, and risk evaluations. The role involves researching AI agent capabilities, designing tests for harmful actions, creating exploits and mitigations for failure modes, and characterizing risks in multi-agent systems. Experience with post-training techniques like RLHF and published research in generative AI is required. | AgentEval Gate | 9 |
| Research Scientist, AI Controls and Monitoring Research Scientist role focused on designing methods, systems, and experiments for AI controls and monitoring, ensuring advanced AI models and agents remain aligned with intended goals, even in high-stakes or adversarial environments. This includes developing monitoring techniques, researching layered control mechanisms, designing red-team simulations, and collaborating with policymakers. | Eval GatePost-train | 9 |
| Machine Learning Fellow - Human Frontier Collective (Canada) This role is for a Machine Learning Fellow focused on evaluating, interpreting, and optimizing advanced generative AI systems. The fellow will engage in ML projects, contribute to research publications, and collaborate with AI labs and platforms. The role requires a PhD or postdoctoral degree in a related field and experience with Python and ML frameworks. | Post-train | 9 |
| Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI Senior/Staff Machine Learning Engineer on the General Agents team, responsible for designing, building, and deploying production-ready AI agents for enterprise use cases. This role involves working across the full agent lifecycle, from model and system design to evaluation, deployment, and iteration, bridging cutting-edge agentic techniques with real-world deployment constraints. | Agent | 9 |
| Forward Deployed AI Engineering Manager, Enterprise This role is for an Engineering Manager on the Enterprise team at Scale AI, focusing on being a technical bridge between Scale AI's AI capabilities and strategic enterprise customers. Responsibilities include understanding customer challenges, leading a team to architect and deploy AI solutions (specifically AI agents and integrations), prompt engineering, and RAG systems. The role requires strong software engineering and management experience, Python expertise, and cloud platform familiarity, with a focus on customer-facing AI deployments. | AgentServe | 9 |
| Distinguished Engineer Distinguished Engineer to shape the vision and technical roadmap of core AI/ML infrastructure powering enterprise AI applications, driving long-term technical direction for the Scale GenerativeAI Platform (SGP), influencing architecture, and partnering with leaders to deliver advanced AI capabilities to enterprise customers. This hands-on technical leader will set standards, mentor senior engineers, and ensure global-scale deployment readiness. | ServeAgent | 9 |
| Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI Scale AI is seeking a Staff Machine Learning Research Engineer focused on post-training algorithms for complex agents in enterprise GenAI applications. The role involves building a next-generation Agent RL training platform, integrating cutting-edge research, and training state-of-the-art models for enterprise customers, including cybersecurity and healthtech use cases. Experience with LLM training, post-training methods like RLHF/RLVR, and publications in top conferences are required. | Post-trainAgent | 9 |
| Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI Scale AI is seeking an ML Systems Research Engineer to work on building algorithms for their next-gen Agent RL training platform, supporting large-scale training, and researching/integrating state-of-the-art technologies to optimize ML systems. The role involves post-training state-of-the-art models for enterprise engagements and creating next-gen agent training algorithms for multi-agent/multi-tool rollouts. | Post-trainAgent | 9 |
| Machine Learning Research Engineer, Agents - Enterprise GenAI Research Engineer focused on building and training advanced AI agents for enterprise GenAI applications, utilizing post-training and agent-building algorithms on real-world datasets to achieve state-of-the-art results. | AgentPost-train | 9 |
| Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI This role focuses on researching and building synthetic data pipelines and agents to improve enterprise GenAI models. It involves creating agents for trace analysis, contributing to an agent-building framework, and training state-of-the-art models using post-training and agent-building algorithms. | Post-trainAgent | 9 |
| Engineering Manager, AgentOps Engineering Manager for AgentOps team focused on building an Agent Development Platform to manage agent lifecycles, including building, deploying, monitoring, evaluating, and improving agents. The platform aims to support RL workflows and knowledge capture for continuous performance improvements. | Agent | 9 |
| Machine Learning Research Scientist, Reasoning Machine Learning Research Scientist focused on reasoning in LLMs, specifically for agentic systems like browser and software engineering agents. The role involves studying critical data types, identifying effective data sources and methodologies to improve LLM reasoning, and contributing to research while collaborating with engineering teams to implement solutions. | AgentPost-train | 9 |
| ML Research Engineer, ML Systems ML Research Engineer focused on building and optimizing the internal distributed framework for large language model training and inference, supporting ML research and development. | ServePost-train | 9 |
| Machine Learning Research Scientist, Post-Training Research Scientist focused on LLM post-training techniques (SFT, RLHF, reward modeling) to enhance text and multimodal capabilities. Involves optimizing data curation, analyzing model behavior, and publishing findings. | Post-train | 9 |
| Senior / Staff Machine Learning Research Scientist, Agents Research Scientist role focused on building state-of-the-art AI agents, studying essential data types for agents like browser and SWE agents, and guiding data strategy to advance intelligent, adaptable AI agents. The role involves contributing to research publications, collaborating with customer researchers, and translating advancements into scalable solutions. | Agent | 9 |
| Strategic Projects Lead, Red Team Scale AI is seeking a Strategic Projects Lead for their Red Team and Safety function. This role focuses on managing partnerships with frontier AI model developers, stress-testing AI models, and shaping their deployment. The lead will act as a subject-matter expert, coordinate delivery with research and operations, and contribute to public benchmark launches. The role requires technical curiosity, operational rigor, and strong communication skills to bridge technical and commercial audiences. | Eval Gate | 8 |
| Head of Policy & Security Research Lab Lead a team of research scientists, policy experts, and engineers focused on foundational AI safety and security work, including developing frameworks and benchmarks for frontier AI models. The role requires a strong technical and policy background with extensive knowledge of frontier risk evaluations, AI control, and preparedness research. | Eval Gate | 8 |
| Forward Deployed AI Engineer, Enterprise Forward Deployed AI Engineer role focused on integrating Scale AI's capabilities with enterprise clients, architecting custom AI solutions, developing production-grade AI agents, and ensuring successful deployment and adoption. Involves customer integration, AI agent development, prompt engineering, and technical leadership. | AgentServe | 8 |
| Research Advisor - Human Frontier Collective (UK) Independent contractor opportunity for a Research Advisor to join the Human Frontier Collective (HFC) at Scale AI. The role involves providing consultancy on model behavior and domain-specific logic, collaborating on research to design evaluation frameworks for frontier models, engaging with clients as a Subject Matter Expert, creating technical content, and contributing to research publications. The role requires 5+ years of relevant industry experience with strong domain knowledge in fields like finance, legal, or medical, and advanced degrees. The compensation is $300/hr USD. | Eval Gate | 8 |
| Research Advisor - Human Frontier Collective (US) This role involves providing expert consultancy on AI model behavior and governance, collaborating on research to design evaluation frameworks for frontier models, engaging with clients as a Subject Matter Expert, creating technical content, and co-authoring research publications. The focus is on evaluating and interpreting advanced generative AI systems, particularly in specialized domains like finance, legal, or medical. | Eval Gate | 8 |
| SWE Fellow - Human Frontier Collective (Canada) This role focuses on evaluating and interpreting advanced generative AI systems, designing datasets for rigorous evaluation, and contributing to research publications. It involves collaborating with AI labs and platforms to enhance model accuracy and reasoning, positioning it as a key player in the AI evaluation gate. | Eval Gate | 8 |
| SWE Fellow - Human Frontier Collective (UK) This role is for a Software Engineer Fellow focused on evaluating and interpreting advanced generative AI systems within the Human Frontier Collective program. The fellow will design datasets, evaluate AI models, and contribute to research publications, aiming to enhance AI accuracy and reasoning. | Eval Gate | 8 |
| SWE Fellow - Human Frontier Collective (US) This role focuses on evaluating advanced generative AI systems by designing domain-specific problems and datasets, providing expert insights to enhance model performance, and contributing to research publications. It's a research-oriented role within a fellowship program aimed at shaping the future of AI. | Eval Gate | 8 |
| Senior AI Infrastructure Engineer - Training Platform This role focuses on building and scaling the infrastructure for large-scale AI model training, specifically the 'Operating System' for GPU clusters. It involves architecting a high-performance training platform, managing multi-tenant orchestration, optimizing job scheduling, and ensuring deep observability and reliability for massive workloads. The goal is to maximize the efficiency and velocity of AI researchers training advanced models. | Data | 8 |
| Staff Software Engineer, Public Sector Staff Software Engineer at Scale AI focused on building core product building blocks for agentic capabilities in the public sector. Responsibilities include processing federal datasets for real-time decision-making, developing multi-layered guardrails around agents, optimizing data retrieval, orchestrating asynchronous agents, and alerting users to data deviations. The role involves mentoring engineers, defining technical strategy for agentic features, ensuring system reliability, and consulting on AI-powered solutions for federal contracts. | Agent | 8 |
| ML Systems Engineer, Robotics ML Systems Engineer focused on building and scaling serving platforms for robotics-related foundation models, optimizing algorithms for cloud GPUs, and developing internal platforms for model capability discovery. The role involves backend system design, ML infrastructure, and ensuring low latency for real-time applications. | ServeAgent | 8 |
| Machine Learning Fellow - Human Frontier Collective (UK) This role is for a Machine Learning Fellow focused on designing, evaluating, and interpreting advanced generative AI systems. The fellow will work on ML projects, contribute to research publications, and engage with a community of AI researchers. The role involves optimizing PyTorch models, evaluating ML code, advising on GPU optimization, and collaborating on research papers. | Post-train | 8 |
| Machine Learning Fellow - Human Frontier Collective (US) This role involves applying academic and professional expertise to design, evaluate, and interpret advanced generative AI systems. The fellow will work on ML projects, optimize PyTorch models, evaluate ML code, advise on GPU optimization, and contribute to research publications and technical reports. | Post-train | 8 |
| Product Manager of AI Applications, Global Public Sector Product Manager for AI Applications in the Global Public Sector at Scale AI, focusing on developing custom AI solutions and LLMs for government clients. The role involves leading cross-functional teams, understanding client needs, and ensuring the successful delivery of AI-powered products. | ShipPost-train | 8 |
| STEM Fellow - Human Frontier Collective (UK) This role focuses on evaluating and interpreting advanced generative AI systems by designing domain-specific problems and datasets, providing expert insights to enhance model performance, and contributing to research publications. It's a research-oriented fellowship with a focus on AI evaluation. | Eval Gate | 8 |
| Staff Product Manager, Agentic Platform Product Manager for Agentic AI platforms supporting national security decisions, focusing on the entire lifecycle from design to launch, with a strong emphasis on government and regulated environments, ethical AI, and risk management. | Agent | 8 |
| Tech Lead Manager- MLRE, ML Systems Tech Lead Manager for MLRE, ML Systems at Scale AI, focusing on building and optimizing the internal distributed framework for large language model training and evaluation. The role involves collaborating with ML teams to accelerate research and development, and integrating state-of-the-art technologies to optimize the ML system, supporting both training and inference. | Post-trainServe | 8 |
| Product Manager of AI Applications, Global Public Sector Product Manager for AI Applications within Scale AI's Global Public Sector team, focusing on developing bespoke AI solutions for governments and government-backed entities. This role involves leading cross-functional teams to build custom AI applications and LLMs, leveraging the Scale GenAI Platform, and ensuring customer success through client workshops, use case scoping, and continuous feedback. | ShipPost-train | 8 |
| Principal Architect Seeking a Principal Architect to lead a team of 50+ engineers in designing, developing, and deploying agentic AI products, focusing on LLMs and AI agents, for defense applications. The role involves technical direction, executive stakeholder engagement, and strategic planning, with success metrics tied to customer demonstrations and contract awards. | Agent | 8 |
| Forward Deployed AI Engineering Manager, GenAI Applications Engineering Manager to lead a Forward Deployed Engineering (FDE) team focused on delivering high-impact GenAI solutions in production for enterprise customers. The role involves leading, mentoring, and growing the team, working hands-on with customers to solve complex business and technical challenges, and shaping the product roadmap based on customer needs. | Ship | 8 |
| STEM Fellow - Human Frontier Collective (US) This role focuses on evaluating and interpreting advanced generative AI systems by designing domain-specific problems and datasets, providing expert insights to enhance model performance, and contributing to research publications. It involves collaboration with AI labs and Scale's research team. | Eval GatePost-train | 8 |
| Applied AI Engineer, Enterprise Scale AI is seeking an Applied AI Engineer for their Enterprise team in London, UK. The role involves working with clients to build advanced AI agents and ML solutions using the Scale Generative Platform (SGP), focusing on enterprise needs such as cybersecurity and genomics. Responsibilities include owning AI strategy, leveraging SGP for agent development (including multimodal and tool-calling features), gathering business requirements, collaborating with clients and internal teams, and deploying production code. The ideal candidate has a strong engineering background, Python proficiency, and experience with cloud ML development. Familiarity with Generative AI in production is a plus. | Agent | 8 |
| Senior AI Infrastructure Engineer, Model Serving Platform Senior AI Infrastructure Engineer focused on building and maintaining scalable, reliable, and efficient platforms for serving LLMs. The role involves backend system design, integrating models, developing monitoring solutions, and leading projects end-to-end. Requires strong programming skills and experience with LLM serving fundamentals and container orchestration. | Serve | 8 |
| AI Strategy Consultant, Frontier Tech This role focuses on designing and executing research experiments, building and evaluating frontier LLM datasets, and developing training/testing material to improve the quality of AI products. It involves close collaboration with ML research scientists and SPM teams, with a strong emphasis on analytical and problem-solving skills in a fast-paced environment. | DataEval Gate | 8 |
| Applied AI Engineer, Global Public Sector Applied AI Engineer for Scale AI's Global Public Sector team, focusing on building custom end-to-end AI applications for public sector clients, generating training data for LLMs, and providing AI advisory services. The role involves deploying AI solutions, creating datasets, fine-tuning models, and establishing evaluation frameworks. | AgentShip | 8 |
| Director, Technical Program Manager Director of Technical Program Management focused on building and scaling a TPM team to execute enterprise-grade AI initiatives. This role involves strategic planning, organizational design, cross-functional orchestration, and executive communication, with a requirement for deep expertise in the AI/ML lifecycle and deploying GenAI at scale. | Ship | 7 |
| Product Manager, Enterprise Core Platform Product Manager responsible for defining and owning the core platform infrastructure that powers AI agent deployment for enterprise customers. This role focuses on sequencing platform work, identifying reusable patterns to graduate to core, and ensuring a high-quality, trustworthy foundation for AI delivery teams. | Agent | 7 |