Currently tracking 82 active AI roles, up 61% versus the prior 4 weeks. Primary focus: Agent · Engineering. Salary range $139k–$393k (avg $256k).
Data AI · Data labeling
| Title | Stage | AI score |
|---|---|---|
| Technical Lead Manager, Physical AI Scale AI is seeking a Technical Lead Manager for their Physical AI team to lead research engineers in developing and evaluating Large-Scale Foundation Models for robots and AVs. The role involves hands-on contributions to model scaling, VLA/world model development, and data strategy, alongside team mentorship and translating research into production-ready features. | PretrainAgent | 9 |
| Staff Applied AI Engineer, Enterprise GenAI Scale AI is seeking a Staff Applied AI Engineer to build advanced AI agents for enterprise clients using their Generative Platform (SGP). The role involves owning and optimizing AI solutions, leveraging SGP for multimodal functionality and tool-calling, gathering business requirements, collaborating with clients, and pushing production code in customer and Scale codebases. The ideal candidate has 7+ years of experience, a strong engineering background, and familiarity with data-driven ML model iteration and cloud environments. |
| Agent |
| 9 |
| Director, Enterprise Machine Learning & Research Director of Enterprise ML at Scale AI, leading research scientists and engineers in GenAI initiatives. The role involves defining and driving a multi-year research roadmap, collaborating cross-functionally, and communicating research outcomes. Focus is on turning research into production-ready systems, with experience in evaluation, post-training, agents, and RL environments. Requires strong research background, publications, and team leadership experience. | Post-trainAgent | 9 |
| Research Scientist, Frontier Risk Evaluations Research Scientist role focused on designing and building evaluation measures, harnesses, and datasets for frontier AI systems, with a focus on identifying and mitigating risks. The role involves collaboration with external agencies and publishing findings, bridging AI research and policy. | Eval GateAgent | 9 |
| Research Scientist, Agent Robustness Research Scientist focused on agent robustness, AI safety, and risk evaluations. The role involves researching AI agent capabilities, designing tests for harmful actions, creating exploits and mitigations for failure modes, and characterizing risks in multi-agent systems. Experience with post-training techniques like RLHF and published research in generative AI is required. | AgentEval Gate | 9 |
| Research Scientist, AI Controls and Monitoring Research Scientist role focused on designing methods, systems, and experiments for AI controls and monitoring, ensuring advanced AI models and agents remain aligned with intended goals, even in high-stakes or adversarial environments. This includes developing monitoring techniques, researching layered control mechanisms, designing red-team simulations, and collaborating with policymakers. | Eval GatePost-train | 9 |
| Senior/Staff Machine Learning Engineer, General Agents, Enterprise GenAI Senior/Staff Machine Learning Engineer on the General Agents team, responsible for designing, building, and deploying production-ready AI agents for enterprise use cases. This role involves working across the full agent lifecycle, from model and system design to evaluation, deployment, and iteration, bridging cutting-edge agentic techniques with real-world deployment constraints. | Agent | 9 |
| Forward Deployed AI Engineering Manager, Enterprise This role is for an Engineering Manager on the Enterprise team at Scale AI, focusing on being a technical bridge between Scale AI's AI capabilities and strategic enterprise customers. Responsibilities include understanding customer challenges, leading a team to architect and deploy AI solutions (specifically AI agents and integrations), prompt engineering, and RAG systems. The role requires strong software engineering and management experience, Python expertise, and cloud platform familiarity, with a focus on customer-facing AI deployments. | AgentServe | 9 |
| Distinguished Engineer Distinguished Engineer to shape the vision and technical roadmap of core AI/ML infrastructure powering enterprise AI applications, driving long-term technical direction for the Scale GenerativeAI Platform (SGP), influencing architecture, and partnering with leaders to deliver advanced AI capabilities to enterprise customers. This hands-on technical leader will set standards, mentor senior engineers, and ensure global-scale deployment readiness. | ServeAgent | 9 |
| Manager, Machine Learning Research Scientist, GenAI Manager for a GenAI research team focused on evaluation, post-training, agents, and RL environments. The role involves leading a team, defining research roadmaps, driving execution, and collaborating cross-functionally. Requires a strong research background with publications and experience in fast-paced environments. | Post-trainAgent | 9 |
| Evals Engineer, Applied AI Scale AI is looking for an AI Research Engineer to join their Enterprise Evaluations team, focusing on building and improving GenAI Evaluation Suites for enterprise LLM-powered workflows and agents. The role involves creating human-rated datasets, designing LLM-as-a-Judge autorater frameworks, and researching new methodologies for evaluating AI systems. | Eval GateAgent | 9 |
| Staff Machine Learning Research Scientist, LLM Evals Scale AI is seeking a Staff Machine Learning Research Scientist to lead the development of novel evaluation methodologies, metrics, and benchmarks for large language models (LLMs). This role focuses on defining and measuring the capabilities and limitations of frontier LLMs, driving research that informs internal roadmaps and the broader community. Responsibilities include researching existing evaluation techniques, designing new benchmarks, implementing scalable evaluation pipelines, publishing findings, and mentoring junior researchers. The ideal candidate has 5+ years of experience in LLMs/NLP, a strong publication record, and experience leading research teams. | Eval GatePost-train | 9 |
| Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI Scale AI is seeking a Staff Machine Learning Research Engineer focused on post-training algorithms for complex agents in enterprise GenAI applications. The role involves building a next-generation Agent RL training platform, integrating cutting-edge research, and training state-of-the-art models for enterprise customers, including cybersecurity and healthtech use cases. Experience with LLM training, post-training methods like RLHF/RLVR, and publications in top conferences are required. | Post-trainAgent | 9 |
| Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI Scale AI is seeking an ML Systems Research Engineer to work on building algorithms for their next-gen Agent RL training platform, supporting large-scale training, and researching/integrating state-of-the-art technologies to optimize ML systems. The role involves post-training state-of-the-art models for enterprise engagements and creating next-gen agent training algorithms for multi-agent/multi-tool rollouts. | Post-trainAgent | 9 |
| Machine Learning Research Engineer, Agents - Enterprise GenAI Research Engineer focused on building and training advanced AI agents for enterprise GenAI applications, utilizing post-training and agent-building algorithms on real-world datasets to achieve state-of-the-art results. | AgentPost-train | 9 |
| Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI This role focuses on researching and building synthetic data pipelines and agents to improve enterprise GenAI models. It involves creating agents for trace analysis, contributing to an agent-building framework, and training state-of-the-art models using post-training and agent-building algorithms. | Post-trainAgent | 9 |
| Engineering Manager, AgentOps Engineering Manager for AgentOps team focused on building an Agent Development Platform to manage agent lifecycles, including building, deploying, monitoring, evaluating, and improving agents. The platform aims to support RL workflows and knowledge capture for continuous performance improvements. | Agent | 9 |
| Deep Research Agent Tech Lead Scale AI is looking for a Staff/Senior Staff ML Engineer to lead Deep Research Agent Development for enterprise applications. This role involves setting technical strategy, driving research to production, and leading a team in building, orchestrating, and evaluating multi-agent systems at scale. Requires strong experience in Generative AI, LLMs, and AI Agents, with a focus on integrating diverse data modalities and ensuring production-readiness. | Agent | 9 |
| Machine Learning Research Scientist, Reasoning Machine Learning Research Scientist focused on reasoning in LLMs, specifically for agentic systems like browser and software engineering agents. The role involves studying critical data types, identifying effective data sources and methodologies to improve LLM reasoning, and contributing to research while collaborating with engineering teams to implement solutions. | AgentPost-train | 9 |
| Senior Forward Deployed AI Engineer, Enterprise Senior Forward Deployed AI Engineer for Scale AI's Enterprise team, acting as a technical bridge to strategic customers. Responsibilities include understanding customer challenges, architecting custom AI solutions, ensuring successful deployment and adoption of AI systems, developing production-grade AI agents and multi-agent systems, implementing evaluation frameworks, prompt engineering, RAG, fine-tuning, and collaborating with customer and internal teams. Requires strong software engineering, Python, ML/AI frameworks, and cloud platform experience. | AgentData | 9 |
| ML Research Engineer, ML Systems ML Research Engineer focused on building and optimizing the internal distributed framework for large language model training and inference, supporting ML research and development. | ServePost-train | 9 |
| Machine Learning Research Scientist, Post-Training Research Scientist focused on LLM post-training techniques (SFT, RLHF, reward modeling) to enhance text and multimodal capabilities. Involves optimizing data curation, analyzing model behavior, and publishing findings. | Post-train | 9 |
| Machine Learning Research Engineer, GenAI Applied ML Lead applied ML engineering on Scale's Applied ML team, focusing on building and deploying scalable multi-agent systems to validate agentic reasoning and behaviors, scale human expertise, and drive research into real-world agent reliability failures, shipping production fixes. | Agent | 9 |
| Senior / Staff Machine Learning Research Scientist, Agents Research Scientist role focused on building state-of-the-art AI agents, studying essential data types for agents like browser and SWE agents, and guiding data strategy to advance intelligent, adaptable AI agents. The role involves contributing to research publications, collaborating with customer researchers, and translating advancements into scalable solutions. | Agent | 9 |
| Tech Lead/Manager, Machine Learning Research Scientist- LLM Evals Scale AI is seeking a Tech Lead/Manager for their LLM Evals Research team. This role involves leading a team to develop and implement novel evaluation methodologies, metrics, and benchmarks for large language models, focusing on areas like instruction following, factuality, robustness, and fairness. The position requires research into LLM evaluation techniques, communication with clients and internal teams, implementation of scalable evaluation pipelines, and publishing research findings. The ideal candidate has extensive experience in LLMs, NLP, and Transformer modeling, with a proven track record of research impact and team leadership. | Eval GatePost-train | 9 |
| Senior Machine Learning Engineer, Public Sector Senior Machine Learning Engineer focused on deploying and improving generative AI, computer vision, reinforcement learning, and agentic AI models for mission-critical government systems. The role involves building agent frameworks, fine-tuning models, and advancing research in RL for LLMs, with a strong emphasis on production environments and large datasets. | AgentPost-train | 9 |
| Senior AI Infrastructure Engineer - Training Platform This role focuses on building and scaling the infrastructure for large-scale AI model training, specifically the 'Operating System' for GPU clusters. It involves architecting a high-performance training platform, managing multi-tenant orchestration, optimizing job scheduling, and ensuring deep observability and reliability for massive workloads. The goal is to maximize the efficiency and velocity of AI researchers training advanced models. | Data | 8 |
| Staff Software Engineer, Public Sector Staff Software Engineer at Scale AI focused on building core product building blocks for agentic capabilities in the public sector. Responsibilities include processing federal datasets for real-time decision-making, developing multi-layered guardrails around agents, optimizing data retrieval, orchestrating asynchronous agents, and alerting users to data deviations. The role involves mentoring engineers, defining technical strategy for agentic features, ensuring system reliability, and consulting on AI-powered solutions for federal contracts. | Agent | 8 |
| ML Systems Engineer, Robotics ML Systems Engineer focused on building and scaling serving platforms for robotics-related foundation models, optimizing algorithms for cloud GPUs, and developing internal platforms for model capability discovery. The role involves backend system design, ML infrastructure, and ensuring low latency for real-time applications. | ServeAgent | 8 |
| Senior Forward Deployed Data Scientist/Engineer Senior Forward Deployed Data Scientist/Engineer to partner with enterprise customers, understand workflows and pain points, and build end-to-end data products and AI/ML systems. This role involves defining metrics, designing experiments, building tools (evaluation explorers, workflow applications, decision-support systems), and driving solutions into production, with a focus on measurable business impact and rigorous evaluation. Experience with AI-assisted development tools is expected. | ShipEval Gate | 8 |
| Senior Machine Learning Engineer - Model Evaluations, Public Sector This role focuses on building and scaling automated evaluation pipelines for AI systems, including LLMs and agentic models, to ensure their reliability, safety, and effectiveness in mission-critical government environments. It involves designing test datasets, benchmarks, and frameworks for various metrics, including LLM-judge evaluations, agent testing, and stress tests. | Eval GateAgent | 8 |
| Staff Product Manager, Agentic Platform Product Manager for Agentic AI platforms supporting national security decisions, focusing on the entire lifecycle from design to launch, with a strong emphasis on government and regulated environments, ethical AI, and risk management. | Agent | 8 |
| Tech Lead Manager- MLRE, ML Systems Tech Lead Manager for MLRE, ML Systems at Scale AI, focusing on building and optimizing the internal distributed framework for large language model training and evaluation. The role involves collaborating with ML teams to accelerate research and development, and integrating state-of-the-art technologies to optimize the ML system, supporting both training and inference. | Post-trainServe | 8 |
| AI Product Manager AI Product Manager to own the Agent & Reinforcement Learning Environments data vertical, focusing on Computer Using Agent (CUA) data. Responsibilities include owning the product roadmap, data generation pipelines, quality, and researcher-facing tools for training and evaluating intelligent agents. Requires a blend of entrepreneurial, go-to-market, and technical skills, with experience in product management and understanding of RL, simulation environments, and data pipelines for model training/evaluation. | AgentData | 8 |
| Principal Architect Seeking a Principal Architect to lead a team of 50+ engineers in designing, developing, and deploying agentic AI products, focusing on LLMs and AI agents, for defense applications. The role involves technical direction, executive stakeholder engagement, and strategic planning, with success metrics tied to customer demonstrations and contract awards. | Agent | 8 |
| Senior AI Infrastructure Engineer, Model Serving Platform Senior AI Infrastructure Engineer focused on building and maintaining scalable, reliable, and efficient platforms for serving LLMs. The role involves backend system design, integrating models, developing monitoring solutions, and leading projects end-to-end. Requires strong programming skills and experience with LLM serving fundamentals and container orchestration. | Serve | 8 |
| Applied AI Engineer, Enterprise GenAI Scale AI is seeking an Applied AI Engineer to build advanced AI agents for enterprise clients using their Generative Platform (SGP). The role involves owning AI solutions for complex technical problems, leveraging multimodal functionality and tool-calling, and collaborating with clients and internal teams. The engineer will push production code and work with data-driven experiments to improve product performance. | Agent | 8 |
| SACC DC 2026 - Job Application Scale AI is seeking candidates who attended the DC SACC '26 conference for roles focused on developing reliable AI systems for critical decisions, particularly in defense applications and autonomous vehicles. The company is a leading AI data foundry, providing data and technologies for AI model development and deployment. | Ship | 7 |
| Product Manager, Public Sector GenAI Test & Evaluation (T&E) Product Manager for GenAI Test & Evaluation (T&E) in the Public Sector team at Scale AI. This role focuses on defining the vision and roadmap for evaluation capabilities, owning the T&E tech stack to measure and improve agentic applications. Requires strong engineering depth, experience with evaluation systems, problem distillation, ambiguity management, cross-functional leadership, and operational execution. Experience with GenAI implementation, public sector work, and security clearance are preferred. | Eval Gate | 7 |
| Senior Software Engineer, Public Sector Senior Software Engineer focused on building core product components for agentic capabilities in the public sector, involving processing federal datasets for real-time decision-making and developing multi-layered guardrails, data retrieval optimization, and agent orchestration. | Agent | 7 |
| Product Manager, Gen AI Product Manager for GenAI at Scale AI, focusing on building data infrastructure and tooling for AI model training and evaluation. The role involves shaping products for both customers (demand side) and contributors (supply side), requiring end-to-end product ownership from strategy to execution. The position is cross-functional, working with engineering, design, and data science teams in a fast-paced, growth-stage environment. | Data | 7 |
| Product Manager, Data Engine Product Manager for Scale AI's Public Sector Data Engine, focusing on building ML Ops infrastructure for computer vision and generative AI models used in national security systems. The role involves architecting the AI engine, managing roadmaps, technical scoping, and operationalizing collaboration between engineering, operations, and government stakeholders. | ShipData | 7 |
| Infrastructure Software Engineer, Enterprise GenAI Scale AI is seeking an Infrastructure Software Engineer to build and scale their enterprise GenAI platform, focusing on multi-cloud systems, customer data integrations, and productizing AI technologies like LLMs and vector databases for regulated industries. | Serve | 7 |
| Technical Program Manager, Robotics Data Technical Program Manager for Robotics Data at Scale AI, responsible for driving complex technical operations, managing a portfolio of robotics data projects, ensuring milestones are met, and de-risking dependencies. The role involves bridging technical and operational gaps, architecting scalable data ingestion pipelines for VLA models, enforcing data quality and governance, standardizing processes for scale, and communicating progress with data-driven updates. Requires a strong technical foundation in AI/ML, TPM experience in robotics or data operations, and hands-on problem-solving skills. | Data | 7 |
| Robotics Engineer Scale AI is looking for a Robotics Engineer to join their expanding Robotics business unit. The role focuses on building out the company's robotics fleet and software systems for data collection and evaluations, with responsibilities including developing data collection systems, designing hardware, building pipelines, and owning hardware/software integrations. The ideal candidate has a strong engineering background, experience in Python/C++/Java, hardware labs, mechanical design, robotics, AI, computer vision, and managing data collection operations. | Data | 7 |
| Senior Software Engineer, Agentic Data Products Senior Software Engineer role on a new Agentic Data Products team focused on building next-generation agent-powered tools for enterprises. The role involves full-stack development, integrating LLMs, vector databases, and agentic frameworks to create intelligent systems that reason over data and take action. It's a 0->1 build with a focus on shipping quickly and owning product areas from concept to deployment. | Agent | 7 |
| GenAI Strategic Projects Lead, Public Sector Scale AI is seeking a GenAI Strategic Projects Lead for their Public Sector team in Washington, DC. This role will own high-impact projects focused on generative AI data labeling pipelines, operational processes for data workforce management, and improving training/evaluation dataset quality for public sector customers, particularly in national security. The role involves developing infrastructure, managing data production pipelines, partnering with SMEs and customers, and influencing cross-organizational strategy to enable mission-critical AI applications. | Post-trainData | 7 |
| Solutions Engineer, Robotics Solutions Engineer for Scale AI's Robotics team, focusing on physical AI and autonomous systems. The role involves partnering with customers to deliver pilots, influence product roadmaps, and become a domain expert in robotics and physical AI. Key responsibilities include technical customer engagement, designing solutions, and contributing to the development of a Robotics Data Marketplace. | Ship | 7 |
| Engagement Manager (Homeland Layered Defense), Public Sector This role focuses on delivering agentic AI solutions for homeland defense clients, managing customer relationships, and partnering with engineering to build and deploy AI systems tailored to government use cases in computer vision and generative AI. | Agent | 7 |
| Technical Program Manager, Public Sector This role is a Technical Program Manager for Scale AI's Public Sector division, focusing on customer success and driving the delivery of AI/ML solutions. The TPM will manage technical projects, leverage data and analytics to solve customer problems, and partner with engineering teams to deliver AI systems tailored to government use cases in computer vision and generative AI. The role requires a technical background, understanding of ML operations, and experience in stakeholder management within a regulated environment. | Ship | 7 |