What AI roles is Handshake hiring for?

Handshake currently has 65 active AI-related roles in our index. The most common open titles are: Music Producer - AI Trainer (2), Strategic Projects Lead, Coding (2), 3D Slicer Specialist - AI Trainer , AI Red Teamer, LLM Generalist, Analog Engineer - AI Trainer. Most positions are in Engineering and Research.

What stage of AI development does Handshake focus on?

Handshake's active AI hiring is concentrated in: data (66%), agents (15%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.

Where is Handshake hiring AI talent?

Handshake is hiring AI talent in: United States (22 roles), India (4 roles).

What technologies does Handshake's AI team work with?

Job postings at Handshake most frequently reference: evals, synthetic data, model serving, agent orchestration, llm observability.

How many AI roles has Handshake posted recently?

In the past 30 days, Handshake has posted 10 new AI-related roles. That is a -57% change versus the prior 30 days (23 → 10).

Handshake — AI hiring signals

Handshake currently has 68 active AI-related job listings. The majority of these roles, 60%, are focused on data, with a further 15% in agents. Research and Engineering are the most frequent functions hiring for these positions. Recent hiring activity shows a significant increase, with 23 new AI roles posted in the last 30 days, representing a 92% rise compared to the preceding 30-day period.

Auto-generated from active job postings · last refreshed 2026-05-24

Currently tracking 23 active AI roles, down 48% versus the prior 4 weeks. Primary focus: Agent · Engineering.

Hiring

23 / 33

Momentum (4w)

↓-22 -48%

24 opens last 4w · 46 prior 4w

Salary range

—

Tracked since

Dec '25

last role 5w ago

Hiring velocityscroll left for older weeks

1 new role

Sep 15

1 new role

Nov 17

3 new roles

Dec 15

1 new role

Jan 26

2 new roles

Feb 2

3 new roles

9 new roles

17 new roles

Mar 2

33 new roles

3 new roles

4 new roles

5 new roles

Apr 6

3 new roles

8 new roles

4 new roles

23 new roles

May 4

8 new roles

6 new roles

9 new roles

3 new roles

Jun 1

8 new roles

3 new roles

10 new roles

Jobs (42)

24 AI · 116 total active

Title	Stage	Function	Location	First seen	AI score
AI Red Teamer, CBRNE This role focuses on evaluating AI models for safety and security, specifically concerning CBRNE threats. The Red Teamer will design adversarial prompts, assess model outputs for dangerous knowledge gaps, and document findings to help labs improve model defenses before they reach the real world. This requires deep domain expertise in CBRNE fields, strong ethical judgment, and the ability to think like a threat actor within a structured evaluation framework.	Eval Gate	Research	Seattle, WA	4w ago	9
AI Red Teamer, LLM Generalist The AI Red Teamer, LLM Generalist role focuses on stress-testing large language models by designing creative, adversarial prompts to expose vulnerabilities in AI safety, guardrails, and robustness. This involves probing models across various risk categories (content safety, CBRN, cybersecurity, etc.) and potentially across different modalities (text, image, voice, agentic). The role requires strong prompt crafting skills, ethical judgment, and collaboration with engineers and researchers to share findings and strengthen defenses. It is a generalist role that may involve working with sensitive content.	Eval GateAgent	Engineering	Seattle, WA	4w ago	9
Senior Forward Deployed Engineer, Handshake AI Enterprise Senior Forward Deployed Engineer to embed within enterprise customer environments, define AI-driven solutions, build and deploy production-grade AI agents, and design/run evals to measure performance. The role requires full-stack ownership, deep understanding of customer business, and iteration until performance improves. Emphasis on real-world AI application shipping and systematic improvement of AI performance.	AgentEval Gate	Engineering	San Francisco, CA	8w ago	9
AI PhD Student Researcher - Fall 2026 Handshake AI is seeking a PhD Student Researcher to work on novel RLHF/GRPO pipelines, instruction-following refinements, reasoning-trace supervision, multilingual/long-horizon/domain-specific benchmarks, automatic vs. human preference studies, robustness diagnostics, active-learning loops, data value estimation, synthetic data generation, and low-resource fine-tuning strategies. The goal is to produce an archive-ready manuscript or top-tier conference submission.	Post-trainEval Gate	Research	San Francisco, CA	Apr 8	9
Engineering Manager, RLE This role is for a Senior Software Engineer to build and scale a Reinforcement Learning Environments (RLE) platform. This platform simulates real-world workflows for AI models to learn, generating data for training and evaluation. The engineer will drive architecture, build plug-and-play domains, and ensure system reliability and quality, working closely with research, product, and operations teams. Strong applied AI experience is required.	DataEval Gate	Engineering	India	3w ago	8
Manager Strategic Projects India Manager for a team of Strategic Project Leads (SPLs) focused on AI data and evaluation projects. The role involves leading delivery, quality, and scalability, managing a team, translating needs into project plans, owning performance metrics, and partnering with Product and Engineering. The role operates in a high-pressure, fast-changing environment with a focus on operational excellence and continuous improvement in AI data pipelines and labeling workflows.	DataEval Gate	Engineering	India	4w ago	8
AI Red Teamer, Cybersecurity This role focuses on evaluating AI models, specifically LLMs, for cybersecurity vulnerabilities. The AI Red Teamer will craft adversarial prompts and multi-turn interactions to test if models can be manipulated into generating functional malware, exploit code, or attack tooling. The core responsibility is to assess the output's real-world exploitability and contribute to improving model safety guardrails. This involves deep cybersecurity expertise and understanding attacker methodologies.	Eval GateAgent	Engineering	Seattle, WA	4w ago	8
Senior Product Manager, RL Environments — Handshake AI Senior Product Manager to own the product surface that turns RL environment creation from a bespoke, weeks-long lift into a repeatable factory. This role will design and ship the platform that compresses lead time, replaces hand-built workflows with self-serve tooling, and lets a small team of operators turn out high-quality environments for any vertical.	DataEval Gate	Product	San Francisco, CA	5w ago	8
Applied AI Engineer, Handshake AI Enterprise Applied AI Engineer role focused on building and deploying production-grade AI agents within enterprise customer environments. The role involves understanding customer business needs, developing agents, running evaluations, and iterating on performance to drive measurable business impact. Requires backend engineering depth and experience with AI/ML systems in production.	Agent	Engineering	San Francisco, CA	5w ago	8
Senior Applied AI Engineer, Handshake AI Enterprise Senior Applied AI Engineer role focused on embedding within enterprise customer environments to build and deploy production-grade AI agents. The role involves defining AI-driven solutions, owning end-to-end delivery, designing and running evaluations, and iterating on performance. It requires strong applied AI and backend experience, with a focus on real-world application and systems thinking.	Agent	Engineering	San Francisco, CA	5w ago	8
Forward Deployed Engineer, Handshake AI Enterprise Forward Deployed Engineer to embed within enterprise customer environments, define AI-driven solutions, build and deploy production-grade AI agents, and design evals to measure and improve performance. The role requires full-stack capabilities with strong backend depth and real-world experience shipping AI applications.	AgentEval Gate	Engineering	San Francisco, CA	6w ago	8
Software Engineer, Agentic Infrastructure Software Engineer focused on building the core agent orchestration layer, including tool use, memory, and multi-step reasoning systems, to power AI-driven features for millions of users. The role also involves designing evaluation, observability, and reliability frameworks for agent behavior and establishing engineering standards for agentic development.	Agent	Engineering	San Francisco, CA	7w ago	8
Senior Software Engineer, Agentic Infrastructure Senior Software Engineer to architect and build the foundational systems for AI agents, including tool use, memory, and multi-step reasoning. The role involves designing evaluation and observability frameworks, establishing engineering standards for agentic development, and partnering with ML/product teams to ship agent-powered features.	Agent	Engineering	San Francisco, CA	7w ago	8
Technical Lead Manager, Handshake AI Technical Lead Manager for Handshake AI, focusing on building and shipping production AI solutions. This player-coach role requires hands-on coding, system architecture, and team leadership, working with frontier AI labs on data, evals, and AI systems.	Ship	Engineering	San Francisco, CA	7w ago	8
Staff Software Engineer, RLE Staff Software Engineer to lead the architecture and evolution of Handshake's Reinforcement Learning Environments (RLE) platform, focusing on scalable systems, data pipelines, and enabling rapid domain creation for frontier AI models. This role involves technical leadership, system design, and cross-team collaboration to ensure reliability, observability, and performance.	DataEval Gate	Engineering	Remote	Apr 23	8
Senior Engineering Manager, Reinforcement Learning Environments (RLE) Senior Engineering Manager to lead the Reinforcement Learning Environments (RLE) team, responsible for building interactive sandboxes that simulate end-to-end workflows for frontier models. The team generates high-signal interaction data used for training and evaluating models on task completion, quality, and robustness. The role involves leading a team of engineers, owning the RLE roadmap, driving architecture for scalable systems, and ensuring reliability and data quality.	Post-trainEval Gate	Engineering	San Francisco, CA	Feb 18	8
Data Development, Principal This Principal Data Development role focuses on sourcing, negotiating, and closing data partnerships with companies and institutions to supply proprietary real-world data to frontier AI labs for training next-generation AI models. The role involves translating data requirements between AI labs and enterprise leaders, structuring commercial and compensation models, and managing senior stakeholder relationships.	Data	Product	San Francisco, CA	1w ago	7
Senior Manager, Forward Deployed Engineering Senior Manager, Forward Deployed Engineering at Handshake AI. This role involves technical leadership and people management for a team of 10+ engineers focused on customer-facing AI solutions. The manager will contribute to technical strategy, architecture, and code, while also developing the team and defining the FDE operating model. The role requires strong technical credibility, customer engagement skills, and experience in building reusable platforms.	Agent	Engineering	San Francisco, CA	3w ago	7
Strategic Projects Lead This role is responsible for owning the execution of large-scale human data programs that directly power frontier AI model training and evaluation. The role involves managing hundreds to thousands of expert Fellows, designing staffing models, and partnering with AI labs and senior stakeholders to deliver programs with significant ARR-equivalent impact. The ideal candidate has a technical background, strong analytical and problem-solving skills, and experience in technical or analytical roles.	Data	Engineering	India	4w ago	7
Strategic Projects Lead, Coding This role leads coding data initiatives for AI and platform teams, managing SWE Fellows, designing evaluation workflows, and ensuring delivery, margins, and quality. It involves writing coding assessments, building review processes, instrumenting quality signals, and adapting workflows. The role requires strong technical and analytical skills, with experience in coding, data quality, and stakeholder management, bridging ML/product/engineering and operations.	Data	Engineering	India	4w ago	7
Senior Forward Deployed Engineer Senior Forward Deployed Engineer at Handshake AI, focusing on technical leadership for AI lab deployments. This role involves understanding customer needs, architecting and building solutions, and scaling them for production, operating across the stack in ambiguous environments. Experience with LLMs and customer-facing roles is highly valued.	Agent	Engineering	San Francisco, CA	6w ago	7
Strategic Projects Lead, Coding This role involves leading coding data initiatives for AI and platform teams, coordinating SWE Fellows, designing and owning technical evaluation and annotation workflows, and ensuring delivery, margins, quality, and customer relationships. Responsibilities include writing and validating coding assessments, building rubric-driven code review processes, instrumenting quality signals, and adapting workflows. The role requires strong technical and analytical skills, coding proficiency, and stakeholder management.	Data	Engineering	San Francisco, CA	7w ago	7
AI Tutor, Electrochemistry & Functional Materials Specialist (contract), Handshake AI This role involves designing and evaluating chemistry prompts for AI models, focusing on scientific reasoning and identifying model breakdowns. The specialist will act as a subject matter expert in electrochemistry and functional materials, applying adversarial prompting and assessing the accuracy of AI-generated responses.	Eval Gate	Research	Remote	8w ago	7
AI Tutor, Organic & Polymer Chemistry Specialist (NMR/Spectroscopy) (contract), Handshake AI This role focuses on evaluating AI models in chemistry, specifically using expertise in organic chemistry, polymer chemistry, and spectroscopy (NMR) to design prompts, assess model outputs, and identify reasoning errors. The specialist will contribute to quality standards and provide feedback to the AI team.	Eval Gate	Research	Remote	8w ago	7
AI Tutor, Biophysical & Computational Chemistry Specialist (contract), Handshake AI Role focuses on evaluating AI models in chemistry, designing prompts, assessing outputs, and identifying reasoning errors. Requires PhD in Chemistry and experience in AI data annotation or RLHF.	Eval Gate	Research	Remote	8w ago	7
AI Tutor, Biology Specialist (contract), Handshake AI The role focuses on evaluating and stress-testing complex scientific prompts for large language models, specifically in biology. The specialist will design high-difficulty prompts, identify reasoning errors and weaknesses in model outputs, and apply adversarial prompting techniques. This is a research-oriented role focused on improving AI model capabilities through expert evaluation.	Eval Gate	Research	Remote	8w ago	7
AI Tutor, Physics Specialist (contract), Handshake AI This role focuses on evaluating AI models, specifically in physics, by crafting and assessing challenging problems, probing model reasoning, and identifying failures using adversarial prompting. It involves providing expert critique of AI responses and ensuring quality benchmarks are met. Prior experience in AI data annotation or RLHF is required, with a PhD in Physics or a related field.	Eval GatePost-train	Research	Remote	8w ago	7
Technical Lead Manager, Forward Deployed Engineering Technical Lead Manager, Forward Deployed Engineering at Handshake AI. This player-coach role involves shipping end-to-end AI solutions for strategic partners, designing and building integrations, tooling, APIs, and workflows, and managing a small team of FDEs. The focus is on building production-ready systems and scaling team output through reusable components, while staying hands-on with coding.	Agent	Engineering	San Francisco, CA	8w ago	7
Machine Learning Engineer, PhD Intern Machine Learning Engineer Intern at Handshake AI, focusing on building intelligent product experiences for job seekers. The role involves developing, evaluating, and deploying ML models for search, recommendations, and matching systems in a production environment. Requires a PhD candidate with Python, PyTorch/TensorFlow, and ML operations experience.	AgentEval Gate	Engineering	San Francisco, CA	8w ago	7
Software Engineer II, RLE Software Engineer to build and scale Reinforcement Learning Environments (RLE) platform, which are interactive systems for frontier AI models to learn real-world tasks. The role involves owning components end-to-end, designing backend systems and data pipelines, and improving system reliability and performance, supporting model training and evaluation.	DataEval Gate	Engineering	San Francisco, CA	8w ago	7
Software Engineer I , Coding Pod Software Engineer on the Coding Pod will build data infrastructure and pipelines for frontier AI coding models, focusing on creating large-scale, high-quality benchmark datasets for evaluating model performance on coding tasks. This role involves owning end-to-end data pipelines, integrating with developer ecosystems, and working with evaluation systems and agentic coding tools.	DataEval Gate	Engineering	San Francisco, CA	Apr 27	7
Associate Software Engineer, RLE Associate Software Engineer to build Reinforcement Learning Environments (RLE) platform, including supporting infrastructure, backend systems, frontend interfaces, and data pipelines for model training and evaluation. The role involves creating modular workflow domains and working with senior engineers to improve system reliability and performance.	DataPost-train	Engineering	San Francisco, CA	Apr 23	7
Software Engineer I, RLE Software Engineer to build and scale the Reinforcement Learning Environments (RLE) platform, which involves designing and implementing backend systems, data pipelines, and modular workflow domains to support frontier AI model training and evaluation. The role requires experience in backend/distributed systems, ML-adjacent infrastructure, and cloud technologies.	DataEval Gate	Engineering	San Francisco, CA	Apr 23	7
Senior Software Engineer, RLE Senior Software Engineer to build and scale Reinforcement Learning Environments (RLE) platform, simulating real-world workflows for AI model training and evaluation. This role involves driving architecture for scalable systems and data generation pipelines, partnering with research and product teams, and ensuring system reliability and observability.	DataEval Gate	Engineering	Remote	Apr 22	7
Senior Software Engineer, FDE Senior Forward Deployed Engineer to serve as a technical leader at the intersection of engineering and strategic customers (leading AI labs). Owns end-to-end lifecycle of high-impact deployments, architecting, building, and scaling solutions to improve customer workflows and model performance. Operates across the stack in ambiguous, fast-changing environments.	Agent	Engineering	San Francisco, CA	Apr 20	7
Machine Learning PhDs - AI Trainer Machine Learning PhDs needed for hourly contract work to evaluate AI-generated content and provide feedback on machine learning reasoning, proof construction, and technical problem-solving. This role focuses on assessing AI responses for accuracy, rigor, and relevance to real-world physics research.	Eval Gate	Research	Remote	Apr 9	7
Machine Learning Engineer I This Machine Learning Engineer role focuses on developing and deploying ML models that directly impact user experience and business metrics for a consumer platform. The role involves end-to-end ownership of the ML lifecycle, working with cutting-edge infrastructure like embedding-based retrieval and multi-stage rankers, and contributing to responsible AI practices.	Ship	Engineering	San Francisco, CA	Apr 6	7
Senior Engineering Manager, Forward Deployed Engineering Senior Engineering Manager to lead and scale a team of Forward Deployed Engineers (FDEs) focused on customer-facing AI solutions for strategic partners. The role involves building the organizational structure, managing technical execution, and ensuring reliability and maintainability of AI products.	Ship	Engineering	San Francisco, CA	Apr 4	7
Associate Machine Learning Engineer Associate Machine Learning Engineer for the Growth Relevance team, focusing on developing, deploying, and enhancing ML systems for lifecycle optimization, personalized notifications, and monetization strategies. The role involves working with embedding-based retrieval, GNNs, and multi-stage rankers, and contributing to responsible AI practices.	AgentServe	Engineering	San Francisco, CA	Apr 2	7
Staff Forward Deployed Engineer Staff Forward Deployed Engineer role at Handshake AI, focusing on defining and driving technical strategy for engineered solutions to strategic customers, including leading AI labs. The role involves architecting and delivering production-grade systems, setting technical direction, and influencing product and platform architecture. It requires deep customer engagement and scaling forward-deployed engineering as a function, with a strong emphasis on customer-facing AI products.	Ship	Engineering	San Francisco, CA	Feb 9	7
Software Engineer, Consumer Experience Software Engineer role focused on building core consumer experiences, including agentic AI features for students using OpenAI APIs and agentic frameworks. The company also has a separate AI data business focused on frontier AI labs.	Agent	Engineering	San Francisco, CA	Jan 28	7
Manager, Strategic Projects Manager, Strategic Projects leading a team focused on AI data and evaluation work. Responsibilities include managing SPLs, driving project delivery (data pipelines, labeling workflows), translating needs into plans, owning performance metrics, ensuring a good experience for fellows, and partnering with Product/Engineering on tooling. Success involves consistent delivery, improved operational metrics, and strong team leadership. Requires 5+ years in operations, 2+ years managing teams, and experience with complex projects, ideally in AI data operations or ML ops.	DataEval Gate	Engineering	San Francisco, CA	Dec '25	7

Frequently asked questions

What AI roles is Handshake hiring for?
Handshake currently has 65 active AI-related roles in our index. The most common open titles are: Music Producer - AI Trainer (2), Strategic Projects Lead, Coding (2), 3D Slicer Specialist - AI Trainer , AI Red Teamer, LLM Generalist, Analog Engineer - AI Trainer. Most positions are in Engineering and Research.
What stage of AI development does Handshake focus on?
Handshake's active AI hiring is concentrated in: data (66%), agents (15%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Where is Handshake hiring AI talent?
Handshake is hiring AI talent in: United States (22 roles), India (4 roles).
What technologies does Handshake's AI team work with?
Job postings at Handshake most frequently reference: evals, synthetic data, model serving, agent orchestration, llm observability.
How many AI roles has Handshake posted recently?
In the past 30 days, Handshake has posted 10 new AI-related roles. That is a -57% change versus the prior 30 days (23 → 10).

Title

Stage

Function

Location

First seen

AI score

AI Red Teamer, CBRNE

This role focuses on evaluating AI models for safety and security, specifically concerning CBRNE threats. The Red Teamer will design adversarial prompts, assess model outputs for dangerous knowledge gaps, and document findings to help labs improve model defenses before they reach the real world. This requires deep domain expertise in CBRNE fields, strong ethical judgment, and the ability to think like a threat actor within a structured evaluation framework.

Eval Gate

Research

Seattle, WA

4w ago

AI Red Teamer, LLM Generalist

The AI Red Teamer, LLM Generalist role focuses on stress-testing large language models by designing creative, adversarial prompts to expose vulnerabilities in AI safety, guardrails, and robustness. This involves probing models across various risk categories (content safety, CBRN, cybersecurity, etc.) and potentially across different modalities (text, image, voice, agentic). The role requires strong prompt crafting skills, ethical judgment, and collaboration with engineers and researchers to share findings and strengthen defenses. It is a generalist role that may involve working with sensitive content.

Eval GateAgent

Engineering

Seattle, WA

4w ago

Senior Forward Deployed Engineer, Handshake AI Enterprise

Senior Forward Deployed Engineer to embed within enterprise customer environments, define AI-driven solutions, build and deploy production-grade AI agents, and design/run evals to measure performance. The role requires full-stack ownership, deep understanding of customer business, and iteration until performance improves. Emphasis on real-world AI application shipping and systematic improvement of AI performance.

AgentEval Gate

Engineering

San Francisco, CA

8w ago

AI PhD Student Researcher - Fall 2026

Handshake AI is seeking a PhD Student Researcher to work on novel RLHF/GRPO pipelines, instruction-following refinements, reasoning-trace supervision, multilingual/long-horizon/domain-specific benchmarks, automatic vs. human preference studies, robustness diagnostics, active-learning loops, data value estimation, synthetic data generation, and low-resource fine-tuning strategies. The goal is to produce an archive-ready manuscript or top-tier conference submission.

Post-trainEval Gate

Research

San Francisco, CA

Apr 8

Engineering Manager, RLE

This role is for a Senior Software Engineer to build and scale a Reinforcement Learning Environments (RLE) platform. This platform simulates real-world workflows for AI models to learn, generating data for training and evaluation. The engineer will drive architecture, build plug-and-play domains, and ensure system reliability and quality, working closely with research, product, and operations teams. Strong applied AI experience is required.

DataEval Gate

Engineering

India

3w ago

Manager Strategic Projects India

Manager for a team of Strategic Project Leads (SPLs) focused on AI data and evaluation projects. The role involves leading delivery, quality, and scalability, managing a team, translating needs into project plans, owning performance metrics, and partnering with Product and Engineering. The role operates in a high-pressure, fast-changing environment with a focus on operational excellence and continuous improvement in AI data pipelines and labeling workflows.

DataEval Gate

Engineering

India

4w ago

AI Red Teamer, Cybersecurity

This role focuses on evaluating AI models, specifically LLMs, for cybersecurity vulnerabilities. The AI Red Teamer will craft adversarial prompts and multi-turn interactions to test if models can be manipulated into generating functional malware, exploit code, or attack tooling. The core responsibility is to assess the output's real-world exploitability and contribute to improving model safety guardrails. This involves deep cybersecurity expertise and understanding attacker methodologies.

Eval GateAgent

Engineering

Seattle, WA

4w ago

Senior Product Manager, RL Environments — Handshake AI

Senior Product Manager to own the product surface that turns RL environment creation from a bespoke, weeks-long lift into a repeatable factory. This role will design and ship the platform that compresses lead time, replaces hand-built workflows with self-serve tooling, and lets a small team of operators turn out high-quality environments for any vertical.

DataEval Gate

Product

San Francisco, CA

5w ago

Applied AI Engineer, Handshake AI Enterprise

Applied AI Engineer role focused on building and deploying production-grade AI agents within enterprise customer environments. The role involves understanding customer business needs, developing agents, running evaluations, and iterating on performance to drive measurable business impact. Requires backend engineering depth and experience with AI/ML systems in production.

Agent

Engineering

San Francisco, CA

5w ago

Senior Applied AI Engineer, Handshake AI Enterprise

Senior Applied AI Engineer role focused on embedding within enterprise customer environments to build and deploy production-grade AI agents. The role involves defining AI-driven solutions, owning end-to-end delivery, designing and running evaluations, and iterating on performance. It requires strong applied AI and backend experience, with a focus on real-world application and systems thinking.

Agent

Engineering

San Francisco, CA

5w ago

Forward Deployed Engineer, Handshake AI Enterprise

Forward Deployed Engineer to embed within enterprise customer environments, define AI-driven solutions, build and deploy production-grade AI agents, and design evals to measure and improve performance. The role requires full-stack capabilities with strong backend depth and real-world experience shipping AI applications.

AgentEval Gate

Engineering

San Francisco, CA

6w ago

Software Engineer, Agentic Infrastructure

Software Engineer focused on building the core agent orchestration layer, including tool use, memory, and multi-step reasoning systems, to power AI-driven features for millions of users. The role also involves designing evaluation, observability, and reliability frameworks for agent behavior and establishing engineering standards for agentic development.

Agent

Engineering

San Francisco, CA

7w ago

Senior Software Engineer, Agentic Infrastructure

Senior Software Engineer to architect and build the foundational systems for AI agents, including tool use, memory, and multi-step reasoning. The role involves designing evaluation and observability frameworks, establishing engineering standards for agentic development, and partnering with ML/product teams to ship agent-powered features.

Agent

Engineering

San Francisco, CA

7w ago

Technical Lead Manager, Handshake AI

Technical Lead Manager for Handshake AI, focusing on building and shipping production AI solutions. This player-coach role requires hands-on coding, system architecture, and team leadership, working with frontier AI labs on data, evals, and AI systems.

Ship

Engineering

San Francisco, CA

7w ago

Staff Software Engineer, RLE

Staff Software Engineer to lead the architecture and evolution of Handshake's Reinforcement Learning Environments (RLE) platform, focusing on scalable systems, data pipelines, and enabling rapid domain creation for frontier AI models. This role involves technical leadership, system design, and cross-team collaboration to ensure reliability, observability, and performance.

DataEval Gate

Engineering

Remote

Apr 23

Senior Engineering Manager, Reinforcement Learning Environments (RLE)

Senior Engineering Manager to lead the Reinforcement Learning Environments (RLE) team, responsible for building interactive sandboxes that simulate end-to-end workflows for frontier models. The team generates high-signal interaction data used for training and evaluating models on task completion, quality, and robustness. The role involves leading a team of engineers, owning the RLE roadmap, driving architecture for scalable systems, and ensuring reliability and data quality.

Post-trainEval Gate

Engineering

San Francisco, CA

Feb 18

Data Development, Principal

This Principal Data Development role focuses on sourcing, negotiating, and closing data partnerships with companies and institutions to supply proprietary real-world data to frontier AI labs for training next-generation AI models. The role involves translating data requirements between AI labs and enterprise leaders, structuring commercial and compensation models, and managing senior stakeholder relationships.

Data

Product

San Francisco, CA

1w ago

Senior Manager, Forward Deployed Engineering

Senior Manager, Forward Deployed Engineering at Handshake AI. This role involves technical leadership and people management for a team of 10+ engineers focused on customer-facing AI solutions. The manager will contribute to technical strategy, architecture, and code, while also developing the team and defining the FDE operating model. The role requires strong technical credibility, customer engagement skills, and experience in building reusable platforms.

Agent

Engineering

San Francisco, CA

3w ago

Strategic Projects Lead

This role is responsible for owning the execution of large-scale human data programs that directly power frontier AI model training and evaluation. The role involves managing hundreds to thousands of expert Fellows, designing staffing models, and partnering with AI labs and senior stakeholders to deliver programs with significant ARR-equivalent impact. The ideal candidate has a technical background, strong analytical and problem-solving skills, and experience in technical or analytical roles.

Data

Engineering

India

4w ago

Strategic Projects Lead, Coding

This role leads coding data initiatives for AI and platform teams, managing SWE Fellows, designing evaluation workflows, and ensuring delivery, margins, and quality. It involves writing coding assessments, building review processes, instrumenting quality signals, and adapting workflows. The role requires strong technical and analytical skills, with experience in coding, data quality, and stakeholder management, bridging ML/product/engineering and operations.

Data

Engineering

India

4w ago

Senior Forward Deployed Engineer

Senior Forward Deployed Engineer at Handshake AI, focusing on technical leadership for AI lab deployments. This role involves understanding customer needs, architecting and building solutions, and scaling them for production, operating across the stack in ambiguous environments. Experience with LLMs and customer-facing roles is highly valued.

Agent

Engineering

San Francisco, CA

6w ago

Strategic Projects Lead, Coding

This role involves leading coding data initiatives for AI and platform teams, coordinating SWE Fellows, designing and owning technical evaluation and annotation workflows, and ensuring delivery, margins, quality, and customer relationships. Responsibilities include writing and validating coding assessments, building rubric-driven code review processes, instrumenting quality signals, and adapting workflows. The role requires strong technical and analytical skills, coding proficiency, and stakeholder management.

Data

Engineering

San Francisco, CA

7w ago

AI Tutor, Electrochemistry & Functional Materials Specialist (contract), Handshake AI

This role involves designing and evaluating chemistry prompts for AI models, focusing on scientific reasoning and identifying model breakdowns. The specialist will act as a subject matter expert in electrochemistry and functional materials, applying adversarial prompting and assessing the accuracy of AI-generated responses.

Eval Gate

Research

Remote

8w ago

AI Tutor, Organic & Polymer Chemistry Specialist (NMR/Spectroscopy) (contract), Handshake AI

This role focuses on evaluating AI models in chemistry, specifically using expertise in organic chemistry, polymer chemistry, and spectroscopy (NMR) to design prompts, assess model outputs, and identify reasoning errors. The specialist will contribute to quality standards and provide feedback to the AI team.

Eval Gate

Research

Remote

8w ago

AI Tutor, Biophysical & Computational Chemistry Specialist (contract), Handshake AI

Role focuses on evaluating AI models in chemistry, designing prompts, assessing outputs, and identifying reasoning errors. Requires PhD in Chemistry and experience in AI data annotation or RLHF.

Eval Gate

Research

Remote

8w ago

AI Tutor, Biology Specialist (contract), Handshake AI

The role focuses on evaluating and stress-testing complex scientific prompts for large language models, specifically in biology. The specialist will design high-difficulty prompts, identify reasoning errors and weaknesses in model outputs, and apply adversarial prompting techniques. This is a research-oriented role focused on improving AI model capabilities through expert evaluation.

Eval Gate

Research

Remote

8w ago

AI Tutor, Physics Specialist (contract), Handshake AI

This role focuses on evaluating AI models, specifically in physics, by crafting and assessing challenging problems, probing model reasoning, and identifying failures using adversarial prompting. It involves providing expert critique of AI responses and ensuring quality benchmarks are met. Prior experience in AI data annotation or RLHF is required, with a PhD in Physics or a related field.

Eval GatePost-train

Research

Remote

8w ago

Technical Lead Manager, Forward Deployed Engineering

Technical Lead Manager, Forward Deployed Engineering at Handshake AI. This player-coach role involves shipping end-to-end AI solutions for strategic partners, designing and building integrations, tooling, APIs, and workflows, and managing a small team of FDEs. The focus is on building production-ready systems and scaling team output through reusable components, while staying hands-on with coding.

Agent

Engineering

San Francisco, CA

8w ago

Machine Learning Engineer, PhD Intern

Machine Learning Engineer Intern at Handshake AI, focusing on building intelligent product experiences for job seekers. The role involves developing, evaluating, and deploying ML models for search, recommendations, and matching systems in a production environment. Requires a PhD candidate with Python, PyTorch/TensorFlow, and ML operations experience.

AgentEval Gate

Engineering

San Francisco, CA

8w ago

Software Engineer II, RLE

Software Engineer to build and scale Reinforcement Learning Environments (RLE) platform, which are interactive systems for frontier AI models to learn real-world tasks. The role involves owning components end-to-end, designing backend systems and data pipelines, and improving system reliability and performance, supporting model training and evaluation.

DataEval Gate

Engineering

San Francisco, CA

8w ago

Software Engineer I , Coding Pod

Software Engineer on the Coding Pod will build data infrastructure and pipelines for frontier AI coding models, focusing on creating large-scale, high-quality benchmark datasets for evaluating model performance on coding tasks. This role involves owning end-to-end data pipelines, integrating with developer ecosystems, and working with evaluation systems and agentic coding tools.

DataEval Gate

Engineering

San Francisco, CA

Apr 27

Associate Software Engineer, RLE

Associate Software Engineer to build Reinforcement Learning Environments (RLE) platform, including supporting infrastructure, backend systems, frontend interfaces, and data pipelines for model training and evaluation. The role involves creating modular workflow domains and working with senior engineers to improve system reliability and performance.

DataPost-train

Engineering

San Francisco, CA

Apr 23

Software Engineer I, RLE

Software Engineer to build and scale the Reinforcement Learning Environments (RLE) platform, which involves designing and implementing backend systems, data pipelines, and modular workflow domains to support frontier AI model training and evaluation. The role requires experience in backend/distributed systems, ML-adjacent infrastructure, and cloud technologies.

DataEval Gate

Engineering

San Francisco, CA

Apr 23

Senior Software Engineer, RLE

Senior Software Engineer to build and scale Reinforcement Learning Environments (RLE) platform, simulating real-world workflows for AI model training and evaluation. This role involves driving architecture for scalable systems and data generation pipelines, partnering with research and product teams, and ensuring system reliability and observability.

DataEval Gate

Engineering

Remote

Apr 22

Senior Software Engineer, FDE

Senior Forward Deployed Engineer to serve as a technical leader at the intersection of engineering and strategic customers (leading AI labs). Owns end-to-end lifecycle of high-impact deployments, architecting, building, and scaling solutions to improve customer workflows and model performance. Operates across the stack in ambiguous, fast-changing environments.

Agent

Engineering

San Francisco, CA

Apr 20

Machine Learning PhDs - AI Trainer

Machine Learning PhDs needed for hourly contract work to evaluate AI-generated content and provide feedback on machine learning reasoning, proof construction, and technical problem-solving. This role focuses on assessing AI responses for accuracy, rigor, and relevance to real-world physics research.

Eval Gate

Research

Remote

Apr 9

Machine Learning Engineer I

This Machine Learning Engineer role focuses on developing and deploying ML models that directly impact user experience and business metrics for a consumer platform. The role involves end-to-end ownership of the ML lifecycle, working with cutting-edge infrastructure like embedding-based retrieval and multi-stage rankers, and contributing to responsible AI practices.

Ship

Engineering

San Francisco, CA

Apr 6

Senior Engineering Manager, Forward Deployed Engineering

Senior Engineering Manager to lead and scale a team of Forward Deployed Engineers (FDEs) focused on customer-facing AI solutions for strategic partners. The role involves building the organizational structure, managing technical execution, and ensuring reliability and maintainability of AI products.

Ship

Engineering

San Francisco, CA

Apr 4

Associate Machine Learning Engineer

Associate Machine Learning Engineer for the Growth Relevance team, focusing on developing, deploying, and enhancing ML systems for lifecycle optimization, personalized notifications, and monetization strategies. The role involves working with embedding-based retrieval, GNNs, and multi-stage rankers, and contributing to responsible AI practices.

AgentServe

Engineering

San Francisco, CA

Apr 2

Staff Forward Deployed Engineer

Staff Forward Deployed Engineer role at Handshake AI, focusing on defining and driving technical strategy for engineered solutions to strategic customers, including leading AI labs. The role involves architecting and delivering production-grade systems, setting technical direction, and influencing product and platform architecture. It requires deep customer engagement and scaling forward-deployed engineering as a function, with a strong emphasis on customer-facing AI products.

Ship

Engineering

San Francisco, CA

Feb 9

Software Engineer, Consumer Experience

Software Engineer role focused on building core consumer experiences, including agentic AI features for students using OpenAI APIs and agentic frameworks. The company also has a separate AI data business focused on frontier AI labs.

Agent

Engineering

San Francisco, CA

Jan 28

Manager, Strategic Projects

Manager, Strategic Projects leading a team focused on AI data and evaluation work. Responsibilities include managing SPLs, driving project delivery (data pipelines, labeling workflows), translating needs into plans, owning performance metrics, ensuring a good experience for fellows, and partnering with Product/Engineering on tooling. Success involves consistent delivery, improved operational metrics, and strong team leadership. Requires 5+ years in operations, 2+ years managing teams, and experience with complex projects, ideally in AI data operations or ML ops.

DataEval Gate

Engineering

San Francisco, CA

Dec '25