Handshake currently has 68 active AI-related job listings. The majority of these roles, 60%, are focused on data, with a further 15% in agents. Research and Engineering are the most frequent functions hiring for these positions. Recent hiring activity shows a significant increase, with 23 new AI roles posted in the last 30 days, representing a 92% rise compared to the preceding 30-day period.
Currently tracking 23 active AI roles, down 48% versus the prior 4 weeks. Primary focus: Agent · Engineering.
Handshake currently has 65 active AI-related roles in our index. The most common open titles are: Music Producer - AI Trainer (2); Strategic Projects Lead, Coding (2); 3D Slicer Specialist - AI Trainer; AI Red Teamer, LLM Generalist; Analog Engineer - AI Trainer. Most positions are in Engineering and Research.
Handshake's active AI hiring is concentrated in: data (66%), agents (15%), application (8%). These categories follow a seven-stage AI lifecycle: data, pre-training, post-training, serving infrastructure, agents, evaluation, and application.
Handshake is hiring AI talent in: United States (22 roles), India (4 roles).
Job postings at Handshake most frequently reference: evals, synthetic data, model serving, agent orchestration, LLM observability.
In the past 30 days, Handshake has posted 10 new AI-related roles. That is a 57% decrease versus the prior 30 days (23 → 10).
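The quoted change is consistent with an ordinary percent-change calculation on the two counts. The following is a minimal sketch, assuming the index rounds to the nearest whole percent; `percent_change` is a hypothetical helper written for illustration, not part of any Handshake tooling.

```python
def percent_change(previous: int, current: int) -> int:
    """Return the change from previous to current as a whole-number percentage."""
    if previous == 0:
        # Assumption: a zero baseline is treated as an error rather than "infinite growth".
        raise ValueError("percent change is undefined when the previous count is 0")
    return round((current - previous) / previous * 100)


# Counts quoted above: 23 roles in the prior 30 days, 10 in the latest 30 days.
print(percent_change(23, 10))  # -57
```

A negative result indicates a decline, so -57 here corresponds to roughly 57% fewer new postings than in the prior window.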
| Title | Stage | AI score |
|---|---|---|
| **AI Red Teamer, CBRNE**: This role focuses on evaluating AI models for safety and security, specifically concerning CBRNE threats. The Red Teamer will design adversarial prompts, assess model outputs for dangerous knowledge gaps, and document findings to help labs improve model defenses before they reach the real world. This requires deep domain expertise in CBRNE fields, strong ethical judgment, and the ability to think like a threat actor within a structured evaluation framework. | Eval Gate | 9 |
| **AI Red Teamer, LLM Generalist**: The AI Red Teamer, LLM Generalist role focuses on stress-testing large language models by designing creative, adversarial prompts to expose vulnerabilities in AI safety, guardrails, and robustness. This involves probing models across various risk categories (content safety, CBRN, cybersecurity, etc.) and potentially across different modalities (text, image, voice, agentic). The role requires strong prompt crafting skills, ethical judgment, and collaboration with engineers and researchers to share findings and strengthen defenses. It is a generalist role that may involve working with sensitive content. | Eval Gate, Agent | 9 |
| **AI Red Teamer, Cybersecurity**: This role focuses on evaluating AI models, specifically LLMs, for cybersecurity vulnerabilities. The AI Red Teamer will craft adversarial prompts and multi-turn interactions to test if models can be manipulated into generating functional malware, exploit code, or attack tooling. The core responsibility is to assess the output's real-world exploitability and contribute to improving model safety guardrails. This involves deep cybersecurity expertise and understanding attacker methodologies. | Eval Gate, Agent | 8 |
| **AI Tutor, Electrochemistry & Functional Materials Specialist (contract), Handshake AI**: This role involves designing and evaluating chemistry prompts for AI models, focusing on scientific reasoning and identifying model breakdowns. The specialist will act as a subject matter expert in electrochemistry and functional materials, applying adversarial prompting and assessing the accuracy of AI-generated responses. | Eval Gate | 7 |
| **AI Tutor, Organic & Polymer Chemistry Specialist (NMR/Spectroscopy) (contract), Handshake AI**: This role focuses on evaluating AI models in chemistry, specifically using expertise in organic chemistry, polymer chemistry, and spectroscopy (NMR) to design prompts, assess model outputs, and identify reasoning errors. The specialist will contribute to quality standards and provide feedback to the AI team. | Eval Gate | 7 |
| **AI Tutor, Biophysical & Computational Chemistry Specialist (contract), Handshake AI**: This role focuses on evaluating AI models in chemistry, designing prompts, assessing outputs, and identifying reasoning errors. Requires a PhD in Chemistry and experience in AI data annotation or RLHF. | Eval Gate | 7 |
| **AI Tutor, Biology Specialist (contract), Handshake AI**: The role focuses on evaluating and stress-testing complex scientific prompts for large language models, specifically in biology. The specialist will design high-difficulty prompts, identify reasoning errors and weaknesses in model outputs, and apply adversarial prompting techniques. This is a research-oriented role focused on improving AI model capabilities through expert evaluation. | Eval Gate | 7 |
| **AI Tutor, Physics Specialist (contract), Handshake AI**: This role focuses on evaluating AI models, specifically in physics, by crafting and assessing challenging problems, probing model reasoning, and identifying failures using adversarial prompting. It involves providing expert critique of AI responses and ensuring quality benchmarks are met. Prior experience in AI data annotation or RLHF is required, with a PhD in Physics or a related field. | Eval Gate, Post-train | 7 |
| **Machine Learning PhDs - AI Trainer**: Machine Learning PhDs needed for hourly contract work to evaluate AI-generated content and provide feedback on machine learning reasoning, proof construction, and technical problem-solving. This role focuses on assessing AI responses for accuracy, rigor, and relevance to real-world physics research. | Eval Gate | 7 |