Runtime safety filters that detect or block harmful, off-policy, or unsafe LLM outputs before they reach end users; the production complement to alignment research.
Primary AI lifecycle stage: evaluation and serving infrastructure.
As of today, 1,219 active AI roles across 172 companies in our index reference Guardrails. Hiring concentrates at the agents (74%) and serving infrastructure (8%) stages. Most common sectors: Enterprise, Big Tech, Banking.
Runtime safety filters that detect or block harmful, off-policy, or unsafe LLM outputs before they reach end users; the production complement to alignment research. Primary AI lifecycle stage: evaluation and serving infrastructure.
1,219 active AI roles across 172 companies in our index reference Guardrails as of today.
The companies with the most active Guardrails listings are: Capital One (96 roles), Amazon (84 roles), JPMorgan Chase (74 roles), Google (55 roles), OpenAI (49 roles).
Guardrails primarily belongs to the evaluation and serving infrastructure stages of the AI lifecycle. In current hiring, Guardrails roles concentrate at: agents (74%), serving infrastructure (8%).
The sectors with the most active Guardrails hiring are: Enterprise, Big Tech, Banking.
160 AI roles tagged guardrails.
| Company | Title | Sector | AI score | Other tags |
|---|---|---|---|---|
| OpenAI | Researcher, Misalignment Research | AI Frontier | 10 | Evals · Agent research · Frontier research |
| OpenAI | Researcher, Loss of Control | AI Frontier | 10 | Agent orchestration · Tool use · Evals · LLM observability · Agent research |
| Anthropic | Research Engineer, Frontier Red Team (Autonomy) | AI Frontier | 10 | Agent orchestration · Tool use · Evals · Embodied AI · RL robotics · Agent research |
| Anthropic | Anthropic AI Safety Fellow, UK | AI Frontier | 10 | Frontier research · Interpretability · Evals · RLHF |
| Anthropic | Anthropic AI Safety Fellow, US | AI Frontier | 10 | Frontier research · Interpretability · Evals · RL post-training |
| Anthropic | Software Engineer, Safeguards Evals | AI Frontier | 9 | Agent orchestration · Evals · LLM observability · Synthetic data · Agent research · RL post-training |
| OpenAI | Software Engineer, Cyber Frontier | AI Frontier | 9 | Evals · Model serving · Frontier research |
| OpenAI | Security Researcher, Agentic AI Threats | AI Frontier | 9 | Agent orchestration · Evals |
| Writer | AI engineer | AI Frontier | 9 | Agent orchestration · Model serving · Inference infra · LLM observability |
| Writer | AI engineer (UK) | AI Frontier | 9 | Agent orchestration · Model serving · Inference infra · LLM observability |
| Writer | Security engineer, detection and response (US) | AI Frontier | 9 | Inference infra · Model serving |
| Writer | Security engineer, detection and response (UK) | AI Frontier | 9 | Inference infra · Model serving · Evals |
| Mistral AI | Model Behavior Architect | AI Frontier | 9 | Evals · LLM observability · Agent orchestration · Tool use · Fine-tuning · RL post-training |
| OpenAI | Researcher, Alignment Science | AI Frontier | 9 | RL post-training · Evals · LLM observability · Interpretability |
| Anthropic | Research Engineer, Safeguards Labs | AI Frontier | 9 | Evals · Agent orchestration · Agent research · Fine-tuning · RL post-training |
| Writer | Security engineer, detection and response (US) | AI Frontier | 9 | Agent orchestration · LLM observability · Inference infra · Model serving |
| Writer | Security engineer, detection and response (UK) | AI Frontier | 9 | Model serving · Inference infra · LLM observability |
| Anthropic | Anthropic Fellows Program — AI Safety | AI Frontier | 9 | Interpretability · Evals · RL post-training |
| OpenAI | Researcher, Safety & Privacy | AI Frontier | 9 | Interpretability · Evals · LLM observability |
| Lila Sciences | Staff / Principal Research Engineer, AI Safety, Technical Mitigations | AI Frontier | 9 | Evals · RL post-training · Model serving |
| Perplexity | Member of Technical Staff (Secure Intelligence Institute) | AI Frontier | 9 | Agent orchestration · Evals · Agent research |
| OpenAI | Machine Learning Engineer, Integrity | AI Frontier | 9 | Fine-tuning · Model serving · LLM observability · Evals |
| Anthropic | Security Labs Engineer | AI Frontier | 9 | Inference infra · Model serving |
| Cohere | Product Manager, Safety Research | AI Frontier | 9 | Agent orchestration · Evals · LLM observability · Agent research |
| Anthropic | Research Lead, Training Insights | AI Frontier | 9 | Evals · LLM observability · Agent research · RL post-training · Frontier research |
| OpenAI | Threat Modeler, Preparedness | AI Frontier | 9 | Evals · Interpretability · Agent research |
| OpenAI | Researcher, Automated Red Teaming | AI Frontier | 9 | Evals · Agent orchestration · Tool use · LLM observability |
| OpenAI | Researcher, Frontier Cybersecurity Risks | AI Frontier | 9 | Agent orchestration · LLM observability · Evals · Model serving |
| Anthropic | Prompt Engineer, Agent Prompts & Evals | AI Frontier | 9 | Agent orchestration · Evals · LLM observability · Fine-tuning · Model serving |
| Anthropic | Research Scientist, Frontier Red Team (Emerging Risks) | AI Frontier | 9 | Evals · Agent research · LLM observability · Embodied AI |