Week 2025-W02
4 new AI roles opened across 2 companies. Highest-signal roles first.
Anthropic· 3 roles
- Research Engineer, Frontier Red Team (RSP Evaluations) Eval Gate · Research 9Research Engineer focused on developing and running "gold standard" evaluations for catastrophic risks to ensure safe release of frontier AI models, aligning with the Responsible Scaling Policy (RSP). The role involves creating evaluation systems, collaborating with domain experts, building sandboxed testing environments, and informing critical deployment decisions.
- Research Scientist, Frontier Red Team (Autonomy) Eval Gate · Research 9Research Scientist role focused on developing and productionizing advanced autonomy evaluations for AI Safety Level (ASL) determination of models. This involves risk and capability modeling, designing, implementing, and running large-scale experiments to evaluate autonomous capabilities and forecast future capabilities, with potential for people management.
- Staff Software Engineer, AI Reliability Engineering Serve · Engineering 8Staff Software Engineer focused on AI Reliability Engineering at Anthropic, responsible for defining and achieving reliability metrics for LLM serving and training systems. This includes designing monitoring, implementing high-availability infrastructure, leading incident response, and optimizing costs for large-scale AI infrastructure.
Figure AI· 1 role
- Humanoid Robot Operations Associate Ship · Engineering 7Figure AI is seeking a Humanoid Robot Operations Associate to deploy and operate their humanoid robots in automotive manufacturing settings. This role involves monitoring robot performance, identifying and documenting issues, and relaying feedback to engineering teams. The associate will also collect data for AI training and assist in refining robot behaviors, potentially using teleoperation for training purposes. The position requires a HS Diploma, fluency in English, physical ability to work in a manufacturing environment, and a willingness to embrace feedback and work with minimal supervision.