Staff Machine Learning Research Engineer, Agent Post-training - Enterprise Genai

Scale AI Scale AI · Data AI · San Francisco, CA · Enterprise Engineering

Scale AI is seeking a Staff Machine Learning Research Engineer focused on post-training algorithms for complex agents in enterprise GenAI applications. The role involves building a next-generation Agent RL training platform, integrating cutting-edge research, and training state-of-the-art models for enterprise customers, including cybersecurity and healthtech use cases. Experience with LLM training, post-training methods like RLHF/RLVR, and publications in top conferences are required.

What you'd actually do

  1. Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers.
  2. Research cutting edge algorithms to integrate directly into our training stack.
  3. Design solutions that enable complex multi-agent systems to directly learn from both process + outcome based rewards.

Skills

Required

  • 5+ years of LLM training in a production environment
  • Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years
  • PhD or Masters in Computer Science or a related field

What the JD emphasized

  • post-training algorithms
  • complex agents
  • enterprise use-cases
  • next-generation AI cybersecurity firewall LLMs
  • training foundation healthtech search models
  • LLM training in a production environment
  • post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
  • Publications in top conferences such as NEURIPS, ICLR, or ICML within the last two years

Other signals

  • post-training algorithms
  • complex agents
  • enterprise use-cases
  • next-gen AI cybersecurity firewall LLMs
  • foundation healthtech search models