Helix AI Engineer, Agentic Systems

Figure AI Figure AI · Robotics · HQ · AI - Helix Team

Figure AI is seeking an AI Engineer for Agentic Systems to build multimodal reasoning systems for autonomous humanoid robots. The role involves designing, training, and deploying agents that operate from raw sensory input, maintain memory, plan long-term, and execute tasks reliably in real environments. Responsibilities include developing agent architectures, infrastructure, perception-reasoning-action loops, evaluation harnesses, and applying reinforcement learning techniques. The position requires strong Python, deep learning framework proficiency, and software engineering skills.

What you'd actually do

  1. Design, train, and deploy multimodal agents that operate autonomously for hours to days
  2. Build agents that reason from raw sensory inputs (pixels, environment state, proprioception) to structured actions
  3. Implement episodic memory systems for persistent state, retrieval, and long-horizon reasoning
  4. Develop planning, reasoning, and tool-use mechanisms for multi-step task execution
  5. Build reliable perception → reasoning → action loops with strong stability and failure recovery

Skills

Required

  • Experience building autonomous agents that run continuously and complete multi-step tasks
  • Experience developing agents that reason from pixel inputs or raw environment observations
  • Experience implementing agent memory, planning, reasoning, or tool-use systems
  • Experience training or fine-tuning multimodal or foundation models
  • Strong proficiency in Python and modern deep learning frameworks (e.g., PyTorch)
  • Strong experimental rigor and ability to design, analyze, and iterate on ML systems
  • Strong software engineering skills and ability to build reliable, maintainable systems
  • Ability to work independently and own complex technical problems end-to-end

Nice to have

  • Experience with embodied AI, robotics learning, or robot policy training
  • Experience building multimodal foundation models (vision-language or vision-language-action)
  • Background in agentic AI systems or long-horizon planning architectures
  • Experience working with large-scale distributed training systems
  • Publication record in machine learning, robotics, or embodied AI

What the JD emphasized

  • autonomous general-purpose humanoid robots
  • humanoid robots with human-level intelligence
  • multimodal reasoning systems
  • agents that operate autonomously
  • episodic memory systems
  • long-horizon reasoning
  • tool-use mechanisms
  • multi-step task execution
  • perception → reasoning → action loops
  • evaluation harnesses, benchmarks, and metrics
  • data studies across the training lifecycle
  • pretraining, mid-training, and post-training
  • reinforcement learning, reward modeling, and post-training techniques
  • robot reasoning, planning, and reliability
  • scalable model training, distributed experimentation, and agent evaluation

Other signals

  • building multimodal reasoning systems
  • agents that operate autonomously
  • humanoid robots with human-level intelligence