Staff Research Engineer

Decagon Decagon · Vertical AI · New York, NY · Engineering

Staff Research Engineer at Decagon, a conversational AI platform company. The role focuses on building and deploying industry-leading conversational AI models and agents for enterprise customer experiences. Responsibilities include leading research and engineering for core conversational capabilities, building end-to-end models and pipelines, and integrating them into production systems. Requires experience in post-training and deploying LLMs, Python, ML tooling, and a track record of delivering production impact from research ideas.

What you'd actually do

  1. Lead research and engineering efforts to improve core conversational capabilities in production, including instruction following, retrieval, memory, and long-horizon task completion
  2. Build and iterate on end-to-end models and pipelines that optimize for quality, efficiency, and user experience
  3. Partner with platform and product engineers to integrate new models into production systems
  4. Break down ambiguous research ideas into clear, iterative milestones and roadmaps.
  5. Mentor other researchers/engineers, set technical direction, and establish best practices for applied research and engineering

Skills

Required

  • Python
  • modern ML tooling (training, evaluation, data pipelines)
  • post-training LLMs
  • deploying LLMs in production
  • research ideas to production impact
  • roadmap definition
  • ambiguity breakdown
  • cross-functional execution

Nice to have

  • instruction following
  • retrieval
  • memory
  • long-horizon task completion

What the JD emphasized

  • 8+ years of experience in AI/ML engineering or research
  • Prior experience post-training and deploying LLMs in production environments
  • Track record of taking research ideas from prototype → reliable, measurable production impact
  • Ability to define a roadmap, break ambiguity into milestones, and lead cross-functional execution

Other signals

  • building core models and algorithms
  • shipping real improvements
  • multi-quarter initiatives
  • pushing the agent's reliability, capability, and efficiency