Forward Deployed Engineer - ML

Modal Modal · Data AI · New York, NY · Engineering

The Forward Deployed ML Engineer will partner with leading AI companies to help them achieve state-of-the-art performance on demanding AI workloads like LLM serving and model training. This role involves hands-on work with customers to architect and optimize their AI workloads on the Modal platform, contribute to open-source projects, and collaborate with product and sales teams. Requires 2+ years of ML engineering experience, familiarity with serving and training toolchains, and strong communication skills.

What you'd actually do

  1. Work hands-on with companies like Suno, Lovable, Cognition, and Meta to architect and optimize production AI workloads on Modal
  2. Contribute to open-source projects — members of the team are active contributors to SGLang — and publish technical content that demonstrates Modal's capabilities across the AI stack
  3. Collaborate with Modal's product and sales teams, contributing to the platform as both an engineer and a product stakeholder
  4. Build trusted relationships with technical leaders (CTOs, VPs of Engineering, ML leads) at companies doing frontier AI work
  5. Conduct technical demos, experiments, and proof-of-concepts that make Modal's performance advantages tangible

Skills

Required

  • 2+ years of professional ML engineering experience
  • hands-on work in inference optimization, model training, GPU programming, or ML infrastructure
  • Familiarity with the serving (e.g., vLLM, SGLang) and training (e.g., slime, verl, TRL) toolchains
  • Strong communicator
  • Genuine interest in working directly with customers

Nice to have

  • side projects, open-source contributions, or published work you're proud of in ML or systems performance

What the JD emphasized

  • production AI workloads
  • LLM serving
  • model training
  • inference optimization
  • model training
  • GPU programming
  • ML infrastructure
  • serving
  • training

Other signals

  • customer-facing
  • ML infrastructure
  • inference optimization
  • model training