Senior Staff Machine Learning Scientist, Assets

Webflow Webflow · Enterprise · CA · Remote · Engineering

Senior Staff Machine Learning Scientist role focused on advancing state-of-the-art in computer vision, multimodal understanding, and visual generation. The role involves developing novel models, algorithms, and training methodologies, translating research into practical improvements, and optimizing large-scale foundation models. It requires setting technical direction, mentoring, and staying at the forefront of research in vision and multimodal AI for a consumer-facing platform.

What you'd actually do

  1. Lead and drive ambitious research initiatives that advance the state of the art in computer vision, multimodal understanding, and visual generation.
  2. Develop novel models, algorithms, and training methodologies for challenging vision problems such as image understanding, video understanding, visual search, scene representation, segmentation, detection, generation, and multimodal reasoning.
  3. Translate cutting-edge research into practical model improvements that can shape product direction and unlock new user experiences.
  4. Design, implement, train, and optimize large-scale vision and multimodal foundation models across diverse datasets and tasks.
  5. Partner closely with applied scientists, ML engineers, and product teams to move research from exploration to production-ready systems.

Skills

Required

  • Deep theoretical and practical expertise in computer vision
  • Strong foundations in areas such as representation learning, visual recognition, image or video generation, multimodal learning, and large-scale model training
  • Advanced degree in Computer Science, Electrical Engineering, Statistics, Applied Mathematics, or a related field
  • 8+ years of relevant industry and/or research experience, or equivalent impact
  • Strong experience with state-of-the-art vision and multimodal model architectures, including transformers, diffusion models, contrastive learning approaches, and foundation models
  • Proven ability to formulate research problems clearly, design rigorous experiments, and translate findings into meaningful model or product advances
  • Strong coding and prototyping skills in Python
  • Proficiency in modern deep learning frameworks such as PyTorch and TensorFlow
  • Demonstrated technical leadership, including mentoring scientists or engineers and influencing research direction across a team or organization
  • Strong communication skills

Nice to have

  • PhD strongly preferred
  • Stay curious and open to growth — demonstrating a proactive embrace of AI, and actively building and applying fluency in emerging technologies to elevate how we work, drive faster outcomes, and expand collective impact.

What the JD emphasized

  • track record of driving major research initiatives in computer vision or multimodal machine learning
  • PhD strongly preferred
  • 8+ years of relevant industry and/or research experience

Other signals

  • Develop novel models, algorithms, and training methodologies
  • Translate cutting-edge research into practical model improvements
  • Design, implement, train, and optimize large-scale vision and multimodal foundation models
  • Set technical direction for high-impact research areas
  • Stay at the forefront of research in computer vision, multimodal learning, generative modeling, and foundation models