Distinguished Machine Learning Engineer - Safety

Roblox · Consumer · San Mateo, CA · Machine Learning

Distinguished Machine Learning Engineer/Technical Director in the Safety organization at Roblox, driving technical vision and execution for ML initiatives focused on user safety and civility. The role involves owning technical direction, leading engineers, building large-scale ML models, and collaborating with cross-functional teams to define the ML roadmap for Trust and Safety.

What you'd actually do

  1. Own the technical direction and implementation of machine learning solutions for safety-related systems
  2. Lead and mentor other engineers, fostering a culture of technical excellence and inclusivity
  3. Break down long-term product requirements into iterative deliverable stages, ensuring continuous improvement
  4. Craft and build large-scale machine learning models with billions of parameters, ensuring production-readiness
  5. Facilitate challenging technical decisions across multiple teams, demonstrating empathy and finding common understanding

Skills

Required

  • 10+ years of experience delivering and improving large-scale machine learning systems
  • Proven expertise in creating and launching machine learning models from scratch
  • Ability to handle data problems, train models with large datasets, and ensure system reliability at scale
  • Experience with distributed systems, data architecture, and model extraction
  • Strong programming skills
  • Prior experience in a leadership role with a technical focus, including mentoring engineering teams

Nice to have

  • NLP
  • computer vision
  • maintaining and evolving safety systems at scale

What the JD emphasized

  • 10+ years of experience delivering and improving large-scale machine learning systems
  • Proven expertise in creating and launching machine learning models from scratch
  • Ability to handle data problems, train models with large datasets, and ensure system reliability at scale
  • Experience with distributed systems, data architecture, and model extraction
  • maintaining and evolving safety systems at scale

Other signals

  • large-scale machine learning models with billions of parameters
  • production-readiness
  • distributed systems
  • data architecture
  • model extraction
  • safety-related systems