Research Scientist – Controlled 3d Generation

Stability AI Stability AI · AI Frontier · Remote · Research

Research Scientist focused on 3D generation using flow matching and diffusion models, aiming to create consistent, editable, and physically grounded 3D assets and scenes. Responsibilities include research, designing training pipelines, developing conditioning techniques, analyzing model behavior, and publishing results.

What you'd actually do

  1. Conduct cutting-edge research on flow-matching, diffusion, and score-based methods for 3D generation and reconstruction.
  2. Design and implement scalable training pipelines for controllable 3D generation (meshes, Gaussians, NeRFs, voxels, implicit fields).
  3. Develop techniques for conditioning and control (text, sketch, pose, camera, physics) and multi-view consistency.
  4. Analyse model behaviour through ablations, visualisations, and quantitative metrics.
  5. Collaborate with cross-disciplinary research, graphics, and infrastructure teams to translate research into production-ready systems.

Skills

Required

  • PhD (or equivalent experience) in Machine Learning, Computer Vision, or Computer Graphics
  • PyTorch, JAX, or CUDA-level optimisation
  • Understanding of 3D representations (meshes, Gaussians, signed-distance fields, volumetric grids, implicit networks)
  • Solid grasp of geometry processing, multi-view consistency, and differentiable rendering

Nice to have

  • Experience generating coherent 3D scenes with multiple interacting objects, lighting, and spatial layout
  • Familiarity with scene-level control (object placement, camera path, simulation, or text-to-scene composition)
  • Knowledge of video-to-3D, image-to-scene, or 4D temporal generation
  • Background in physically-based rendering, simulation, or world-model architectures
  • Track record of impactful publications or open-source releases

What the JD emphasized

  • Published work on diffusion, flow-matching, or score-based generative models (2D or 3D)
  • PyTorch, JAX, or CUDA-level optimisation
  • 3D representations (meshes, Gaussians, signed-distance fields, volumetric grids, implicit networks)
  • geometry processing, multi-view consistency, and differentiable rendering

Other signals

  • 3D generation
  • flow matching
  • diffusion models
  • controllable 3D content creation