Black Forest Labs
Multimodal · Flux image-generation foundation models (Berlin)
Hiring velocityscroll left for older weeks
Jobs (3)
| Title | Stage | AI score |
|---|---|---|
| Member of Technical Staff - Pretraining Research role focused on leading large-scale pretraining experiments for multimodal foundation models (image, video, audio), involving architecture, objective functions, and training algorithms. Requires prior experience leading pretraining for production models and strong distributed training skills. | Pretrain | 10 |
| Member of Technical Staff - VLM Research role focused on developing and integrating state-of-the-art vision-language models (VLMs) into the FLUX generative AI stack, innovating on architectures and improving multimodal understanding for enhanced generation quality and controllability. | PretrainPost-train | 9 |
| Member of Technical Staff - Image / Video Generation Research role focused on training and fine-tuning large-scale diffusion models for image and video generation, involving rigorous experimentation, ablation studies, and understanding speed-quality tradeoffs in production settings. | Post-train | 9 |