Sr Data Scientist

Disney Disney · Media · Santa Monica, CA +1

Senior Data Scientist role focused on designing, developing, and deploying advanced NLP, multimodal ML, and LLM solutions for content understanding use cases across Disney+, Hulu, and ESPN+. Responsibilities include building and adapting models for classification, similarity, retrieval, and enrichment, fine-tuning foundation models, and optimizing training/inference workflows. Requires strong Python, deep learning framework experience, and production ML systems knowledge.

What you'd actually do

  1. Lead the design, development, evaluation, and deployment of advanced NLP, LLM, and multimodal ML solutions for content understanding use cases.
  2. Build and adapt models for tasks such as text classification, semantic similarity, retrieval, ranking, metadata enrichment, and multimodal understanding across text, image, and video.
  3. Fine-tune, customize, and optimize open-source and foundation models using modern techniques such as supervised fine-tuning, parameter-efficient tuning, retrieval-augmented generation, and embedding-based methods.
  4. Partner closely with product, engineering, analytics, and business stakeholders to translate ambiguous business needs into scalable machine learning solutions.
  5. Optimize training and inference workflows using GPU infrastructure, distributed systems, and production best practices.

Skills

Required

  • Python
  • PyTorch or TensorFlow
  • deep learning
  • NLP
  • embeddings
  • transformer-based architectures
  • large language model systems
  • attention mechanisms
  • tokenization
  • representation learning
  • modern evaluation methodologies
  • production ML systems
  • CI/CD
  • job orchestration
  • containerization
  • monitoring
  • MLOps practices
  • evaluating ML systems
  • communication skills

Nice to have

  • Master’s degree or Ph.D.
  • multimodal models
  • retrieval systems
  • vector databases
  • semantic search
  • retrieval-augmented generation (RAG)
  • fine-tuning foundation models
  • distributed training
  • distributed inference
  • GPU infrastructure
  • hardware optimization
  • RLHF
  • parameter-efficient tuning
  • model adaptation methods

What the JD emphasized

  • deep expertise in modern deep learning architectures
  • hands-on experience adapting and evaluating foundation models
  • strong track record of delivering production-grade AI systems

Other signals

  • LLM solutions
  • foundation models
  • production-grade AI systems