Sr. Manager, Applied Science - Generative AI Data Research

Adobe Adobe · Enterprise · San Jose, CA +2

Sr. Manager for Generative AI Data Research at Adobe, leading a team to build multimodal datasets for foundation model training. Focuses on data acquisition, filtering, curation, and pipeline development, with a strong emphasis on system design and applied ML workflows.

What you'd actually do

  1. Lead and grow a team of applied scientists and engineers working on multimodal data research and generative AI training systems
  2. Own and evolve key parts of the training and experimentation datasets.
  3. Partner closely with applied research and engineering teams to support the full lifecycle from experimentation to production
  4. Drive technical design and architecture decisions for large-scale data pipelines and dataset creation and maintenance.
  5. Establish standard processes for system robustness, testing, observability, and reproducibility

Skills

Required

  • MS or PhD in Computer Science, Engineering, AI/ML, or a related technical field, or equivalent practical experience
  • At least 4+ years of experience leading or managing technical teams in AI/ML domains, with a large component of working on data.
  • Strong hands-on experience working on model training and data.
  • Ability to operate as a technical leader, making sound design tradeoffs and unblocking complex engineering problems
  • Strong communication skills and the ability to collaborate across research, engineering, and product organizations
  • Comfort working in fast-moving, ambiguous environments typical of generative AI development

What the JD emphasized

  • multimodal data research
  • datasets that support training and development of foundation models
  • foundational model training
  • large-scale data
  • scalable, maintainable data pipelines and datasets
  • system design
  • applied ML workflows
  • technical leader
  • data for foundation model training
  • working on data

Other signals

  • leading a team
  • multimodal data research
  • datasets for foundation model training
  • scalable data pipelines