Advisor - Applied Deep Learning Architect

Eli Lilly Eli Lilly · Pharma · Indianapolis, IN +2

Research Scientist role focused on designing, implementing, and evaluating deep learning architectures (transformers, diffusion models, GNNs) for protein design and engineering in drug discovery. The role involves multi-modal representation learning, cross-modality molecular modeling, and physics-informed training objectives, collaborating with computational biology and IT teams.

What you'd actually do

  1. Design, implement, and evaluate generative and predictive deep learning architectures—transformers, diffusion models, flow-matching models, and graph neural networks. Share actionable strategies and drive architectural decisions to improve the performance of foundational models for biologics drug discovery.
  2. Develop multi-modal embeddings that unify protein sequence, structure, and molecular fingerprints, researching novel tokenization schemes and fusion mechanisms that improve both generation quality and property prediction.
  3. Research approaches for jointly modeling proteins and small molecules within the foundational architecture, enabling applications to ADCs, antibody–peptide conjugates, T-cell engagers, and other multi-component biotherapeutic formats.
  4. Partner with internal MD scientists to integrate physics-based priors, molecular dynamics, and energy-aware learning objectives into model training to ground generative outputs in physical reality and improve the developability of designed molecules across complex modalities.
  5. Stay at the frontier of AI/ML and computational biology research; identify high-impact directions that advance the foundational model platform and translate them into actionable strategies for the team. Educate and transfer knowledge to other domain experts effectively to drive cross-functional collaboration on cutting-edge science.

Skills

Required

  • Ph.D. in Computer Science, Artificial Intelligence, Theoretical Computer Science, Applied Mathematics, Computational Biology, Physics, or a related field
  • Strong expertise in modern deep learning architectures, including transformers, diffusion models, flow-matching networks, variational autoencoders, and graph neural networks
  • Proficiency in Python and modern AI/ML frameworks (PyTorch or TensorFlow)
  • Familiarity with good software engineering practices including Git version control, code review, testing, and documentation

Nice to have

  • 1-3 years of industry experience in development and deployment of Novel Deep Learning Architecture
  • Familiarity with protein engineering, protein sequence and structure representation, protein language models (e.g., ESM, AbLang), generative protein models (RFDiffusion, Boltz, Chai, etc.) or related biomolecular ML
  • Experience applying ML to antibody, nanobody, or peptide design
  • Experience with multi-modal architectures that jointly model sequence, structure, and functional annotations, and that fuse molecular representations across different molecule modalities (e.g., protein-peptide, protein–ligand, protein–small molecule)
  • Protein structure understanding; experience integrating molecular dynamics simulations, force-field representations, or physics-based priors into machine learning models for molecular design or optimization
  • Experience with distributed training, GPU-accelerated workflows, and writing performant code for large-scale model training and inference
  • Prior exposure to experimental biologics workflows (phage display, yeast display, directed evolution) that informs practical design constraints
  • Demonstrated history of high-impact publications in top-tier machine learning, AI, or computational biology venues
  • Strong oral and written communication skills

What the JD emphasized

  • deep learning expertise
  • deep learning architectures
  • multi-modal model design
  • protein design
  • computational biology

Other signals

  • deep learning architectures
  • protein design
  • multi-modal model design
  • in-silico drug discovery