Scientific Lead - Forward Deployed AI Engineer, Applied Intelligence for Discovery

Eli Lilly Eli Lilly · Pharma · San Francisco, CA

The Forward Deployed AI Engineer will embed with research teams to translate scientific use-cases into production AI systems for drug discovery. This role involves applying LLMs, RAG, and agentic frameworks to solve scientific problems, running evaluation loops, and distilling learnings into reusable components. The engineer will own end-to-end deployments, ensuring reliability and integrating with AI/LLMOps platforms, with a focus on measurable workflow impact and evidence-based feedback loops.

What you'd actually do

  1. Embed with computational biology and disease biology teams in your assigned therapeutic area to develop deep understanding of their workflows, data, tools, and bottlenecks
  2. Translate use-cases into concrete, testable prototypes with clear success criteria. Rapidly turn ideas into a working demo, complete with evaluation benchmarks that tighten acceptance criteria over time
  3. Design and ship production systems quickly that solve specific scientific problems; owning integrations, data provenance, reliability, and on-call readiness
  4. Apply LLM, retrieval-augmented generation (RAG), text-to-SQL, agentic AI frameworks, and other emerging approaches to drug discovery challenges including target identification, biomarker prioritization, mechanism of action studies, and extraction of insight from large-scale multi-omics datasets
  5. Run evaluation loops that measure model and system quality against workflow-specific scientific benchmarks; use results to drive model selection, product changes, and iterative evidence generation that tightens acceptance criteria over time

Skills

Required

  • PhD in computational biology, bioinformatics, data science, computer science, or a related field, with 3+ years of software/ML engineering or technical deployment experience; or equivalent demonstrated experience building and deploying AI/ML tools for scientific applications in biotech, pharma, or scientific software; MS in computational biology, bioinformatics, data science, computer science, or a related field, with 5+ years of software/ML engineering or technical deployment experience; or equivalent demonstrated experience building and deploying AI/ML tools for scientific applications in biotech, pharma, or scientific software
  • Strong programming skills in Python
  • Experience with LLMs (API usage, prompt engineering, fine-tuning)
  • Experience with common frameworks (PyTorch, HuggingFace, LangChain/LlamaIndex, or similar)
  • Owned AI deployments end-to-end from scoping through production adoption, and improved them through evaluation design, error analysis, and iterative evidence generation
  • Sufficient biological knowledge to have productive conversations with computational scientists and understand the research context behind their problems
  • Experience building data-driven applications including interactive dashboards, natural language interfaces, or automated analysis pipelines
  • Communicate clearly across scientific, computational, technical, and executive audiences

Nice to have

  • Familiarity with cloud computing environments (AWS preferred)
  • Familiarity with version control (Git)
  • Experience in pharmaceutical, biotech, or life sciences R&D environments
  • Familiarity with agentic AI frameworks and building AI-powered workflows that chain multiple models or tools together
  • Experience with biological foundation models (e.g., scGPT, Geneformer for single-cell; ESM for proteins; AlphaFold) or their application to research problems
  • Knowledge of biomedi

What the JD emphasized

  • production systems
  • evaluation benchmarks
  • evaluation loops
  • scientific benchmarks
  • production adoption
  • workflow impact
  • evidence-based impact measurement

Other signals

  • Embed directly with research teams
  • Translate use-cases into concrete, testable prototypes
  • Design and ship production systems quickly
  • Apply LLM, RAG, text-to-SQL, agentic AI frameworks
  • Run evaluation loops that measure model and system quality
  • Distill deployment learnings into hardened primitives
  • Partner closely with AI/LLMOps engineers
  • Contribute to a culture of experimentation, speed, and evidence-based impact measurement