Post Doctoral Scientist – Human Genomics and Translational Data Sciences

Eli Lilly · Pharma · Boston, MA

This role focuses on applying statistical and computational approaches, including machine learning, to analyze large-scale multi-omics (genomic, proteomic, metabolomic) and clinical data from biobanks and population cohorts. The goal is to identify novel therapeutic targets and biomarkers for cardiometabolic diseases. The role involves developing and implementing bioinformatics pipelines, contributing to novel statistical methods, and collaborating with interdisciplinary teams to guide therapeutic development. While the primary focus is on data analysis and method development (L0/L2), the ultimate aim is to inform drug discovery and development.

What you'd actually do

  1. Apply statistical and computational approaches to analyze WGS/WES, proteomics, metabolomics, and clinical data for biomarker discovery.
  2. Conduct rigorous analyses of large-scale population cohorts and biobank datasets to identify genetic variants and causal genes associated with disease outcomes.
  3. Develop and implement machine learning and bioinformatics pipelines to integrate multi-omics data.
  4. Collaborate with interdisciplinary teams, including geneticists, epidemiologists, and clinicians, to interpret findings and guide therapeutic development.
  5. Prepare scientific reports, presentations, and publications detailing research outcomes.

Skills

Required

  • PhD in statistical genetics, bioinformatics, computational biology, biostatistics, or a related quantitative field
  • whole genome and whole exome sequencing analysis, proteomics, metabolomics and other molecular data analysis, and clinical outcomes research
  • statistical modeling
  • machine learning
  • high-dimensional data analysis
  • programming languages such as R, Python, and SQL
  • genetic association studies, GWAS, and polygenic risk scores
  • functional genomics and multi-omics data integration

Nice to have

  • Experience working with large biobank and cohort datasets (e.g., UK Biobank, All of Us, FinnGen).
  • Excellent communication and collaboration skills
  • Experience in pharmaceutical or biotech industry settings
  • Strong publication record
  • Prior experience in cardiometabolic research.
  • Prior experience with polygenic risk score models.

What the JD emphasized

  • whole genome sequencing (WGS), proteomics, and clinical outcomes analysis
  • large-scale population cohorts and biobank datasets
  • machine learning and bioinformatics pipelines
  • multi-omics data integration
  • novel statistical methods

Other signals

  • statistical and computational approaches
  • large-scale population cohorts and biobank datasets
  • machine learning and bioinformatics pipelines
  • multi-omics data integration
  • novel statistical methods