Postdoctoral Fellow – Ai-driven Multi-omics Integration for Predictive Toxicology

Pfizer · Pharma · CT

Pfizer is seeking a Postdoctoral Fellow to apply foundation models and machine learning to integrate multi-omics data for predictive toxicology. The role involves benchmarking models, building AI pipelines to connect omics readouts to pathology, and identifying biomarkers for drug safety. The goal is to improve cross-species prediction of human toxicity risks from preclinical data, supporting drug development decisions. The position emphasizes scientific contributions through publications and potentially open-source tools.

What you'd actually do

  1. Benchmark foundation and linear models against a curated cross-species omics dataset library spanning decision-relevant toxicity endpoints (liver, cardiac, hematopoietic), defining performance criteria that are meaningful for safety go/no-go decisions.
  2. Develop and validate AI-driven integration pipelines that combine multi-omics data from early toxicology studies with historical endpoints — pathology scores, clinical chemistry, and PK data — using foundation models and interpretable ML approaches.
  3. Perform retrospective compound analyses to quantify where early omics-based model outputs could have anticipated findings from GLP or Phase 1 studies and prospectively integrate targeted in vitro datasets to measure the incremental predictive value of each data modality.
  4. Implement scalable Python and/or R workflows for data ingestion, model training, evaluation, and visualization, including APIs or interactive applications to support internal stakeholder adoption
  5. Collaborate with toxicologists, pathologists, data scientists, and external partners to integrate in silico, in vitro, and in vivo results into translational safety frameworks.

Skills

Required

  • Ph.D. in computational biology, bioinformatics, computational toxicology, systems biology, or a related scientific field.
  • Experience: 0–2 years postdoctoral or post-PhD research experience (candidate must have completed doctorate within the past 2 years).
  • Proficiency in programming and data analysis using Python and/or R.
  • Strong statistical and machine learning skills for analyzing complex biological datasets.
  • Familiarity with high-dimensional biological data (such as transcriptomics, genomics, proteomics)
  • Hands-on experience with modern deep learning frameworks (e.g. PyTorch, scikit-learn)
  • Experience with single-cell or bulk RNA-seq analysis pipelines (e.g. Scanpy, Seurat, DESeq2).

Nice to have

  • basic understanding of molecular biology or toxicology
  • Exposure to representation learning, transformer-based architectures, or self-supervised learning on biological or biomedical data is a strong plus.
  • Knowledge of toxicology, pharmacology, or biomarker discovery
  • Ability to interpret and validate model results in a biological/toxicological context
  • History of interdisciplinary collaboration
  • Experience working in cross-functional research teams or with external collaborators (academia or industry) on complex data projects

What the JD emphasized

  • minimum 2-year commitment
  • first-author publication published or submitted

Other signals

  • foundation model
  • machine learning
  • multi-omics data integration
  • predictive toxicology
  • biomarker discovery