Readings

Papers are organized by course module and topic. Some papers appear under multiple headings where relevant. Papers marked with ⭐ are especially recommended as entry points to a topic.

Background: Causality & Generalization in Biohealth

Module I, Weeks 1–2

⭐ From Statistical to Causal Learning (Schölkopf et al., 2022)
⭐ Causality in digital medicine (Glocker et al., 2021)
⭐ The Seven Tools of Causal Inference, with Reflections on Machine Learning (Pearl, 2019)
Hernán & Robins, “Causal Inference: What If” — Chapters 1–3 (Hernán & Robins, 2020)
⭐ Toward a foundation model of causal cell and tissue biology with a Perturbation Cell and Tissue Atlas (Rood et al., 2024)
Causal modelling of gene effects from regulators to programs to traits (Ota et al., 2025)
Causal machine learning for single-cell genomics (Tejada-Lapuerta et al., 2025)
⭐ Applications of machine learning in drug discovery and development (Vamathevan et al., 2019)
Variable generalization performance of a deep learning model to detect pneumonia in chest radiographs (Zech et al., 2018)
Clinical AI tools must convey predictive uncertainty for each individual patient (Banerji et al., 2023)
From development to deployment: dataset shift, causality, and shift-stable models in health AI (Subbaswamy & Saria, 2020)
The Clinician and Dataset Shift in Artificial Intelligence (Finlayson et al., 2021)
Buyer Beware: confounding factors and biases abound when predicting omics-based biomarkers from histological images (Dawood et al., 2024)
Domain-Invariant Feature Learning for Patient-Level Phenotype Prediction from Single-Cell Data (Perez et al., 2025)
Mendelian randomization: genetic anchors for causal inference in epidemiological studies (Davey Smith & Hemani, 2014)

Hybrid & Mechanistic Modeling

Module II, Week 3

Foundational Methods

⭐ Neural Ordinary Differential Equations (Chen et al., 2018)
⭐ Universal Differential Equations for Scientific Machine Learning (Rackauckas et al., 2020)
⭐ Physics-informed neural networks: A deep learning framework for solving forward and inverse problems involving nonlinear partial differential equations (Raissi et al., 2019)
Neural Controlled Differential Equations for Irregular Time Series (Kidger et al., 2020)
Latent ODEs for Irregularly-Sampled Time Series (Rubanova et al., 2019)

Reviews & Surveys

⭐ Physics-informed machine learning (Karniadakis et al., 2021)
Integrating Scientific Knowledge with Machine Learning for Engineering and Environmental Systems (Willard et al., 2022)
⭐ Breiman’s Two Cultures: You Don’t Have to Choose Sides (Miller, Foti & Fox, 2021)

Clinical & Biomedical Applications

⭐ Learning Insulin-Glucose Dynamics in the Wild (Miller, Foti & Fox, 2020)
⭐ Hybrid² Neural ODE Causal Modeling and an Application to Glycemic Response (Zou et al., 2024)
Interpretable Mechanistic Representations for Meal-level Glycemic Control in the Wild (Wang & Fox, 2023)
⭐ Integrating Expert ODEs into Neural ODEs: Pharmacology and Disease Progression (Qian et al., 2021)
Neural Pharmacodynamic State Space Modeling (Hussain et al., 2021)
Real-time electronic health record mortality prediction during the COVID-19 pandemic: A prospective cohort study (Sottile et al., 2021)
Conditional universal differential equations capture population dynamics and interindividual variation in c-peptide production (de Rooij et al., 2025)
Low-dimensional neural ODEs and their application in pharmacokinetics (Bräm et al., 2023)
Modeling personalized heart rate response to exercise and environmental factors with wearables data (Nazaret et al., 2023)
A foundation model for continuous glucose monitoring data (Lutsker et al., 2025)
Large-scale Training of Foundation Models for Wearable Biosignals (Abbaspourazad et al., 2023)
Causal Graph ODE: Continuous Treatment Effect Modeling in Multi-agent Dynamical Systems (Huang et al., 2024)
Trajectory Flow Matching with Applications to Clinical Time Series Modeling (Huang et al., 2024)
SPADE: Inferring Transcriptional Dynamics from Spatial Transcriptomics with Physics-Informed Deep Learning (Wang et al., 2025)
Data-driven causal model discovery and personalized prediction in Alzheimer’s disease (Zheng et al., 2022)

Virtual Cell Models & Digital Twins

Modules I/III, Weeks 2 & 6

⭐ How to build the virtual cell with artificial intelligence: Priorities and opportunities (Bunne et al., 2024)
⭐ Virtual Cells as Causal World Models: A Perspective on Evaluation (Callahan et al., 2025)
Virtual Cells Need Context, Not Just Scale (Dibaeinia et al., 2026)
TwinCell: Large Causal Cell Model for Reliable and Interpretable Therapeutic Target Prioritisation (Morlot et al., 2026)
X-CELL (Wang et al., 2026)
Stack: Simulating cellular conditions via prompt engineering (Dong et al., 2026)
Digital twin to enable the vision of precision cardiology (Corral-Acero et al., 2020)

Causal Representation Learning

Module III, Week 4

⭐ Towards Causal Representation Learning (Schölkopf et al., 2021)
⭐ Invariant Risk Minimization (Arjovsky et al., 2019)
⭐ Causal inference using invariant prediction (Peters et al., 2016)
Causal Structure and Representation Learning with Biomedical Applications (Uhler et al., 2025)
Identifiability Guarantees for Causal Disentanglement from Soft Interventions (Zhang et al., 2023)
Identifiability Guarantees for Causal Disentanglement from Purely Observational Data (Welch et al., 2024)
Unpaired Multi-Domain Causal Representation Learning (Sturma et al., 2023)
Learning Causal Representations of Single Cells via Sparse Mechanism Shift Modeling (Lopez et al., 2022)
The Third Pillar of Causal Analysis? A Measurement Perspective on Causal Representations (Yao et al., 2025)
Towards Causal Foundation Model: on Duality between Causal Inference and Attention (Zhang et al., 2023)
Use What You Know: Causal Foundation Models with Partial Graphs (Robertson et al., 2026)
An Information Criterion for Controlled Disentanglement of Multimodal Data (Wang et al., 2024)
Causal disentanglement for single-cell representations and controllable counterfactual generation (Gao et al., 2024)
oi-VAE: Output Interpretable VAEs for Nonlinear Group Factor Analysis (Ainsworth et al., 2018)

Learning from Interventional Data

Module III, Weeks 5–6

Perturbation Response Prediction

⭐ Learning single-cell perturbation responses using neural optimal transport (Bunne et al., 2023)
⭐ Learning Genetic Perturbation Effects with Variational Causal Inference (Liu et al., 2025)
scGen predicts single-cell perturbation responses (Lotfollahi et al., 2019)
In silico biological discovery with large perturbation models (Miladinovic et al., 2025)
Combinatorial prediction of therapeutic perturbations using causally inspired neural networks (Gonzalez et al., 2025)
Few-shot learning creates predictive models of drug response that translate from high-throughput screens to individual patients (Ma et al., 2021)
Latent Causal Diffusions for Single-Cell Perturbation Modeling (Lorch et al., 2026)
Predicting how perturbations reshape cellular trajectories with PerturbGen (Ly et al., 2026)
MORPH: Predicting the Single-Cell Outcome of Genetic Perturbations Across Conditions and Data Modalities (He et al., 2025)
Predicting cellular responses to perturbation across diverse contexts with State (Adduri et al., 2025)
TxPert: Leveraging Biochemical Relationships for Out-of-Distribution Transcriptomic Perturbation Prediction (Wenkel et al., 2025)
SynthPert: Enhancing LLM Biological Reasoning via Synthetic Reasoning Traces for Cellular Perturbation Prediction (Phillips et al., 2025)
scGenePT: Is language all you need for modeling single-cell perturbations? (Istrate et al., 2024)
Fast and scalable Wasserstein-1 neural optimal transport solver for single-cell perturbation prediction (Chen et al., 2024)
Cross-modality Matching and Prediction of Perturbation Responses with Labeled Gromov-Wasserstein Optimal Transport (Ryu et al., 2024)
Mapping and reprogramming human tissue microenvironments with MintFlow (Akbarnejad et al., 2025)
Predicting and interpreting cell-type-specific drug responses in the small-data regime using inductive priors (Alsulami et al., 2026)

Counterfactual Inference in Single Cells

Integrating multi-covariate disentanglement with counterfactual analysis (CellDISECT) (Megas et al., 2025)
Causal Disentanglement of Treatment Effects in Single-Cell RNA Sequencing Through Counterfactual Inference (An et al., 2025)
Causal disentanglement for single-cell representations and controllable counterfactual generation (Gao et al., 2024)
Causal differential expression analysis under unmeasured confounders with causarray (Du et al., 2025)
Single-cell disentangled representations for perturbation modeling and treatment effect estimation (Sun et al., 2025)

Generative Models for Cellular Morphology

CellFlux: Simulating Cellular Morphology Changes via Flow Matching (Zhang et al., 2025)
Predicting cell morphological responses to perturbations using generative modeling (IMPA) (Palma et al., 2023)
MorphoDiff: Cellular Morphology Painting with Diffusion Models (Navidi et al., 2024)
Modeling Complex System Dynamics with Multi-Marginal Flow Matching (Rohbeck et al., 2025)

Seminal Experimental Papers

⭐ Perturb-Seq: Dissecting Molecular Circuits with Scalable Single-Cell RNA Profiling of Pooled Genetic Screens (Dixit et al., 2016)
⭐ Mapping information-rich genotype-phenotype landscapes with genome-scale Perturb-seq (Replogle et al., 2022)

Experimental Design & Active Learning

⭐ Active learning for optimal intervention design in causal models (Zhang et al., 2023)

Methods & Reviews

Optimal transport for single-cell and spatial omics (Bunne et al., 2024)

Foundation Models for Cell Biology & Biomedicine

Module III, Week 6

⭐ Towards multimodal foundation models in molecular cell biology (Cui et al., 2025)
⭐ Toward learning a foundational representation of cells and genes (Lotfollahi et al., 2024)
Large-scale foundation model on single-cell transcriptomics (scFoundation) (Hao et al., 2024)
Scaling Large Language Models for Next-Generation Single-Cell Analysis (Rizvi et al., 2025)
A Cross-Species Generative Cell Atlas (TranscriptFormer) (Pearce et al., 2025)
GREmLN: A Cellular Regulatory Network-Aware Transcriptomics Foundation Model (Zhang et al., 2025)
ViTally Consistent: Scaling Biological Representation Learning for Cell Microscopy (Kenyon-Dean et al., 2024)
SubCell: Proteome-aware vision foundation models for microscopy (Gupta et al., 2024)
scVAEDer: integrating deep diffusion models and variational autoencoders for single-cell transcriptomics (Sadria et al., 2025)
AUTOENCODIX: a framework to train and evaluate autoencoders for biological representation learning (Joas et al., 2025)
Group Contrastive Learning for Weakly Paired Multimodal Data (GROOVE) (Gorla et al., 2026)
A Multimodal Foundation Model for Discovering Genetic Associations with Brain Imaging Phenotypes (Machado Reyes et al., 2024)
GraphComm predicts cell-cell communication using a graph-based deep learning method in single-cell RNA sequencing data (So et al., 2025)

Causal Discovery

Module III, cross-cutting

⭐ Large-scale causal discovery using interventional data sheds light on gene network structure (Barth et al., 2025)
Sample, estimate, aggregate: A recipe for causal discovery foundation models (Wu et al., 2024)
DynGFN: Towards Bayesian Inference of Gene Regulatory Networks with GFlowNets (Atanackovic et al., 2023)
Efficient Differentiable Discovery of Causal Order (Chevalley et al., 2024)
Causal gene regulatory network inference from Perturb-seq via adaptive instrumental variable modeling (Sun et al., 2026)
Phenome-scale causal network discovery with bidirectional mediated Mendelian randomization (Brown et al., 2020)
Data-driven causal model discovery and personalized prediction in Alzheimer’s disease (Zheng et al., 2022)
⭐ Neural Granger Causality (Tank et al., 2021)

Causal Transportability & External Validity

Module IV, Week 8

⭐ External Validity: From Do-Calculus to Transportability Across Populations (Pearl & Bareinboim, 2014)
⭐ Causal inference and the data-fusion problem (Bareinboim & Pearl, 2016)
A Review of Generalizability and Transportability (Degtiar & Rose, 2023)
Extending inferences from a randomized trial to a target population (Dahabreh et al., 2019)
Synthetic and external controls in clinical trials (Thorlund et al., 2020)
⭐ Virtual Cells Need Context, Not Just Scale (Dibaeinia et al., 2026)
Deciphering causal proteins in Alzheimer’s disease: Mendelian randomization integrated with AlphaFold3 (Yao et al., 2023)

Policy Learning & Treatment Optimization

Module IV, Week 7

⭐ Learning Explainable Treatment Policies with Clinician-Informed Representations: A Practical Approach (Ferstad et al., 2024)
⭐ Guidelines for Reinforcement Learning in Healthcare (Gottesman et al., 2019)
Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models (Oberst & Sontag, 2019)
Confounding-Robust Policy Evaluation in Infinite-Horizon Reinforcement Learning (Kallus & Zhou, 2020)

Mechanistic & Biological Interpretability

Module III, Week 6

⭐ Sparse autoencoders uncover biologically interpretable features in protein language model representations (Gujral et al., 2025)
Learning biologically relevant features in a pathology foundation model using sparse autoencoders (Le et al., 2024)
MechSci: Scaling Clinical Science via Mechanistic Interpretability of Multimodal Medical Foundation Models (2025)
Beyond the Black Box: Identifiable Interpretation and Control in Generative Models via Causal Minimality (Kong et al., 2025)
Biologically Guided Variational Inference for Interpretable Multimodal Single-Cell Integration (Arnoldt et al., 2025)
Interpretable single-cell factor decomposition using sciRED (Pouyabahar et al., 2025)

Agentic AI & LLMs for Scientific Discovery

Module V, Week 9

⭐ Empowering biomedical discovery with AI agents (Gao et al., 2024)
The Virtual Biotech: A Multi-Agent AI Framework for Therapeutic Discovery (Zhang et al., 2026)
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools (Gao et al., 2025)
BioDiscoveryAgent: An AI Agent for Designing Genetic Perturbation Experiments (Roohani et al., 2024)
Kosmos: An AI Scientist for Autonomous Discovery (Mitchener et al., 2025)
BioReason: Incentivizing Multimodal Biological Reasoning within a DNA-LLM Model (Fallahpour et al., 2025)
rbio1: training scientific reasoning LLMs with biological world models (Istrate et al., 2025)
The Anatomy of a Personal Health Agent (Heydari et al., 2025)
Causal AI Scientist: Facilitating Causal Data Science with Large Language Models (Verma et al., 2025)
CURE-Bench: AI Reasoning for Therapeutics (Zitnik et al., 2025)
How Well Do Multimodal Models Reason on ECG Signals? (Xu et al., 2026)

Benchmarking & Evaluation

Cross-cutting

⭐ Deep learning-based predictions of gene perturbation effects do not yet outperform simple linear baselines (Ahlmann-Eltze et al., 2024)
Foundation Models Improve Perturbation Response Prediction (Cole et al., 2026)
Systema: a framework for evaluating genetic perturbation response prediction beyond systematic variation (Torné et al., 2025)
Limitations of cell embedding metrics assessed using drifting islands (Wang et al., 2025)
Zero-shot evaluation reveals limitations of single-cell foundation models (Kedzierska et al., 2025)
scGeneScope: A Treatment-Matched Single Cell Imaging and Transcriptomics Dataset and Benchmark for Treatment Response Modeling (Dapello et al., 2025)
BixBench: a Comprehensive Benchmark for LLM-based Agents in Computational Biology (Mitchener et al., 2025)
Random feature baselines provide distributional performance benchmarks for clinical and omic machine learning (Ellis et al., 2024)
Out-of-distribution evaluations of channel agnostic masked autoencoders in fluorescence microscopy (Hurry et al., 2025)