Molecular cartography uncovers evolutionary and microenvironmental dynamics in sporadic colorectal tumors
Heiser, Cody Nicholas
0000-0002-1299-1628
:
2023-03-14
Abstract
Systems biology leverages high-dimensional molecular data at single-cell and spatial resolution to elucidate and characterize emergent phenomena in human health and disease. Through data-driven modeling and integration, large atlas-style datasets derived from human specimens can be analyzed to generate hypotheses about tissue dynamics and disease progression. Accordingly, data science and machine learning tools are instrumental in building modular data processing pipelines that yield insights including novel biomarkers for patient stratification and potential therapeutic targets. Colorectal cancer (CRC) exhibits dynamic cellular and genetic heterogeneity during progression from precursor lesions toward malignancy. Leveraging spatial molecular information to construct a phylogeographic map of tumor evolution can reveal individualized growth trajectories with diagnostic and therapeutic potential. We have developed several open-source software packages, frameworks, and protocols that enable integrative analysis of high-dimensional single-cell and spatial datasets. Applying these tools to spatial multi-omic data from 31 colorectal specimens revealed simultaneous microenvironmental and clonal alterations as a function of CRC progression. Copy number variation served to re-stratify microsatellite stable and unstable tumors into chromosomally unstable (CIN+) and hypermutated (HM) classes. Phylogeographical maps classified tumors by their evolutionary dynamics, and clonal regions were placed along a global pseudotemporal progression trajectory. Cell-state discovery from a single-cell cohort revealed recurring epithelial gene signatures and infiltrating immune states in spatial transcriptomics. Charting these states along progression pseudotime, we observed a transition to immune exclusion in CIN+ tumors as characterized by a novel gene expression signature comprised of DDR1, TGFBI, PAK4, and DPEP1. We demonstrated how these genes and their protein products are key regulators of extracellular matrix components, are associated with lower cytotoxic immune infiltration, and show prognostic value in external cohorts. Through high-dimensional data integration, this atlas provides insights into co-evolution of tumors and their microenvironments, serving as a resource for stratification and targeted treatment of CRC.