An image analysis pipeline to quantify the spatial distribution of cell markers in stroma-rich tumors

Avatar
Poster
Voice is AI-generated
Connected to paperThis paper is a preprint and has not been certified by peer review

An image analysis pipeline to quantify the spatial distribution of cell markers in stroma-rich tumors

Authors

Ruzette, A. A.; Kozlova, N.; Cruz, K. A.; Muranen, T.; Norrelykke, S. F.

Abstract

Aggressive cancers, such as pancreatic ductal adenocarcinoma (PDAC), are often characterized by a complex and desmoplastic tumor microenvironment rich in stroma, a supportive connective tissue composed primarily of extracellular matrix (ECM) and non-cancerous cells. Desmoplasia, which is a dense deposition of stroma, is a major reason for therapy resistance, acting both as a physical barrier that interferes with drug penetration and as a supportive niche that protects cancer cells through diverse mechanisms. A precise understanding of spatial cell interactions within the tumor microenvironment in stroma-rich cancers is essential for optimizing therapeutic responses. It allows detailed mapping of stromal-tumor interfaces, comprehensive phenotyping of diverse cell types and their functional states, and insights into changes in cellular distribution and tissue architecture, thus leading to an improved assessment of drug responses. Recent advances in multiplexed immunofluorescence imaging have enabled the acquisition of large batches of whole-slide tumor images, but scalable and reproducible methods to analyze the spatial distribution of cell states relative to stromal regions remain limited. To address this gap, we developed an open-source computational pipeline that integrates QuPath (Bankhead et al. 2017), StarDist (Schmidt et al. 2018), and custom Python scripts to quantify biomarker expression at a single- and sub-cellular resolution across entire tumor sections. Our workflow includes: (i) automated nuclei segmentation using StarDist, (ii) machine learning-based cell classification using multiplexed marker expression, (iii) modeling of stromal regions based on fibronectin staining, (iv) sensitivity analyses on classification thresholds to ensure robustness across heterogeneous datasets, and (v) distance-based quantification of the proximity of each cell to the stromal border. To improve consistency across slides with variable staining intensities, we introduce a statistical strategy that translates classification thresholds by propagating a chosen reference percentile across the distribution of marker-related cell measurement in each image. We apply this approach to quantify spatial patterns of distribution of the phosphorylated form of the N-Myc downregulated gene 1 (NDRG1), a novel DNA repair protein that conveys signals from the ECM to the nucleus to maintain replication fork homeostasis, and a known cell proliferation marker Ki67 in fibronectin-defined stromal regions in PDAC xenografts. The pipeline is applicable for the analysis of various stroma-rich tissues and is publicly available: https://github.com/HMS-IAC/stroma-spatial-analysis-web.

Follow Us on

0 comments

Add comment