Science Cast

Scaling and Generalization of Discrete Diffusion Models for Tumor Phylogenies

Siddharth SabataMarch 27, 2026 3:56am

Views (1)
Comments (0)

Export Citation

Voice is AI-generated

Connected to paperThis paper is a preprint and has not been certified by peer review

Scaling and Generalization of Discrete Diffusion Models for Tumor Phylogenies

bioRxivPDFMarch 26, 2026 12:00am

Authors

Sabata, S.; Schwartz, R.

Abstract

Tumor phylogenies - rooted trees encoding clonal ancestry and mutation acquisition - are central to understanding cancer evolution, yet generating realistic phylogenies remains challenging. We investigate whether discrete graph diffusion can learn the structural constraints of tumor phylogenies directly from data. Working with approximately 12,500 synthetic phylogenies across twelve evolutionary regimes, we train graph transformer models that denoise typed graphs through a learned reverse diffusion process. Scaling experiments reveal a non-monotonic capacity-performance relationship: a mid-scale model achieves high structural validity and close distributional match to held-out data, while a deeper model fails under fixed optimization hyperparameters. Low-data cross-regime experiments show that diverse training produces more transferable representations than single-regime specialization. These results establish that phylogenetic structural constraints can be learned implicitly through unconditional discrete diffusion, suggesting a viable path toward generative models of tumor evolution.

TwitterandLinkedIn

0 comments

Add comment

Scaling and Generalization of Discrete Diffusion Models for Tumor Phylogenies

Scaling and Generalization of Discrete Diffusion Models for Tumor Phylogenies

AI-powered Paper ChatBeta

AI-powered Paper ChatBeta

0 comments