NovaClone: A Network-Based Algorithm for Clonal and Subclonal Genotyping of Barcoded Transgene Integrations
NovaClone: A Network-Based Algorithm for Clonal and Subclonal Genotyping of Barcoded Transgene Integrations
Prillo, S.; Rimini, D.; Olivares-Chauvet, P.; Song, Y. S.; Yosef, N.
AbstractSingle-cell lineage tracing technologies are providing new and powerful ways to interrogate the evolution and divergence of cell populations in cancer, development, and other contexts. A key initial step in any such analysis is the grouping of cells into clonal populations, based on clone-level marks. Unfortunately, clone calling is prone to technical effects due to sequencing errors, missing data, multiplets, background noise, and accidental sharing of clonal barcodes between unrelated clones (homoplasy). We present NovaClone, a principled algorithm for hierarchical clone calling that is broadly applicable to all current tracing technologies, including both static barcoding and the more recent evolving tracers. We benchmark NovaClone on simulated and real data to show that it outperforms the current solutions in terms of both quality and speed, thereby helping to mitigate one of the most prevalent problems with single-cell lineage tracing. To complement NovaClone, we introduce a suite of algorithm-agnostic quality control metrics to evaluate clone calls when ground truth is not available. NovaClone and the associated QCs are available through the open source Python package nova-clone.