SIEVE: joint inference of single-nucleotide variants and cell phylogeny from single-cell DNA sequencing data
Preprint 2022 en
Authors
SK
Senbai Kang
NB
Nico Borgsmüller
MV
Monica Valecha
Abstract
1 min read
Abstract Single-cell DNA sequencing (scDNA-seq) has enabled the identification of single nucleotide somatic variants and the reconstruction of cell phylogenies. However, statistical phylogenetic models for cell phylogeny reconstruction from raw sequencing data are still in their infancy. Here we present SIEVE (SIngle-cell EVolution Explorer), a statistical method for the joint inference of somatic variants and cell phylogeny under the finite-sites assumption from scDNA-seq reads. SIEVE leverages raw read counts for all nucleotides at candidate variant sites, and corrects the acquisition bias of branch lengths. In our simulations, SIEVE outperforms other methods both in phylogenetic accuracy and variant calling accuracy. We apply SIEVE to three scDNA-seq datasets, for colorectal (CRC) and triple-negative breast cancer (TNBC), one of them generated by us. On simulated data, SIEVE reliably infers homo-and heterozygous somatic variants. The analysis of real data uncovers that double mutant genotypes are rare in CRC but unexpectedly frequent in TNBC samples.
Senbai Kang, Nico Borgsmüller, Monica Valecha, Jack Kuipers, João M. Alves, Sonia Prado‐Lòpez, Débora Chantada, Niko Beerenwinkel, David Posada, Ewa Szczurek
Discussion(0)
No comments yet. Be the first to comment.