Abstract
2 min read<strong>Supplementary Data</strong> <strong>Tables</strong> <strong>Table S1:</strong> Virus characteristics including abbreviation, family, order, tax id, sequence length threshold (for nearly complete genomes), vaccination status, mode of transmission, circulation (human or human/zoonotic), available treatment, and disease progression (acute or chronic). <strong>Table S2:</strong> Sequence IDs of outgroup constituents. <strong>Table S3:</strong> GISAID acknowledgements for the SARS-CoV-2 sequences used in this study. <strong>Table S4:</strong> Summary of manual sequence curation indicating lab- or vaccine-related keywords used to prune sequences. <strong>Table S5:</strong> Sequence IDs for EVA and H3N2 e10, e100, d10, and d100 subtrees. <strong>Table S6:</strong> Sequence IDs for ML and GL lineages. <strong>Table S7:</strong> Sequence IDs for the H3N2 “new” and “old” subtrees. <strong>Table S8:</strong> Mutation rates (in units of nucleotide substitutions per site per year) and estimated dates for the last common ancestor of all tree/subtrees. Negative dates represent years prior to 0 CE. <strong>Table S9:</strong> Mean dN/dS values. <strong>Table S10:</strong> Yearly viral cases and generation time estimations (with references). <strong>Table S11:</strong> Metadata including the sequence ID, host, date of isolation, location of isolation, and subtype. <strong>Table S12:</strong> Overview of key parameters including number of GLs N, Ne, and MRCA for each virus species as well as the mean over all species. <strong>Directories</strong> <strong>Alignments</strong> ORFeome alignments (excluding stop codons) with the exception of SARS-CoV-2. <strong>Correlated Subtrees</strong> Subtrees representing all correlated-clades (genealogical lineages; GL) in Newick format. <strong>Diversity TMRCA and Skyline Plots</strong> .png files. <strong>e/d/n/o Subtrees</strong> EVA and H3N2 subtrees, evenly and diversely sampled as well as “early” and “late” subtrees for H3N2. <strong>Genealogical Trees</strong> Including global topologies (.main), subtrees, and grafted trees (.grafted). <strong>Maximally Diverse Subtrees:</strong> Genealogical trees sampled to contain the same number of leaves as each respective tree in the Simulated Trees directory. <strong>ORF References</strong> The first and last nucleotide of each ORF in the reference sequence. <strong>Redundancy Tables</strong> List of all redundant isolates and the corresponding representative in the ORFeome-unique alignment. <strong>Reference Sequences</strong> Genbank page for all reference sequences. <strong>Rooted Trees</strong> Global topologies for each virus. <strong>Simulated Trees:</strong> Neutral models, based on Yule-Harding, for each virus.<br> Ultrametric Trees For each global topology. <strong>Ultrametric Trees </strong>For each global topology.
Discussion(0)
No comments yet. Be the first to comment.