[PubMed automatic search; some publications might not appear]
2026
Cancer evolution and multi-omic profile of relapsed colorectal liver metastases after treatment
Cancer evolution and multi-omic profile of relapsed colorectal liver metastases after treatment
Recurrence following resection of colorectal cancer liver metastases remains a major obstacle to prolonged patient survival, often resulting in treatment-refractory disease with limited understanding of the underlying evolutionary drivers. To investigate these mechanisms, we performed an in-depth, patient-specific study of genomic and microenvironmental alterations in relapsed colorectal cancer liver metastases from nine individuals.
Unraveling the Phylogenetic Signal of Gene Expression from Single-cell RNA-seq Data
Unraveling the Phylogenetic Signal of Gene Expression from Single-cell RNA-seq Data
Single-cell RNA sequencing (scRNA-seq) has transformed our understanding of phenotypic heterogeneity. Although the predominant focus of scRNA-seq analyses has been assessing gene expression changes, several approaches have been proposed in recent years to identify changes at the DNA level from scRNA-seq data. In this study, we evaluated the relative performance of six strategies for calling single-nucleotide variants from scRNA-seq data using 381 single-cell transcriptomes from five cancer patients. Specifically, we focused on the quality of the inferred genotypes and the resulting single-cell phylogenies. We found that scAllele, Monopogen, and Monovar consistently returned phylogenetically informative genotype calls, providing more precise signals of discrimination between tumor and normal cells within heterogeneous samples and among distinct subclonal lineages in longitudinal samples. In addition, we evaluated the evolution of gene expression along the cell phylogenies. While most transcriptomic variation was very plastic and did not correlate with the cell phylogeny, a group of genes associated with cell cycle processes showed a strong phylogenetic signal in one of the patients, underscoring a potential link between gene expression patterns and lineage-specific traits in the context of cancer progression. In summary, our study highlights the potential of scRNA-seq data for inferring cell phylogenies to decipher the evolutionary dynamics of cell populations.
Whole-Genome Amplification of Single Circulating Tumor Cells from Mice Xenografts
Whole-Genome Amplification of Single Circulating Tumor Cells from Mice Xenografts
The genomic analysis of circulating tumor cells (CTCs) can help us identify their specific characteristics and reveal intratumor heterogeneity while also reducing the need for invasive sampling. However, studying single CTCs can be challenging, as whole-genome amplification (WGA) is needed to obtain enough genetic material for high-throughput sequencing. Here, we present a protocol for isolating single CTCs from mouse xenografts, followed by WGA using the Ampli1WGA Plus kit.
2025
Genomic diversity and BCL9L mutational status in circulating tumor cells predict overall survival in metastatic colorectal cancer
Genomic diversity and BCL9L mutational status in circulating tumor cells predict overall survival in metastatic colorectal cancer
Metastatic colorectal cancer (mCRC) remains a major cause of cancer-related mortality, but few noninvasive biomarkers exist to track disease progression or inform treatment strategies. Circulating tumor cells (CTCs) offer a minimally invasive source of tumor material, yet the prognostic significance of their genomic diversity remains unclear.
DelSIEVE: cell phylogeny modeling of single nucleotide variants and deletions from single-cell DNA sequencing data
DelSIEVE: cell phylogeny modeling of single nucleotide variants and deletions from single-cell DNA sequencing data
With rapid advancements in single-cell DNA sequencing (scDNA-seq), various computational methods have been developed to study evolution and call variants on single-cell level. However, modeling deletions remains challenging because they affect total coverage in ways that are difficult to distinguish from technical artifacts. We present DelSIEVE, a statistical method that infers cell phylogeny and single-nucleotide variants, accounting for deletions, from scDNA-seq data. DelSIEVE distinguishes deletions from mutations and artifacts, detecting more evolutionary events than previous methods. Simulations show high performance, and application to cancer samples reveals varying amounts of deletions and double mutants in different tumors.
International importance and spread of SARS-CoV-2 variants Alpha, Delta, and Omicron BA.1 into Spain
International importance and spread of SARS-CoV-2 variants Alpha, Delta, and Omicron BA.1 into Spain
The spread of SARS-CoV-2 has been influenced by multiple factors, from the inherent transmission capabilities of the different variants to the control measurements put in place. Understanding how new variants enter a country is essential for managing future outbreaks. This study investigates how three major variants-Alpha, Delta, and Omicron (BA.1)-entered Spain and how different restrictions potentially affected their introduction.
A Comparison of Methods for the Optimal Recovery of the Human Fecal Virome
A Comparison of Methods for the Optimal Recovery of the Human Fecal Virome
Human virome studies are gaining attention as viruses are increasingly acknowledged as key modulators of microbial communities and human health. However, viral metagenomics presents distinct challenges, including the low abundance and diversity of viruses in biological samples, the lack of universal marker genes, and protocol-induced biases. Although various virome protocols have been benchmarked using viral particles or nucleic acids from mock communities, these often fail to replicate the complexity and heterogeneity of natural viromes. In this study, we systematically evaluated protocol modifications for the metagenomic analysis of human fecal samples, testing alternatives for viral enrichment, nucleic acid extraction, genome amplification, and library preparation. We assessed the impact of each modification on key inferences, including taxonomic and functional assignment, contig quality, viral diversity, and genome structure. Our results highlight important trade-offs between viral genome recovery and contamination removal, underscoring how methodological choices can shape virome composition. Based on our findings, we propose an optimized protocol that enhances the recovery of viral DNA and RNA genomes while minimizing contamination from non-viral sequences, providing a robust framework for future gut virome studies.
Differential performance of strategies for single-cell whole-genome amplification
Differential performance of strategies for single-cell whole-genome amplification
Single-cell genomics enables studying tissues and organisms at the highest resolution. However, since a cell contains a small amount of DNA, single-cell DNA sequencing (scDNA-seq) typically requires single-cell whole-genome amplification (scWGA). Unfortunately, scWGA methods introduce technical biases that complicate the interpretation of scDNA-seq data. We compared six scWGA methods, three MDA (multiple displacement amplification; GenomiPhi, REPLI-g, and TruePrime) and three non-MDA (Ampli1, MALBAC, and PicoPLEX), on 206 tumoral and 24 healthy human cells. scWGA methods performed differently depending on the parameter of interest. REPLI-g minimized regional amplification bias, while non-MDA methods showed a more uniform and reproducible amplification. Ampli1 exhibited the lowest allelic imbalance and dropout, the most accurate insertion or deletion (indel) and copy-number detection, and a low polymerase error rate. However, REPLI-g yielded higher DNA quantities, longer amplicons, and greater genome coverage. We offer a comprehensive guide for selecting a scWGA approach, outlining trade-offs that influence the interpretation of scDNA-seq data.
Intraline genomic heterogeneity of the triple-negative breast cancer MDA-MB-231-luc-GFP cell line
Intraline genomic heterogeneity of the triple-negative breast cancer MDA-MB-231-luc-GFP cell line
Cancer cell lines are valuable models for studying tumor biology, yet their genomic evolution during culture can compromise experimental reproducibility. We conducted a detailed genomic analysis of the triple-negative breast cancer cell line MDA-MB-231-luc-GFP, examining sublines obtained from different sources, at various time points, and across distinct passages. We introduce the concept of intraline heterogeneity (ILH) to highlight the genomic variability observed among these sublines. Our analyses revealed extensive genomic diversity, including differences in single-nucleotide variants (SNVs) and copy number alterations (CNAs). In particular, CNAs exhibited remarkable heterogeneity, with pronounced chromosomal gains and losses between sublines, underscoring the impact of genomic instability on ILH. These findings suggest that ILH may influence experimental outcomes, emphasizing the importance of considering passage-specific genomic characterization to ensure consistency and reliability in cancer research.
2024
Dispersal history of SARS-CoV-2 in Galicia, Spain
Dispersal history of SARS-CoV-2 in Galicia, Spain
The dynamics of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission are influenced by a variety of factors, including social restrictions and the emergence of distinct variants. In this study, we delve into the origins and dissemination of the Alpha, Delta, and Omicron-BA.1 variants of concern in Galicia, northwest Spain. For this, we leveraged genomic data collected by the EPICOVIGAL Consortium and from the GISAID database, along with mobility information from other Spanish regions and foreign countries. Our analysis indicates that initial introductions during the Alpha phase were predominantly from other Spanish regions and France. However, as the pandemic progressed, introductions from Portugal and the United States became increasingly significant. The number of detected introductions varied from 96 and 101 for Alpha and Delta to 39 for Omicron-BA.1. Most of these introductions left a low number of descendants (<10), suggesting a limited impact on the evolution of the pandemic in Galicia. Notably, Galicia's major coastal cities emerged as critical hubs for viral transmission, highlighting their role in sustaining and spreading the virus. This research emphasizes the critical role of regional connectivity in the spread of SARS-CoV-2 and offers essential insights for enhancing public health strategies and surveillance measures.
Dispersal history of SARS-CoV-2 variants Alpha, Delta, and Omicron (BA.1) in Spain
Dispersal history of SARS-CoV-2 variants Alpha, Delta, and Omicron (BA.1) in Spain
Different factors influence the spread of SARS-CoV-2, from the inherent transmission capabilities of the different variants to the control measurements put in place. Here we studied the introduction of the Alpha, Delta, and Omicron-BA.1 variants of concern (VOCs) into Spain. For this, we collected genomic data from the GISAID database and combined it with connectivity data from different countries with Spain to perform a phylodynamic Bayesian analysis of the introductions. Our findings reveal that the introductions of these VOCs predominantly originated from France, especially in the case of Alpha. As travel restrictions were eased during the Delta and Omicron-BA.1 waves, the number of introductions from distinct countries increased, with the United Kingdom and Germany becoming significant sources of the virus. The largest number of introductions detected corresponded to the Delta wave, which was associated with fewer restrictions and the summer period, when Spain receives a considerable number of tourists. This research underscores the importance of monitoring international travel patterns and implementing targeted public health measures to manage the spread of SARS-CoV-2.
Crykey: Rapid identification of SARS-CoV-2 cryptic mutations in wastewater
Crykey: Rapid identification of SARS-CoV-2 cryptic mutations in wastewater
Wastewater surveillance for SARS-CoV-2 provides early warnings of emerging variants of concerns and can be used to screen for novel cryptic linked-read mutations, which are co-occurring single nucleotide mutations that are rare, or entirely missing, in existing SARS-CoV-2 databases. While previous approaches have focused on specific regions of the SARS-CoV-2 genome, there is a need for computational tools capable of efficiently tracking cryptic mutations across the entire genome and investigating their potential origin. We present Crykey, a tool for rapidly identifying rare linked-read mutations across the genome of SARS-CoV-2. We evaluated the utility of Crykey on over 3,000 wastewater and over 22,000 clinical samples; our findings are three-fold: i) we identify hundreds of cryptic mutations that cover the entire SARS-CoV-2 genome, ii) we track the presence of these cryptic mutations across multiple wastewater treatment plants and over three years of sampling in Houston, and iii) we find a handful of cryptic mutations in wastewater mirror cryptic mutations in clinical samples and investigate their potential to represent real cryptic lineages. In summary, Crykey enables large-scale detection of cryptic mutations in wastewater that represent potential circulating cryptic lineages, serving as a new computational tool for wastewater surveillance of SARS-CoV-2.
Dispersal history of SARS-CoV-2 in Galicia, Spain
Dispersal history of SARS-CoV-2 in Galicia, Spain
The dynamics of SARS-CoV-2 transmission are influenced by a variety of factors, including social restrictions and the emergence of distinct variants. In this study, we delve into the origins and dissemination of the Alpha, Delta, and Omicron variants of concern in Galicia, northwest Spain. For this, we leveraged genomic data collected by the EPICOVIGAL Consortium and from the GISAID database, along with mobility information from other Spanish regions and foreign countries. Our analysis indicates that initial introductions during the Alpha phase were predominantly from other Spanish regions and France. However, as the pandemic progressed, introductions from Portugal and the USA became increasingly significant. Notably, Galicia's major coastal cities emerged as critical hubs for viral transmission, highlighting their role in sustaining and spreading the virus. This research emphasizes the critical role of regional connectivity in the spread of SARS-CoV-2 and offers essential insights for enhancing public health strategies and surveillance measures.
2023
Evolutionary and spatiotemporal analyses reveal multiple introductions and cryptic transmission of SARS-CoV-2 VOC/VOI in Malta
Evolutionary and spatiotemporal analyses reveal multiple introductions and cryptic transmission of SARS-CoV-2 VOC/VOI in Malta
Our study provides insights into the evolution of the coronavirus disease 2019 (COVID-19) pandemic in Malta, a highly connected and understudied country. We combined epidemiological and phylodynamic analyses to analyze trends in the number of new cases, deaths, tests, positivity rates, and evolutionary and dispersal patterns from August 2020 to January 2022. Our reconstructions inferred 173 independent severe acute respiratory syndrome coronavirus 2 introductions into Malta from various global regions. Our study demonstrates that characterizing epidemiological trends coupled with phylodynamic modeling can inform the implementation of public health interventions to help control COVID-19 transmission in the community.
Somatic evolution of marine transmissible leukemias in the common cockle, Cerastoderma edule
Somatic evolution of marine transmissible leukemias in the common cockle, Cerastoderma edule
Transmissible cancers are malignant cell lineages that spread clonally between individuals. Several such cancers, termed bivalve transmissible neoplasia (BTN), induce leukemia-like disease in marine bivalves. This is the case of BTN lineages affecting the common cockle, Cerastoderma edule, which inhabits the Atlantic coasts of Europe and northwest Africa. To investigate the evolution of cockle BTN, we collected 6,854 cockles, diagnosed 390 BTN tumors, generated a reference genome and assessed genomic variation across 61 tumors. Our analyses confirmed the existence of two BTN lineages with hemocytic origins. Mitochondrial variation revealed mitochondrial capture and host co-infection events. Mutational analyses identified lineage-specific signatures, one of which likely reflects DNA alkylation. Cytogenetic and copy number analyses uncovered pervasive genomic instability, with whole-genome duplication, oncogene amplification and alkylation-repair suppression as likely drivers. Satellite DNA distributions suggested ancient clonal origins. Our study illuminates long-term cancer evolution under the sea and reveals tolerance of extreme instability in neoplastic genomes.
Crykey: Rapid Identification of SARS-CoV-2 Cryptic Mutations in Wastewater
Crykey: Rapid Identification of SARS-CoV-2 Cryptic Mutations in Wastewater
We present Crykey, a computational tool for rapidly identifying cryptic mutations of SARS-CoV-2. Specifically, we identify co-occurring single nucleotide mutations on the same sequencing read, called linked-read mutations, that are rare or entirely missing in existing databases, and have the potential to represent novel cryptic lineages found in wastewater. While previous approaches exist for identifying cryptic linked-read mutations from specific regions of the SARS-CoV-2 genome, there is a need for computational tools capable of efficiently tracking cryptic mutations across the entire genome and for tens of thousands of samples and with increased scrutiny, given their potential to represent either artifacts or hidden SARS-CoV-2 lineages. Crykey fills this gap by identifying rare linked-read mutations that pass stringent computational filters to limit the potential for artifacts. We evaluate the utility of Crykey on >3,000 wastewater and >22,000 clinical samples; our findings are three-fold: i) we identify hundreds of cryptic mutations that cover the entire SARS-CoV-2 genome, ii) we track the presence of these cryptic mutations across multiple wastewater treatment plants and over a three years of sampling in Houston, and iii) we find a handful of cryptic mutations in wastewater mirror cryptic mutations in clinical samples and investigate their potential to represent real cryptic lineages. In summary, Crykey enables large-scale detection of cryptic mutations representing potential cryptic lineages in wastewater.
Single-cell phylogenies reveal changes in the evolutionary rate within cancer and healthy tissues
Single-cell phylogenies reveal changes in the evolutionary rate within cancer and healthy tissues
Cell lineages accumulate somatic mutations during organismal development, potentially leading to pathological states. The rate of somatic evolution within a cell population can vary due to multiple factors, including selection, a change in the mutation rate, or differences in the microenvironment. Here, we developed a statistical test called the Poisson Tree (PT) test to detect varying evolutionary rates among cell lineages, leveraging the phylogenetic signal of single-cell DNA sequencing (scDNA-seq) data. We applied the PT test to 24 healthy and cancer samples, rejecting a constant evolutionary rate in 11 out of 15 cancer and five out of nine healthy scDNA-seq datasets. In six cancer datasets, we identified subclonal mutations in known driver genes that could explain the rate accelerations of particular cancer lineages. Our findings demonstrate the efficacy of scDNA-seq for studying somatic evolution and suggest that cell lineages often evolve at different rates within cancer and healthy tissues.
Wastewater early warning system for SARS-CoV-2 outbreaks and variants in a Coruña, Spain
Wastewater early warning system for SARS-CoV-2 outbreaks and variants in a Coruña, Spain
Wastewater-based epidemiology has been widely used as a cost-effective method for tracking the COVID-19 pandemic at the community level. Here we describe COVIDBENS, a wastewater surveillance program running from June 2020 to March 2022 in the wastewater treatment plant of Bens in A Coruña (Spain). The main goal of this work was to provide an effective early warning tool based in wastewater epidemiology to help in decision-making at both the social and public health levels. RT-qPCR procedures and Illumina sequencing were used to weekly monitor the viral load and to detect SARS-CoV-2 mutations in wastewater, respectively. In addition, own statistical models were applied to estimate the real number of infected people and the frequency of each emerging variant circulating in the community, which considerable improved the surveillance strategy. Our analysis detected 6 viral load waves in A Coruña with concentrations between 10 and 10 SARS-CoV-2 RNA copies/L. Our system was able to anticipate community outbreaks during the pandemic with 8-36 days in advance with respect to clinical reports and, to detect the emergence of new SARS-CoV-2 variants in A Coruña such as Alpha (B.1.1.7), Delta (B.1.617.2), and Omicron (B.1.1.529 and BA.2) in wastewater with 42, 30, and 27 days, respectively, before the health system did. Data generated here helped local authorities and health managers to give a faster and more efficient response to the pandemic situation, and also allowed important industrial companies to adapt their production to each situation. The wastewater-based epidemiology program developed in our metropolitan area of A Coruña (Spain) during the SARS-CoV-2 pandemic served as a powerful early warning system combining statistical models with mutations and viral load monitoring in wastewater over time.
2022
Comparative analysis of capture methods for genomic profiling of circulating tumor cells in colorectal cancer
Comparative analysis of capture methods for genomic profiling of circulating tumor cells in colorectal cancer
The genomic profiling of circulating tumor cells (CTCs) in the bloodstream should provide clinically relevant information on therapeutic efficacy and help predict cancer survival. Here, we contrasted the genomic profiles of CTC pools recovered from metastatic colorectal cancer (mCRC) patients using different enrichment strategies (CellSearch, Parsortix, and FACS). Mutations inferred in the CTC pools differed depending on the enrichment strategy and, in all cases, represented a subset of the mutations detected in the matched primary tumor samples. However, the CTC pools from Parsortix, and in part, CellSearch, showed diversity estimates, mutational signatures, and drug-suitability scores remarkably close to those found in matching primary tumor samples. In addition, FACS CTC pools were enriched in apparent sequencing artifacts, leading to much higher genomic diversity estimates. Our results highlight the utility of CTCs to assess the genomic heterogeneity of individual tumors and help clinicians prioritize drugs in mCRC.
SIEVE: joint inference of single-nucleotide variants and cell phylogeny from single-cell DNA sequencing data
SIEVE: joint inference of single-nucleotide variants and cell phylogeny from single-cell DNA sequencing data
We present SIEVE, a statistical method for the joint inference of somatic variants and cell phylogeny under the finite-sites assumption from single-cell DNA sequencing. SIEVE leverages raw read counts for all nucleotides and corrects the acquisition bias of branch lengths. In our simulations, SIEVE outperforms other methods in phylogenetic reconstruction and variant calling accuracy, especially in the inference of homozygous variants. Applying SIEVE to three datasets, one for triple-negative breast (TNBC), and two for colorectal cancer (CRC), we find that double mutant genotypes are rare in CRC but unexpectedly frequent in the TNBC samples.
Clonality and timing of relapsing colorectal cancer metastasis revealed through whole-genome single-cell sequencing
Clonality and timing of relapsing colorectal cancer metastasis revealed through whole-genome single-cell sequencing
Recurrence of tumor cells following local and systemic therapy is a significant hurdle in cancer. Most patients with metastatic colorectal cancer (mCRC) will relapse, despite resection of the metastatic lesions. A better understanding of the evolutionary history of recurrent lesions is required to identify the spatial and temporal patterns of metastatic progression and expose the genetic and evolutionary determinants of therapeutic resistance. With this goal in mind, here we leveraged a unique single-cell whole-genome sequencing dataset from recurrent hepatic lesions of an mCRC patient. Our phylogenetic analysis confirms that the treatment induced a severe demographic bottleneck in the liver metastasis but also that a previously diverged lineage survived this surgery, possibly after migration to a different site in the liver. This lineage evolved very slowly for two years under adjuvant drug therapy and diversified again in a very short period. We identified several non-silent mutations specific to this lineage and inferred a substantial contribution of chemotherapy to the overall, genome-wide mutational burden. All in all, our study suggests that mCRC subclones can migrate locally and evade resection, keep evolving despite rounds of chemotherapy, and re-expand explosively.
Phylogenomic Analyses of 2,786 Genes in 158 Lineages Support a Root of the Eukaryotic Tree of Life between Opisthokonts and All Other Lineages
Phylogenomic Analyses of 2,786 Genes in 158 Lineages Support a Root of the Eukaryotic Tree of Life between Opisthokonts and All Other Lineages
Advances in phylogenomics and high-throughput sequencing have allowed the reconstruction of deep phylogenetic relationships in the evolution of eukaryotes. Yet, the root of the eukaryotic tree of life remains elusive. The most popular hypothesis in textbooks and reviews is a root between Unikonta (Opisthokonta + Amoebozoa) and Bikonta (all other eukaryotes), which emerged from analyses of a single-gene fusion. Subsequent, highly cited studies based on concatenation of genes supported this hypothesis with some variations or proposed a root within Excavata. However, concatenation of genes does not consider phylogenetically-informative events like gene duplications and losses. A recent study using gene tree parsimony (GTP) suggested the root lies between Opisthokonta and all other eukaryotes, but only including 59 taxa and 20 genes. Here we use GTP with a duplication-loss model in a gene-rich and taxon-rich dataset (i.e., 2,786 gene families from two sets of 155 and 158 diverse eukaryotic lineages) to assess the root, and we iterate each analysis 100 times to quantify tree space uncertainty. We also contrasted our results and discarded alternative hypotheses from the literature using GTP and the likelihood-based method SpeciesRax. Our estimates suggest a root between Fungi or Opisthokonta and all other eukaryotes; but based on further analysis of genome size, we propose that the root between Opisthokonta and all other eukaryotes is the most likely.
Phylovar: toward scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data
Phylovar: toward scalable phylogeny-aware inference of single-nucleotide variations from single-cell DNA sequencing data
Single-nucleotide variants (SNVs) are the most common variations in the human genome. Recently developed methods for SNV detection from single-cell DNA sequencing data, such as SCIΦ and scVILP, leverage the evolutionary history of the cells to overcome the technical errors associated with single-cell sequencing protocols. Despite being accurate, these methods are not scalable to the extensive genomic breadth of single-cell whole-genome (scWGS) and whole-exome sequencing (scWES) data.
Single-cell mtDNA heteroplasmy in colorectal cancer
Single-cell mtDNA heteroplasmy in colorectal cancer
Human mitochondria can be genetically distinct within the same individual, a phenomenon known as heteroplasmy. In cancer, this phenomenon seems exacerbated, and most mitochondrial mutations seem to be heteroplasmic. How this genetic variation is arranged within and among normal and tumor cells is not well understood. To address this question, here we sequenced single-cell mitochondrial genomes from multiple normal and tumoral locations in four colorectal cancer patients. Our results suggest that single cells, both normal and tumoral, can carry various mitochondrial haplotypes. Remarkably, this intra-cell heteroplasmy can arise before tumor development and be maintained afterward in specific tumoral cell subpopulations. At least in the colorectal patients studied here, the somatic mutations in the single-cells do not seem to have a prominent role in tumorigenesis.
Mitochondrial genome sequencing of marine leukaemias reveals cancer contagion between clam species in the Seas of Southern Europe
Mitochondrial genome sequencing of marine leukaemias reveals cancer contagion between clam species in the Seas of Southern Europe
Clonally transmissible cancers are tumour lineages that are transmitted between individuals via the transfer of living cancer cells. In marine bivalves, leukaemia-like transmissible cancers, called hemic neoplasia (HN), have demonstrated the ability to infect individuals from different species. We performed whole-genome sequencing in eight warty venus clams that were diagnosed with HN, from two sampling points located more than 1000 nautical miles away in the Atlantic Ocean and the Mediterranean Sea Coasts of Spain. Mitochondrial genome sequencing analysis from neoplastic animals revealed the coexistence of haplotypes from two different clam species. Phylogenies estimated from mitochondrial and nuclear markers confirmed this leukaemia originated in striped venus clams and later transmitted to clams of the species warty venus, in which it survives as a contagious cancer. The analysis of mitochondrial and nuclear gene sequences supports all studied tumours belong to a single neoplastic lineage that spreads in the Seas of Southern Europe.
SARS-CoV-2 Evolution and Spike-Specific CD4+ T-Cell Response in Persistent COVID-19 with Severe HIV Immune Suppression
SARS-CoV-2 Evolution and Spike-Specific CD4+ T-Cell Response in Persistent COVID-19 with Severe HIV Immune Suppression
Intra-host evolution of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been reported in cases with persistent coronavirus disease 2019 (COVID-19). In this study, we describe a severely immunosuppressed individual with HIV-1/SARS-CoV-2 coinfection with a long-term course of SARS-CoV-2 infection. A 28-year-old man was diagnosed with HIV-1 infection (CD4+ count: 3 cells/µL nd 563000 HIV-1 RNA copies/mL) and simultaneous pneumonia, disseminated infection and SARS-CoV-2 infection. SARS-CoV-2 real-time reverse transcription polymerase chain reaction positivity from nasopharyngeal samples was prolonged for 15 weeks. SARS-CoV-2 was identified as variant Alpha (PANGO lineage B.1.1.7) with mutation S:E484K. Spike-specific T-cell response was similar to HIV-negative controls although enriched in IL-2, and showed disproportionately increased immunological exhaustion marker levels. Despite persistent SARS-CoV-2 infection, adaptive intra-host SARS-CoV-2 evolution, was not identified. Spike-specific T-cell response protected against a severe COVID-19 outcome and the increased immunological exhaustion marker levels might have favoured SARS-CoV-2 persistence.
CellPhy: accurate and fast probabilistic inference of single-cell phylogenies from scDNA-seq data
CellPhy: accurate and fast probabilistic inference of single-cell phylogenies from scDNA-seq data
We introduce CellPhy, a maximum likelihood framework for inferring phylogenetic trees from somatic single-cell single-nucleotide variants. CellPhy leverages a finite-site Markov genotype model with 16 diploid states and considers amplification error and allelic dropout. We implement CellPhy into RAxML-NG, a widely used phylogenetic inference package that provides statistical confidence measurements and scales well on large datasets with hundreds or thousands of cells. Comprehensive simulations suggest that CellPhy is more robust to single-cell genomics errors and outperforms state-of-the-art methods under realistic scenarios, both in accuracy and speed. CellPhy is freely available at https://github.com/amkozlov/cellphy .
Limited genomic reconstruction of SARS-CoV-2 transmission history within local epidemiological clusters
Limited genomic reconstruction of SARS-CoV-2 transmission history within local epidemiological clusters
A detailed understanding of how and when severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) transmission occurs is crucial for designing effective prevention measures. Other than contact tracing, genome sequencing provides information to help infer who infected whom. However, the effectiveness of the genomic approach in this context depends on both (high enough) mutation and (low enough) transmission rates. Today, the level of resolution that we can obtain when describing SARS-CoV-2 outbreaks using just genomic information alone remains unclear. In order to answer this question, we sequenced forty-nine SARS-CoV-2 patient samples from ten local clusters in NW Spain for which partial epidemiological information was available and inferred transmission history using genomic variants. Importantly, we obtained high-quality genomic data, sequencing each sample twice and using unique barcodes to exclude cross-sample contamination. Phylogenetic and cluster analyses showed that consensus genomes were generally sufficient to discriminate among independent transmission clusters. However, levels of intrahost variation were low, which prevented in most cases the unambiguous identification of direct transmission events. After filtering out recurrent variants across clusters, the genomic data were generally compatible with the epidemiological information but did not support specific transmission events over possible alternatives. We estimated the effective transmission bottleneck size to be one to two viral particles for sample pairs whose donor-recipient relationship was likely. Our analyses suggest that intrahost genomic variation in SARS-CoV-2 might be generally limited and that homoplasy and recurrent errors complicate identifying shared intrahost variants. Reliable reconstruction of direct SARS-CoV-2 transmission based solely on genomic data seems hindered by a slow mutation rate, potential convergent events, and technical artifacts. Detailed contact tracing seems essential in most cases to study SARS-CoV-2 transmission at high resolution.
Somatic variant calling from single-cell DNA sequencing data
Somatic variant calling from single-cell DNA sequencing data
Single-cell sequencing has gained popularity in recent years. Despite its numerous applications, single-cell DNA sequencing data is highly error-prone due to technical biases arising from uneven sequencing coverage, allelic dropout, and amplification error. With these artifacts, the identification of somatic genomic variants becomes a challenging task, and over the years, several methods have been developed explicitly for this type of data. Single-cell variant callers implement distinct strategies, make different use of the data, and typically result in many discordant calls when applied to real data. Here, we review current approaches for single-cell variant calling, emphasizing single nucleotide variants. We highlight their potential benefits and shortcomings to help users choose a suitable tool for their data at hand.