Viromics: Computational Analysis of Viral Genomes

Viromics, also referred to as viral metagenomics, is the field of study dedicated to the comprehensive characterization of viral communities (viromes) through the computational analysis of genetic material extracted directly from environmental, animal, or clinical samples [1]. This discipline has fundamentally transformed our understanding of viral diversity, ecology, and evolution by enabling the discovery of viruses that are recalcitrant to traditional culture-based isolation methods [2, 1]. The computational analysis of viral genomes from metagenomic data presents unique challenges distinct from those encountered in bacterial metagenomics, including high sequence diversity, the absence of universal marker genes, and the prevalence of uncharacterized "viral dark matter" [3]. This article provides an exhaustive, publication-grade overview of the computational methods, workflows, and analytical tools that constitute the core of modern viromics, with an emphasis on applications in veterinary medicine and environmental surveillance.

The Viromics Workflow: A Computational Pipeline

The computational analysis of viral genomes follows a structured pipeline, which can be subdivided into several key stages: sample preparation and sequencing, quality control and preprocessing, viral sequence identification, genome assembly, binning, taxonomic classification, host prediction, and functional annotation [4, 5, 6]. Each stage employs specific algorithms and bioinformatic tools, and the choice of these tools can profoundly impact the results [7, 8].

graph TD
    A[Sample Collection & Processing], > B[Nucleic Acid Extraction & Sequencing]
    B, > C[Quality Control & Preprocessing]
    C, > D{Viral Sequence Identification}
    D, > D1[Alignment-based (BLAST, DIAMOND)]
    D, > D2[Feature-based (VirFinder, DeepVirFinder)]
    D, > D3[Composition-based (VirSorter, VIBRANT)]
    D1 & D2 & D3, > E[Assembly of Viral Genomes]
    E, > F[Viral Genome Binning]
    F, > G[Taxonomic Classification]
    F, > H[Host Prediction]
    G & H, > I[Functional Annotation & Downstream Analysis]
    I, > J[Ecological & Evolutionary Interpretation]

1. Sample Preparation and Sequencing Strategies

The starting material for viromics is typically virus-like particles (VLPs) that are purified and concentrated from a sample matrix, such as feces, plasma, tissue homogenate, or environmental water [9, 10]. This purification step, often involving filtration and ultracentrifugation or polyethylene glycol precipitation, aims to enrich for viral particles while depleting cellular debris and non-viral nucleic acids [11, 9]. The choice between bulk metagenomic sequencing (total DNA/RNA from a sample) and VLP-enriched sequencing introduces significant biases [11]. While VLP enrichment enhances the proportion of viral reads, it can also lead to the underrepresentation of large double-stranded DNA (dsDNA) viruses and integrated prophages [11]. Bulk metagenomics captures a broader diversity, including endogenous viral elements and prophages, but requires greater sequencing depth and computational effort to detect low-abundance viruses [11, 12].

Nucleic acid extraction must be compatible with the diverse genome types of viruses (dsDNA, single-stranded DNA (ssDNA), double-stranded RNA (dsRNA), positive-sense single-stranded RNA (+ssRNA), negative-sense single-stranded RNA (-ssRNA), and retro-transcribing viruses). Separate protocols for DNA and RNA, or total nucleic acid extraction followed by enzymatic treatment (e.g., RNase-free DNase or DNase-free RNase), are often employed. For RNA virome studies, ribosomal RNA (rRNA) depletion is a critical step to avoid overwhelming the sequencing data with host or bacterial rRNA sequences, although various methods show variable efficacy [13]. The metatranscriptomic approach, which sequences the total RNA of a sample, can capture actively replicating RNA viruses and provide simultaneous host transcriptomic data [14, 15].

2. Quality Control and Preprocessing

Raw sequencing reads from high-throughput sequencers undergo a rigorous quality control (QC) process. This typically includes the removal of adapter sequences, trimming of low-quality bases, and filtering of reads based on length and quality scores using tools such as Fastp or Trimmomatic. Host genome subtraction is a critical step, where reads that align to the host genome (e.g., the chicken, bovine, or porcine reference genome) are removed to reduce the dataset size and computational load for downstream analysis. This is typically performed using sequence aligners such as Bowtie2 or BWA-MEM.

3. Viral Sequence Identification

Identifying which sequencing reads or assembled contigs are of viral origin is the most challenging step in the viromics pipeline [16]. This process relies on a combination of alignment-dependent and alignment-independent approaches [17, 18].

Alignment-dependent methods search for sequence similarity against known viral databases. Tools such as BLASTn, BLASTx, and DIAMOND (a fast protein-level aligner) are used to query sequences against databases like the NCBI RefSeq viral database, the Virus-Host DB, or the Virosaurus database [19]. The primary limitation of this approach is its reliance on previously characterized viruses. Highly divergent viruses, which represent a large fraction of the virosphere, will not be detected [20, 2, 18].

Alignment-independent or feature-based methods have been developed to overcome this limitation. These tools, including VirFinder and its successor DeepVirFinder, use machine learning (ML) models, such as deep neural networks, trained on known viral and prokaryotic genomic signatures [17]. These signatures include k-mer frequencies, dinucleotide biases, and codon usage patterns. DeepVirFinder, for instance, has demonstrated high accuracy and recall in identifying viral sequences from metagenomic data, including those that are highly divergent from known viruses [17].

Compositional and structural approaches parse genomic features such as the presence of specific protein families related to viral replication, structural proteins, or integration machinery. Tools like VirSorter2, VIBRANT, and the newer ViromeXplore [4] combine sequence similarity searches with a curated set of viral protein family (VPF) Hidden Markov Models (HMMs) and heuristic rules. These tools can identify both free viral contigs and prophage sequences integrated into host genomes. The CRESSENT toolkit provides specialized tools for the annotation and exploration of circular Rep-encoding ssDNA (CRESS-DNA) viruses, which are often missed by standard pipelines [21].

4. Genome Assembly

After the identification of viral reads or the initial step of assembly of all metagenomic reads, the goal is to reconstruct full or near-complete viral genomes from short sequencing reads. For metagenomic assembly, specialized assemblers like metaSPAdes and the dedicated Metaviral SPAdes module have been developed [22]. Metaviral SPAdes uses a multi-step approach: it first assembles all reads, then identifies viral contigs using a combination of coverage depth and k-mer signature analysis, and then performs a second round of assembly specifically using the reads that were assigned to the viral contigs, improving contiguity and completeness for viral genomes [22].

For RNA viruses, genome assembly is more complex due to the high error rates of reverse transcriptase and the need to deal with overlapping fragments. Tools like Trinity or rnaSPAdes are often used for de novo assembly of RNA-seq data. The resulting viral contigs can then be refined and curated. The use of long-read sequencing technologies can dramatically improve genome assembly, resolving repetitive regions and yielding complete genomes, particularly for phages and anelloviruses [23, 24]. Hybrid assembly strategies that combine long reads with high-accuracy short reads are becoming the gold standard for generating high-quality viral genomes [24].

5. Viral Genome Binning and Genome Recovery

Given that a single metagenomic sample can contain hundreds of viral genomes, the next step is to group assembled contigs into bins that likely originate from the same viral population (binning) [25, 6]. For bacterial metagenomics, binning relies on sequence composition and abundance patterns across multiple samples. For viruses, these features are less reliable due to the smaller genome sizes. However, tools like vRhyme address this by using a combination of sequence homology, co-abundance, and genomic coverage to bin viral contigs [26]. The MVP (Modular Viromics Pipeline) offers a modular framework that integrates several binning tools, allowing for consensus-based viral genome binning [25]. The aim is to produce viral metagenome-assembled genomes (vMAGs), which are the foundation for downstream functional and taxonomical analysis [2].

6. Taxonomic Classification and Phylogenetics

Assigning taxonomy to novel viral sequences is a major challenge. The International Committee on Taxonomy of Viruses (ICTV) has moved towards a rank-based taxonomy that is increasingly reliant on genomic and phylogenetic data. Taxonomic assignment of vMAGs is performed using a hierarchy of methods:

Hierarchical classification: Tools like ViWrap [6] and VPF-Class [27] assign taxonomy by detecting viral hallmark genes and comparing them to pre-computed taxonomic databases of viral protein families. This approach can assign sequences to known families and genera or place them in the context of proposed new higher-level taxa.

Phylogenetic placement: For more precise classification, especially at deeper evolutionary levels, phylogenetic trees are constructed using conserved marker genes. Common marker genes include the RNA-dependent RNA polymerase (RdRp) for RNA viruses [28, 29, 30], the major capsid protein (MCP) for dsDNA viruses, and the replication-associated protein (REP) for ssDNA viruses [31, 32]. The recent discovery of the "Paraxenoviridae" family of globally distributed marine dsRNA bacteriophages was validated through phylogenetic analysis of the RdRp gene, demonstrating the power of this approach for describing new higher-order taxa [28, 29]. The megataxonomy of the virosphere, a recent framework that groups viruses into realms based on the type of capsid protein and replication machinery, is itself defined by these phylogenetic relationships [33].

7. Host Prediction

Predicting the host of a novel virus is crucial for understanding its ecology, transmission patterns, and potential pathogenic impact, especially for veterinary applications [2, 6]. Several computational strategies exist, each with its own strengths and limitations:

Sequence homology: If a viral sequence shares high similarity to a virus with a known host, the host is inferred. This is the simplest but least powerful method for novel viruses.

CRISPR spacer matching: Prokaryotic CRISPR-Cas systems store short sequences (spacers) from prior viral infections. By identifying these spacers in the host genome and matching them to a viral contig, a direct and highly specific host linkage can be established [2, 6, 34].

tRNA matching: Some viruses, particularly phages, carry tRNA genes that match the codon usage bias of their host. Identifying highly similar host and viral tRNA sequences can provide evidence of a host relationship [2].

Co-abundance analysis: Across multiple metagenomic samples, the abundance profile of a virus often correlates with the abundance profile of its host. This statistical approach can predict hosts for viruses that show significant co-occurrence patterns with particular prokaryotic or eukaryotic taxa [2, 35].

Machine learning: Advanced methods like HostG and PHISDetector use a combination of genomic features, sequence composition, and network analysis to predict virus-host interactions for both prokaryotic and eukaryotic viruses [20, 2].

8. Functional Annotation

Functional annotation assigns putative functions to predicted viral open reading frames (ORFs). This is typically performed using sequence similarity searches against databases of known protein functions, such as the NCBI Conserved Domain Database (CDD), Pfam, or the KEGG Orthology (KO) database. For veterinary viromics, annotation of virulence factors, auxillary metabolic genes (AMGs), and genes involved in immune modulation is of particular interest [36]. The discovery of histone modification mimicry motifs in viral proteins through virome-wide analysis is a recent example of how functional annotation can reveal sophisticated host manipulation strategies [37]. Metabolic modeling approaches can integrate this functional data to predict the molecular transactions between viruses and their hosts within a system, such as the rumen microbiome [38].

The Scope of Veterinary Viromics

The application of viromics in veterinary medicine spans several critical areas, building upon the foundational computational methods described above.

Pathogen Discovery and Surveillance: Viromics is a powerful tool for unbiased detection of novel and emerging viral pathogens in livestock, poultry, and companion animals. Studies have used viromics to characterize the virome of swine on backyard farms, revealing a high diversity of known and novel viruses [39]. Similarly, virome analysis of field-collected chilli plants uncovered a surprising diversity of plant viruses, highlighting the role of agricultural crops as viral reservoirs [40]. Surveillance in wildlife, such as bats, is critical for pandemic preparedness, as these animals harbor a wide range of viruses with zoonotic potential [41]. The ability to detect viruses at a single-cell resolution is emerging as a powerful method to link viral presence directly to host gene expression changes, providing insights into host-pathogen interactions at an unprecedented level [42, 43].

Understanding Phage Ecology and Host Interaction: The gut virome, predominantly composed of bacteriophages, exerts a profound influence on the composition and function of the gut microbiome [44, 12, 45]. Viromics allows for the detailed analysis of these phage communities, including the prediction of their bacterial hosts (e.g., Bacteroides and Phocaeicola species) [35, 46, 47]. The discovery of the globally abundant CrAss-like phages, which exclusively infect bacteria of the Bacteroidetes phylum, is a landmark achievement of viromics [12, 47, 48]. This knowledge is foundational for developing phage therapy strategies against bacterial pathogens in production animals, such as necrotic enteritis in broiler chickens [49].

Viral Evolution and Metagenomic Context: Large-scale viromics projects are dramatically expanding the known virosphere. The estimated total genome and protein space of viruses is many times larger than that of cellular life, with much of it remaining unexplored [50]. The discovery of novel viral families, such as the "Paraxenoviridae" from marine metagenomes, underscores how computational analysis can redefine our understanding of the origins and evolution of RNA viruses [28, 29, 30]. Within the veterinary context, understanding the evolutionary dynamics of viruses like highly pathogenic avian influenza (HPAI) or porcine reproductive and respiratory syndrome virus (PRRSV) through a metagenomic lens can reveal transmission pathways and the emergence of new variants. The study of recombination dynamics in viruses, such as adenovirus D, is facilitated by robust computational analysis [31].

Challenges and Biases in Viromics

Despite its power, computational viromics is fraught with challenges and biases that must be carefully considered.

Database Bias: Most viral sequence databases are heavily biased towards culturable viruses that cause disease in humans and economically important species. This results in a profound underrepresentation of environmental and commensal viruses [20, 2, 3].

Computational Bias: The choice of bioinformatic tools significantly impacts the results. For example, replication of wastewater virome analyses across different tools revealed substantial inconsistencies and disparities in the viral communities detected [7]. Benchmarking studies are essential to guide tool selection for specific research questions [51, 16].

Annotation Challenges: A large proportion of predicted ORFs in viral genomes have no known function, a phenomenon termed "viral dark matter" [3]. Accurate annotation of these genes, particularly for divergent viruses, remains a significant bottleneck.

Protocol Heterogeneity: The lack of standardized protocols for sample processing, sequencing, and data analysis impedes comparability across studies [52]. Efforts to improve the reporting of metagenomic virome-scale data, such as the Minimum Information about an Uncultivated Virus Genome (MIUViG) standard, aim to address this issue [52].

Conclusion

Viromics is an indispensable discipline in modern virology. The computational analysis of viral genomes, from raw sequencing reads to fully annotated vMAGs, requires a sophisticated, multi-step workflow that integrates alignment, machine learning, assembly algorithms, and phylogenetic inference. The application of these methods to veterinary science is leading to the discovery of novel pathogens, a richer understanding of viral ecology within animal hosts, and the development of new strategies for disease surveillance and control. As sequencing technologies continue to advance and computational tools become more refined, viromics will remain at the forefront of exploring the vast, largely uncharted viral universe.

References

[1] Roux S, Coclet C. Viromics approaches for the study of viral diversity and ecology in microbiomes. Nat Rev Genet. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/40691354/

[2] Werner A, Chibani CM, Schmitz RA. Navigating prokaryotic viral genome analysis from metagenomic data. mSystems. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42012165/

[3] Aevarsson A, Kaczorowska AK, Adalsteinsson BT, et al. Going to extremes - a metagenomic journey into the dark matter of life. FEMS Microbiol Lett. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34114607/

[4] Hernández-Velázquez R, Ziemski M, Bokulich NA. ViromeXplore: integrative workflows for complete and reproducible virome characterization. Brief Bioinform. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/41348596/

[5] Roach MJ, Beecroft SJ, Mihindukulasuriya KA, et al. Hecatomb: an integrated software platform for viral metagenomics. Gigascience. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38832467/

[6] Zhou Z, Martin C, Kosmopoulos JC, et al. ViWrap: A modular pipeline to identify, bin, classify, and predict viral-host relationships for viruses from metagenomes. Imeta. 2023. URL: https://pubmed.ncbi.nlm.nih.gov/38152703/

[7] Santos AFB, Nunes M, Filipa-Silva A, et al. Wastewater Metavirome Diversity: Exploring Replicate Inconsistencies and Bioinformatic Tool Disparities. Int J Environ Res Public Health. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40427823/

[8] Martínez-Puchol S, Rusiñol M, Fernández-Cassi X, et al. Characterisation of the sewage virome: comparison of NGS tools and occurrence of significant pathogens. Sci Total Environ. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/31955099/

[9] Gu Liu C, Thompson BE, Chang JD, et al. Construction and characterization of DNA libraries from cultured phages and environmental viromes. Appl Environ Microbiol. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39315792/

[10] Larsen F, Jakobsen RR, Mao X, et al. Purification and Up-Concentration of Bacteriophages and Viruses from Fecal Samples. Methods Mol Biol. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/37966594/

[11] Li Y, Sun C, Zhu J, et al. Biases and complementarity in gut viromes obtained from bulk and virus-like particle-enriched metagenomic sequencing. Microbiol Spectr. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40600714/

[12] Benler S, Yutin N, Antipov D, et al. Thousands of previously unknown phages discovered in whole-community human gut metagenomes. Microbiome. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33781338/

[13] Shiwa Y, Baba T, Sierra MA, et al. Evaluation of rRNA depletion methods for capturing the RNA virome from environmental surfaces. BMC Res Notes. 2023. URL: https://pubmed.ncbi.nlm.nih.gov/37420286/

[14] Rajagopala SV, Bakhoum NG, Pakala SB, et al. Metatranscriptomics to characterize respiratory virome, microbiome, and host response directly from clinical samples. Cell Rep Methods. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34790908/

[15] Chen XX, Wu WC, Shi M. Discovery and Characterization of Actively Replicating DNA and Retro-Transcribing Viruses in Lower Vertebrate Hosts Based on RNA Sequencing. Viruses. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34072878/

[16] Glickman C, Hendrix J, Strong M. Simulation study and comparative evaluation of viral contiguous sequence identification tools. BMC Bioinformatics. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34130621/

[17] Mo Y, Ahlgren N, Fuhrman JA, et al. A Beginner's Guide to Using DeepVirFinder for Viral Sequence Identification From Metagenomic Datasets. Curr Protoc. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41609929/

[18] Dorlass EG, Amgarten DE. Bioinformatic Approaches for Comparative Analysis of Viruses. Methods Mol Biol. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38819566/

[19] Gleizes A, Laubscher F, Guex N, et al. Virosaurus A Reference to Explore and Capture Virus Genetic Diversity. Viruses. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/33139591/

[20] Kosmopoulos JC, Anantharaman K. Computational Microbial and Viral Ecology Analysis. Methods Mol Biol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/42156652/

[21] Pavan RR, Sullivan MB, Tisza MJ. CRESSENT: a bioinformatics toolkit to explore and improve ssDNA virus annotation. Microb Genom. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41642221/

[22] Antipov D, Raiko M, Lapidus A, et al. Metaviral SPAdes: assembly of viruses from metagenomic data. Bioinformatics. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32413137/

[23] Anantharam R, Duchen D, Cox AL, et al. Long-Read Nanopore-Based Sequencing of Anelloviruses. Viruses. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38793605/

[24] Chen J, Sun C, Dong Y, et al. Efficient Recovery of Complete Gut Viral Genomes by Combined Short- and Long-Read Sequencing. Adv Sci (Weinh). 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38240578/

[25] Coclet C, Camargo AP, Roux S. MVP: a modular viromics pipeline to identify, filter, cluster, annotate, and bin viruses from metagenomes. mSystems. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39352141/

[26] Kieft K, Adams A, Salamzade R, et al. vRhyme enables binning of viral genomes from metagenomes. Nucleic Acids Res. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/35544285/

[27] Pons JC, Paez-Espino D, Riera G, et al. VPF-Class: taxonomic assignment and host prediction of uncultivated viruses based on viral protein families. Bioinformatics. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33471063/

[28] Yoshida M, Medvedeva S, Fukudome A, et al. "Paraxenoviridae", a putative family of globally distributed marine bacteriophages with double-stranded RNA genomes. ISME J. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40613633/

[29] Yoshida M, Medvedeva S, Fukudome A, et al. "Paraxenoviridae", a putative family of ubiquitous marine bacteriophages with double-stranded RNA genomes. bioRxiv. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40568107/

[30] Wolf YI, Silas S, Wang Y, et al. Doubling of the known set of RNA viruses by metagenomic analysis of an aquatic virome. Nat Microbiol. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32690954/

[31] Park A, Lee C, Lee JY. Genomic Evolution and Recombination Dynamics of Human Adenovirus D Species: Insights from Comprehensive Bioinformatic Analysis. J Microbiol. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38451451/

[32] Butkovic A, Kraberger S, Smeele Z, et al. Evolution of anelloviruses from a circovirus-like ancestor through gradual augmentation of the jelly-roll capsid protein. Virus Evol. 2023. URL: https://pubmed.ncbi.nlm.nih.gov/37325085/

[33] Koonin EV, Kuhn JH, Dolja VV, et al. Megataxonomy and global ecology of the virosphere. ISME J. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38365236/

[34] Soto-Perez P, Bisanz JE, Berry JD, et al. CRISPR-Cas System of a Prevalent Human Gut Bacterium Reveals Hyper-targeting against Phages in a Human Virome Catalog. Cell Host Microbe. 2019. URL: https://pubmed.ncbi.nlm.nih.gov/31492655/

[35] Stopnisek N, Hedžet S, Accetto T, et al. Insights into diversity, host-range, and temporal stability of Bacteroides and Phocaeicola prophages. BMC Microbiol. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40011806/

[36] Silveira CB, Coutinho FH, Cavalcanti GS, et al. Genomic and ecological attributes of marine bacteriophages encoding bacterial virulence genes. BMC Genomics. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32024463/

[37] Xiao Y, Yuan S, Qiu Y, et al. Virome-wide analysis of histone modification mimicry motifs carried by viral proteins. Virol Sin. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39293541/

[38] Islam MM, Fernando SC, Saha R. Metabolic Modeling Elucidates the Transactions in the Rumen Microbiome and the Shifts Upon Virome Interactions. Front Microbiol. 2019. URL: https://pubmed.ncbi.nlm.nih.gov/31866953/

[39] Chauhan RP, San JE, Gordon ML. Metagenomic Analysis of RNA Fraction Reveals the Diversity of Swine Oral Virome on South African Backyard Swine Farms in the uMgungundlovu District of KwaZulu-Natal Province. Pathogens. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/36015047/

[40] Netla VR, Hiremath S, Muttappagol M, et al. Virome analysis of field-collected chilli samples reveals diverse viruses. Virol J. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40275264/

[41] Li Z, Tang C, Li Y, et al. Virome survey of the bat, Rhinolophus affinis, in Hainan Province, China. Microbes Infect. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38537769/

[42] Luebbert L, Sullivan DK, Carilli M, et al. Detection of viral sequences at single-cell resolution identifies novel viruses associated with host gene expression changes. Nat Biotechnol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/40263451/

[43] Luebbert L, Sullivan DK, Carilli M, et al. Efficient and accurate detection of viral sequences at single-cell resolution reveals putative novel viruses perturbing host gene expression. bioRxiv. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/38168363/

[44] Zakharevich N, Strokach A, Shitikov E, et al. Bacteriophages in gut metagenomes: from analysis to application. Virol J. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41572308/

[45] Camarillo-Guerrero LF, Almeida A, Rangel-Pineros G, et al. Massive expansion of human gut bacteriophage diversity. Cell. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33606979/

[46] Papudeshi B, Vega AA, Souza C, et al. Host interactions of novel Crassvirales species belonging to multiple families infecting bacterial host, Bacteroides cellulosilyticus WH2. bioRxiv. 2023. URL: https://pubmed.ncbi.nlm.nih.gov/36945541/

[47] Yutin N, Benler S, Shmakov SA, et al. Analysis of metagenome-assembled viral genomes from the human gut reveals diverse putative CrAss-like phages with unique genomic features. Nat Commun. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33594055/

[48] Koonin EV, Yutin N. The crAss-like Phage Group: How Metagenomics Reshaped the Human Virome. Trends Microbiol. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32298613/

[49] Fujimoto K, Kimura Y, Shimohigoshi M, et al. Metagenome Data on Intestinal Phage-Bacteria Associations Aids the Development of Phage Therapy against Pathobionts. Cell Host Microbe. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32652061/

[50] Lu C, Wu Y, Zhang Z, et al. Estimated size of the total genome and protein space of viruses. mSphere. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/39998179/

[51] Arizmendi Cárdenas YO, Neuenschwander S, Malaspinas AS. Benchmarking metagenomics classifiers on ancient viral DNA: a simulation study. PeerJ. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/35356467/

[52] Chang WS, Harvey E, Mahar JE, et al. Improving the reporting of metagenomic virome-scale data. Commun Biol. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39706917/

[53] Marra M, Rotunno S, Frascati F, et al. The analysis of the virome associated with Freesia refracta plants with necrotic disorder sheds new light on the phylogenetic relationships in the Konkoviridae and Yueviridae families. Virol J. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41821017/

[54] Santos RMF, Ferreira LYM, Santos JPN, et al. Metaviromic insights into the viral community associated with Dendrobium catenatum. Braz J Microbiol. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41604022/

[55] Yang F, Xiong L, Li M, et al. A signature-protein-based approach for accurate and efficient profiling of the human gut virome. Cell Rep Methods. 2026. URL: https://pubmed.ncbi.nlm.nih.gov/41365303/

[56] Zhang Y, Mei T, Wang M, et al. Analysis of the overall development trends and hotspots in the research field of the human gut virome. Virol J. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40618116/

[57] Zielezinski A, Gudyś A, Barylski J, et al. Ultrafast and accurate sequence alignment and clustering of viral genomes. Nat Methods. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40374946/

[58] Popov N, Sonets I, Evdokimova A, et al. AliMarko: A Pipeline for Virus Identification Using an Expert-Guided Approach. Viruses. 2025. URL: https://pubmed.ncbi.nlm.nih.gov/40143285/

[59] Ferreira LYM, de Sousa AG, Silva JL, et al. Characterization of the Virome Associated with the Ubiquitous Two-Spotted Spider Mite, Tetranychus urticae. Viruses. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39459865/

[60] Rahlff J, Westmeijer G, Weissenbach J, et al. Surface microlayer-mediated virome dissemination in the Central Arctic. Microbiome. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/39449105/

[61] Conradie T, Caparros-Martin JA, Egan S, et al. Exploring the Complexity of the Human Respiratory Virome through an In Silico Analysis of Shotgun Metagenomic Data Retrieved from Public Repositories. Viruses. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38932245/

[62] Keehner J, Abeles SR, Longhurst CA, et al. Integrated Genomic and Social Network Analyses of SARS-CoV-2 Transmission in the Healthcare Setting. Clin Infect Dis. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38227643/

[63] Rampelli S, Turroni S, Candela M. From Whole Genome Shotgun Sequencing to Viral Community Profiling: The ViromeScan Tool. Methods Mol Biol. 2024. URL: https://pubmed.ncbi.nlm.nih.gov/38060115/

[64] Walt HK, Kooienga E, Cammack JA, et al. Bioinformatic Surveillance Leads to Discovery of Two Novel Putative Bunyaviruses Associated with Black Soldier Fly. Viruses. 2023. URL: https://pubmed.ncbi.nlm.nih.gov/37631997/

[65] Ghorbani M, Ferreira D, Maioli S. A metagenomic study of gut viral markers in amyloid-positive Alzheimer's disease patients. Alzheimers Res Ther. 2023. URL: https://pubmed.ncbi.nlm.nih.gov/37608325/

[66] Tong CCL, Lin X, Seckar T, et al. A metagenomic analysis of the virome of inverted papilloma and squamous cell carcinoma. Int Forum Allergy Rhinol. 2023. URL: https://pubmed.ncbi.nlm.nih.gov/37189250/

[67] Andrade-Martínez JS, Camelo Valera LC, Chica Cárdenas LA, et al. Computational Tools for the Analysis of Uncultivated Phage Genomes. Microbiol Mol Biol Rev. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/35311574/

[68] Tripathi G, Sharma N, Bindal V, et al. Protocol for global proteome, virome, and metaproteome profiling of respiratory specimen (VTM) in COVID-19 patient by LC-MS/MS-based analysis. STAR Protoc. 2022. URL: https://pubmed.ncbi.nlm.nih.gov/34870243/

[69] Van Espen L, Bak EG, Beller L, et al. A Previously Undescribed Highly Prevalent Phage Identified in a Danish Enteric Virome Catalog. mSystems. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34665009/

[70] Diaz-Lara A, Stevens KA, Klaassen V, et al. Sequencing a Strawberry Germplasm Collection Reveals New Viral Genetic Diversity and the Basis for New RT-qPCR Assays. Viruses. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34452308/

[71] Shen-Gunther J, Xia Q, Cai H, et al. HPV DeepSeq: An Ultra-Fast Method of NGS Data Analysis and Visualization Using Automated Workflows and a Customized Papillomavirus Database in CLC Genomics Workbench. Pathogens. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34451490/

[72] Wallace MA, Coffman KA, Gilbert C, et al. The discovery, distribution, and diversity of DNA viruses associated with Drosophila melanogaster in Europe. Virus Evol. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34408913/

[73] Qiao Y, Li S, Zhang J, et al. Integrated diversity and shared species analyses of human viromes. Arch Virol. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34327587/

[74] Jo Y, Back CG, Kim KH, et al. Using RNA-Sequencing Data to Examine Tissue-Specific Garlic Microbiomes. Int J Mol Sci. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/34202675/

[75] Sidharthan VK, Kalaivanan NS, Baranwal VK. Discovery of putative novel viruses in the transcriptomes of endangered plant species native to India and China. Gene. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33798682/

[76] Kim KW, Deveson IW, Pang CNI, et al. Respiratory viral co-infections among SARS-CoV-2 cases confirmed by virome capture sequencing. Sci Rep. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33594223/

[77] Fujimoto K, Kimura Y, Allegretti JR, et al. Functional Restoration of Bacteriomes and Viromes by Fecal Microbiota Transplantation. Gastroenterology. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/33577875/

[78] Hufsky F, Beerenwinkel N, Meyer IM, et al. The International Virus Bioinformatics Meeting 2020. Viruses. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/33291220/

[79] Schuele L, Cassidy H, Lizarazo E, et al. Assessment of Viral Targeted Sequence Capture Using Nanopore Sequencing Directly from Clinical Samples. Viruses. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/33260903/

[80] Altmäe S, Molina NM, Sola-Leyva A. Omission of non-poly(A) viral transcripts from the tissue level atlas of the healthy human virome. BMC Biol. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/33239005/

[81] Zhang M, Huang Y, Godson DL, et al. Assessment of Metagenomic Sequencing and qPCR for Detection of Influenza D Virus in Bovine Respiratory Tract Samples. Viruses. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32731471/

[82] Khan Mirzaei M, Xue J, Costa R, et al. Challenges of Studying the Human Virome - Relevant Emerging Technologies. Trends Microbiol. 2021. URL: https://pubmed.ncbi.nlm.nih.gov/32622559/

[83] Parry R, Naccache F, Ndiaye EH, et al. Identification and RNAi Profile of a Novel Iflavirus Infecting Senegalese Aedes vexans arabiensis Mosquitoes. Viruses. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32295109/

[84] Birnberg L, Temmam S, Aranda C, et al. Viromics on Honey-Baited FTA Cards as a New Tool for the Detection of Circulating Viruses in Mosquitoes. Viruses. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/32121402/

[85] Du J, Li F, Han Y, et al. Characterization of viromes within mosquito species in China. Sci China Life Sci. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/31834603/

[86] Mostert I, Visser M, Gazendam I, et al. Complete genome sequence of a novel polerovirus in Ornithogalum thyrsoides from South Africa. Arch Virol. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/31781858/

[87] Ren Y, Xu Y, Lee WM, et al. In-depth serum virome analysis in patients with acute liver failure with indeterminate etiology. Arch Virol. 2020. URL: https://pubmed.ncbi.nlm.nih.gov/31741097/

[88] Ma Y, Marais A, Lefebvre M, et al. Phytovirome Analysis of Wild Plant Populations: Comparison of Double-Stranded RNA and Virion-Associated Nucleic Acid Metagenomic Approaches. J Virol. 2019. URL: https://pubmed.ncbi.nlm.nih.gov/31597769/

[89] Zhang Z, Cai Z, Tan Z, et al. Rapid identification of human-infecting viruses. Transbound Emerg Dis. 2019. URL: https://pubmed.ncbi.nlm.nih.gov/31373773/

[90] Deng L, Silins R, Castro-Mejía JL, et al. A Protocol for Extraction of Infective Viromes Suitable for Metagenomics Sequencing from Low Volume Fecal Samples. Viruses. 2019. URL: https://pubmed.ncbi.nlm.nih.gov/31330855/

[91] LaPierre N, Mangul S, Alser M, et al. MiCoP: microbial community profiling method for detecting viral and fungal organisms in metagenomic samples. BMC Genomics. 2019. URL: https://pubmed.ncbi.nlm.nih.gov/31167634/