Skip to main content
Advertisement
Browse Subject Areas
?

Click through the PLOS taxonomy to find articles in your field.

For more information about PLOS Subject Areas, click here.

  • Loading metrics

Metagenomic analyses of 7000 to 5500 years old coprolites excavated from the Torihama shell-mound site in the Japanese archipelago

  • Luca Nishimura,

    Roles Conceptualization, Formal analysis, Funding acquisition, Investigation, Methodology, Software, Visualization, Writing – original draft

    Affiliations Human Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka, Japan, Department of Genetics, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka, Japan

  • Akio Tanino,

    Roles Data curation, Investigation, Methodology

    Affiliation Kitasato University Graduate School of Medical Sciences, Sagamihara, Kanagawa, Japan

  • Mayumi Ajimoto,

    Roles Resources

    Affiliation Wakasa History Museum, Obama, Fukui, Japan

  • Takafumi Katsumura,

    Roles Supervision

    Affiliation Department of Anatomy, Kitasato University School of Medicine, Sagamihara, Kanagawa, Japan

  • Motoyuki Ogawa,

    Roles Supervision

    Affiliation Department of Anatomy, Kitasato University School of Medicine, Sagamihara, Kanagawa, Japan

  • Kae Koganebuchi,

    Roles Supervision

    Affiliation Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan

  • Daisuke Waku,

    Roles Methodology, Supervision

    Affiliation Department of International Agricultural Development, Faculty of International Agriculture and Food Studies, Tokyo University of Agriculture, Tokyo, Japan

  • Masahiko Kumagai,

    Roles Methodology, Supervision, Writing – review & editing

    Affiliation Research Center for Advanced Analysis, National Agriculture and Food Research Organization, Tsukuba, Ibaraki, Japan

  • Ryota Sugimoto,

    Roles Funding acquisition, Methodology, Validation, Writing – review & editing

    Affiliation Department of Genetics, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka, Japan

  • Hirofumi Nakaoka,

    Roles Investigation, Writing – review & editing

    Affiliation Department of Cancer Genome Research, Sasaki Institute, Tokyo, Japan

  • Hiroki Oota,

    Roles Funding acquisition, Project administration, Supervision, Validation, Writing – review & editing

    Affiliation Department of Biological Sciences, Graduate School of Science, University of Tokyo, Tokyo, Japan

  • Ituro Inoue

    Roles Conceptualization, Funding acquisition, Project administration, Supervision, Writing – review & editing

    itinoue@nig.ac.jp

    Affiliations Human Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka, Japan, Department of Genetics, School of Life Science, The Graduate University for Advanced Studies (SOKENDAI), Mishima, Shizuoka, Japan

Abstract

Coprolites contain various kinds of ancient DNAs derived from gut micro-organisms, viruses, and foods, which can help to determine the gut environment of ancient peoples. Their genomic information should be helpful in elucidating the interaction between hosts and microbes for thousands of years, as well as characterizing the dietary behaviors of ancient people. We performed shotgun metagenomic sequencing on four coprolites excavated from the Torihama shell-mound site in the Japanese archipelago. The coprolites were found in the layers of the Early Jomon period, corresponding stratigraphically to 7000 to 5500 years ago. After shotgun sequencing, we found that a significant number of reads showed homology with known gut microbe, viruses, and food genomes typically found in the feces of modern humans. We detected reads derived from several types of phages and their host bacteria simultaneously, suggesting the coexistence of viruses and their hosts. The food genomes provide biological evidence for the dietary behavior of the Jomon people, consistent with previous archaeological findings. These results indicate that ancient genomic analysis of coprolites is useful for understanding the gut environment and lifestyle of ancient peoples.

Introduction

Fossils and bones can provide insights into the physical characteristics and habitat of ancient organisms from over tens of thousands of years ago. Moreover, genome information from such remains could be available if the endogenous DNAs are in a good state of preservation. Ancient DNA analysis has recently developed into the field of ancient genomics with the development of massive sequencing technologies. Ancient genomic studies have offered important information, such as the lineages, migration, and history of human populations [16]. Biological remains, such as dental stones and coprolites, contain DNAs derived from the intraoral and enteral microbes of ancient humans and/or archaic hominins [7, 8]. As the microbes in the body have been highlighted as playing roles in the health and disease of individuals, analyses of ancient microbial genomes could provide indicators of lifestyles, such as hunter–gathering or farming [9], and pathogenic states, such having Hepatitis B viruses, Yersinia pestis, or others [1012].

Previously, Nishimura et al. (2021) have reported the ancient viral sequences found in the dental pulps of Jomon individuals [13]. The Jomon people were inhabitants in the Japanese archipelago about 16,000 to 2900 years ago [14]. In particular, a complete sequence of siphovirus contig89 was reported from dental pulp, providing the evolutional process of the virus. To comprehend the microbiomes of the Jomon people, we analyzed coprolites (7000–5500 years ago) excavated from the Torihama shell-mound site, located in the middle of the Japanese archipelago. Significant numbers of reads obtained by shotgun sequencing were aligned to known microbial genomes reported in the modern human gut environment. We detected viruses (mostly phages) and their host-derived sequences simultaneously, in order to analyze the host–viral relationships that seem to have been maintained for more than thousands of years. We also obtained genomic sequences from what appears to be ingested foods, such as salmon.

Materials and methods

Sampling

More than 400 coprolites were excavated from the Torihama shell-mound site in Fukui Prefecture, Japan, during the 1970s [15]. We sampled ten of these, which were estimated to belong the Early Jomon period (7000–5500 BP) according to the stratigraphy. We used four of the ten coprolites, which showed relatively high-concentration NGS libraries, for metagenome shotgun sequencing analyses (S1 Table). All necessary permits were obtained for the described study, which complied with all relevant regulations. We obtained research material usage approval from the Wakasa History Museum in Fukui Prefecture, Japan, on July 26, 2018.

DNA extraction and library preparation

DNA extraction and library preparation were carried out in a clean-room facility dedicated to ancient DNA research at Kitasato University and University of Tokyo. Pieces weighing 2–5 g were extracted from each coprolite sample using a sterilized surgical knife. The outer surface of the coprolite sample was removed (with 1–2 mm thickness), after which UV radiation of the sample was performed for 30 minutes, in order to reduce the exogeneous DNA contamination risk. Coprolite samples were frozen with liquid nitrogen and crushed into a fine powder using a ShakeMaster Auto ver.2.0 (BioMedical Science), and DNA extraction was conducted using NucleoSpin DNA Stool (MACHEREY-NAGEL) with 0.08–0.1 g of powdered sample. Four DNA extracts from TH55, TH58, TH62, and TH74 containing 5.80 ng, 4.46 ng, 15.54 ng, and 5.36 ng of DNA, respectively, were proceeded to library preparation for metagenomic sequencing. Library construction for the Illumina sequencer was conducted using an NEBNext Ultra II DNA Library Prep Kit for Illumina and NEBNext Multiplex Oligos for Illumina (96 Unique Dual Index Primer Pairs; New England Biolabs) with 1 ng of DNA template. After size selection for excluding larger DNA fractions containing possible contaminant DNAs with x0.9 volume of AMPure XP Beads (Beckman Coulter Genomics, USA), library amplification was performed, with 10 to 12 cycles. Sequencing was performed on the Illumina NovaSeq6000 platform using a 150 bp paired-end dual-index run with 300 cycles.

Data pre-processing

First, we removed adapters and phiX174 spiked in reads using BBDuk with the following options: ftm = 5; qtrim = rl; trimq = 20; ftl = 15; ref = adapters,phix; ktrim = r; k = 23; mink = 11; hdist = 1; tpe tbo maq = 20. The human-derived reads were aligned to the hg19 human reference genome using the Burrows-Wheeler Aligner (BWA) 0.7.17 [16] and discarded. The human-decontaminated reads were then deduplicated using Dedupe. Finally, the remained reads were error-corrected with Tadpole using the mode = correct and k = 50 options. BBDuk, Dedupe, and Tadpole were provided by BBTools 38.49 (BBMap; Bushnell, B.; sourceforge.net/projects/bbmap/).

Viral search

We collected viral sequences, including bacteriophages, from the NCBI viral RefSeq (released September 2020) [17, 18] and the Millard labs PHAge REference Database (INPHARED; July 2019) [19]. We also utilized gut viral sequences from the human Gut Virome Database (GVD) [20] and an in-house data set of CRISPR targeted viral sequences constructed from human gut metagenomic data [21]. All pre-processed reads were aligned to these viral reference genomes using the BWA. After alignment, reads with mapping quality below 25 and duplicated ones were filtered out using SAMtools 1.10 [22]. The read length distribution and coverages were analyzed using BEDtools 2.29.2 [23]. We then calculated the post-mortem degradation score (PMDS) for each filtered read using PMDtools, in order to distinguish the ancient DNA from modern contaminants [24]. The viral classification was analyzed based on the taxonomic classification of the GVD. We used the old taxonomic classification information in the GVD, although several major families such as Siphoviridae, Myoviridae, and Podoviridae were removed from the International Committee on Taxonomy of Viruses (ICTV) classification in August 2022 [25].

Detection of viral host genomes and food remnants

We utilized the genomes of 19 host bacterial species of the viruses detected in the previous study [21]. Genomic assembly data of Acanthamoeba castellanii (GenBank assembly accession was GCA_000313135.1) was downloaded from the NCBI database. We also downloaded 4644 human gut prokaryotic genomes and their taxonomic information from the Unified Human Gastrointestinal Genome (UHGG) collection [26]. A maximum-likelihood phylogenetic tree of the 4616 bacterial species detected in the human gut was also downloaded from UHGG collection. The phylogenetic tree was visualized by iTOL v6 and added the gray bar graphs in the outermost layer, representing the reads ratio aligned to bacterial genomes in all four samples [27]. To analyze food remnants, we searched candidate sequences based on the BLAST search using NCBI non-redundant nucleotide database (downloaded on 23 February 2021) [17, 28]. According to the BLAST results, we downloaded genome assembly data of Oncorhynchus nerka (RefSeq assembly accession was GCF_006149115) and Vigna angularis (RefSeq assembly accession was GCF_001190045). All pre-processed reads were mapped to those reference sequences and filtered as previously described.

Results and discussion

We extracted DNA from four coprolites found at an archaeological site from the Early Jomon period and conducted shotgun metagenomic sequences using the Illumina NovaSeq6000. We obtained 4,944,613,662 reads in total (S2 Table). After pre-processing (including human decontamination), 3,188,801,202 reads remained, which were utilized for downstream analyses. During human decontamination, the human genome (hg19) alignment results revealed that the average and median alignment lengths were extremely short—both around 20 bp—reflecting serious fragmentation of host DNAs in the fecal sample over time (S3 Table). About 1% of the deduplicated reads showed post-mortem signatures, based on a postmortem degradation score (PMDS) greater than 3 [24].

We first analyzed the viral genomes. Viral genomes are typically shorter than those of other organisms; therefore, it was considered more efficient to detect their genomes with highly fragmented reads. We aligned the reads to viral genomes obtained from the modern human gut environment, in order to focus on the viruses existing in the gut and avoid soil contamination. We utilized 33,242 unique viral sequences and taxonomic information from the Gut Virome Database (GVD) [20], detecting 17,919 GVD-derived sequences with postmortem degradation (Table 1). Based on the taxonomic classification of the GVD, the results demonstrated that most of the viral sequences originated from bacteriophages (Fig 1A). Within the bacteriophages, the Siphoviridae, Myoviridae, and Podoviridae families were abundant (Fig 1B), which are also observed in the gut virome of present-day humans [20, 29, 30]. In the GVD alignment results, we removed the sequence reads that were filtered out for PMDS and showed larger than 30 bp average and median alignment length, as these reads might be derived from contamination, based on the average and median read length in human genome alignment. The maximum coverage was 77%, and 31 viral species genomes showed more than 50% coverage (S4 Table). The average alignment length was about 20 bp, reflecting the highly fragmented DNA, which might be related to the state of preservation and gut environment. Nevertheless, a vast number of reads from the Jomon coprolites were mapped to gut viral sequences. Although sequence variations were not detectable, due to the low depth of alignments, there might be similar or closely related viral species with the present-day gut viruses as those in the gut of Jomon people. To confirm this hypothesis, the additional sequencing of well preserved coprolites DNA might be required to obtain the species-level viral genomic sequences and to compare the modern viral genomes. Based on the previous reports, the gut viral composition differs across the life stages of human [20]. For example, eukaryotic viruses are more prevalent in infancy, due to the undeveloped immune system, but remain low throughout the rest of life. Our data illustrated a high abundance of bacteriophages, especially Siphoviridae viruses. Therefore, the coprolites might reflect the ancient gut environment of the Jomon people.

thumbnail
Fig 1. Viral components of Jomon people’s coprolites.

(A) The pie chart indicates the ratio of archaeal viruses, eukaryotic viruses, and phages based on reads alignment results. The taxonomic information of GVD was utilized for classification. (B) Bar charts show the taxonomic composition of each viral group based on an abundance of each viral family-derived reads. The colored families indicate the top three abundant ones. The light gray bar contains reads of all other known viral families, and the dark gray bar contains reads of unclassified viruses.

https://doi.org/10.1371/journal.pone.0295924.g001

thumbnail
Table 1. Number of detected gut viral and bacterial genomes.

https://doi.org/10.1371/journal.pone.0295924.t001

As the detected viral classification indicated the existence of eukaryotic viral genomes in the Jomon coprolites, we examined the eukaryotic viral genomes registered in the NCBI database. We observed reads that aligned to several human viral genomes and giant viral genomes in the coprolite data as shown in Table 2. Meanwhile, we detected the reads showing homology with human pathogenic viruses, such as human betaherpesvirus 5 and human adenovirus F, in all the samples indicating the existence of these viruses in the bodies of the Jomon people.

Two different giant viral genomes—Pandoravirus and Mollivirus—infecting Acanthamoeba castellanii were detected (Table 2). Giant viruses have been discovered since 2003 [31], which have large genomes (>100 kb) and more than 500 protein-coding sequences [3234]. The hygiene environments of the Jomon are not yet well known, but the detection of the giant viral genome in the Jomon coprolite may possibly suggest that they had poorer hygiene than modern humans. We also attempted to detect Acanthamoeba genomes, which are the hosts of the giant viruses, and more than one million reads were aligned to the amoeba genome with more than 10% coverage, indicating the existence of amoeba genomes in the Jomon coprolites as shown in Table 3.

These results indicated the possibility of simultaneously detecting host and viral genomes in the coprolite samples. We attempted to detect both gut phages and their bacterial host genomes, in order to examine their cohabitation. Based on our previous works, we detected 11,391 terminally redundant sequences targeted by host clustered regularly interspaced short palindromic repeats (CRISPR) immunological memory [21]. As prokaryotic cells can memorize previously infected phage sequences in their CRISPR system, we were able to determine the candidate hosts, based on the connection between the CRISPR-targeted sequences, or protospacers, and the associated CRISPR direct repeats. As a result, we detected 1410 CRISPR-targeted phages (Table 1), and the overall coverage ranged up to 40% (S5 Table). Within the 1410 detected phages, 23 candidate hosts were estimated, based on our previous results [21]. Then, we mapped all reads to 19 available reference sequences, resulting in the detection of 19 bacterial sequences (S6 Table). Thus, we confirmed both phage and host bacterial genome sequences in the coprolites, analogous to the giant virus and amoeba cases. Our results demonstrated that the host and viral co-habitants in the Jomon gut environment might be conserved through their evolutional history.

As we could align several reads to gut bacterial genomes, we then decided to utilize all representative gut bacterial genomes for alignment analyses. We utilized the 4644 gut prokaryotes from the Unified Human Gastrointestinal Genome (UHGG) collection [26]. Some gut bacteria, such as Bradyrhizobium bacteria, also live in soil environments, and may have contaminated the coprolites over time. In the present analyses, sequence reads aligned to soil bacteria had a tendency to show longer lengths due to lower fragmentation; thus, those with average alignment length longer than 30 bp—which were less likely from the Jomon DNAs—were filtered out. As a result, 4613 gut bacterial genomes were detected, as shown in Table 1 and S7 Table. We analyzed the bacterial compositions and the phylogenetic relationship of the detected gut bacterial species using the phylogenetic tree reported in [26] (Fig 2 and S1 Fig). In Fig 2, the bar graphs indicate the abundance of reads of bacterial genome-aligned reads within the four Jomon coprolite samples. The read abundance of each species was less than 0.5%. On the other hand, several species of Proteobacteria and Actinobacteria bacteria showed higher than 0.2% abundance. These bacterial phyla were highly prevalent, as well as Firmicutes A, Bacteroidota, and Verrucomicrobiota bacteria (S1 Fig). These tendencies could be seen in the present-day human gut microbiome [35].

thumbnail
Fig 2. Phylogenetic trees of detected gut bacteria.

The tree indicates phylogenetic relationships of 4,616 bacterial species. The colored chart shows different bacterial phylum. The phylogenetic tree and each taxonomic information were derived from the previous report [26]. The gray bar graphs in the outermost layer represent the reads ratio of reads aligned to bacterial genomes in all four samples. The red circular line is a scale bar indicating 0.2% abundance of aligned reads.

https://doi.org/10.1371/journal.pone.0295924.g002

Finally, we analyzed genomic information from the candidate foods, in order to infer the cultural and behavioral characteristics of the Torihama Jomon people. We detected the alignment of reads to the Oncorhynchus nerka (salmon) and Vigna angularis (red beans) genomes (S8 and S9 Tables). A previous archeological report has indicated that the Torihama Jomon people hunted freshwater and saltwater fish, such as salmon, based on the lipids detected from Jomon pottery [36]. Our results provided genomic support that the Jomon people may have hunted and used salmon (probably, red salmon) as a food resource, at least with respect to the Torihama shell-mound site. Further detailed analyses would be required to find robust evidence that it is a salmon.

In conclusion, we obtained genomic data from the coprolites of Jomon people and conducted alignment of reads to the genomes of gut viruses, gut microbes, and food remnants. Even though the average length of the reads was too short to analyze genomic variations, we still observed a vast number of reads mapped to the genomes of viruses, bacteria, and likely foods. Based on our alignment results, we could estimate the existence of microbes and host–viral coexistence in the Jomon gut environment, comparable to that in present-day humans, suggesting that the coprolites have the potential to reveal long-term host–viral co-evolution trends. Furthermore, we discovered the reads of possible foods through genomic information, providing biological evidence for the dietary characteristics of the Jomon people.

Supporting information

S1 Fig. Bacterial compositions of Jomon people’s coprolites.

Bar charts show the taxonomic classification of detected gut bacteria and proportion of each bacterial groups. The ratio was calculated based on the number of aligned reads. The colored bars indicate the top five abundant classifications. The light gray bar contains reads of all other known bacterial families, and the dark gray bar contains reads of unclassified bacteria.

https://doi.org/10.1371/journal.pone.0295924.s001

(TIF)

S5 Table. Alignment to CRISPR targeted sequences.

https://doi.org/10.1371/journal.pone.0295924.s006

(XLSX)

S6 Table. Alignment to host bacterial genomes.

https://doi.org/10.1371/journal.pone.0295924.s007

(XLSX)

S7 Table. Alignment to gut bacterial genomes.

https://doi.org/10.1371/journal.pone.0295924.s008

(XLSX)

Acknowledgments

We thank all the members of the Human Genetics Laboratory at the National Institute of Genetics for providing many suggestions. Most of the computational analyses were performed on the NIG supercomputer at ROIS National Institute of Genetics. This work was supported in part by The Graduate University for Advanced Studies, SOKENDAI. We would like to thank MDPI’s language editing service for English editing.

References

  1. 1. Allentoft ME, Sikora M, Sjögren KG, Rasmussen S, Rasmussen M, Stenderup J, et al. Population genomics of Bronze Age Eurasia. Nature. 2015;522: 167–172. pmid:26062507
  2. 2. McColl H, Racimo F, Vinner L, Demeter F, Gakuhari T, Moreno-Mayar JV, et al. The prehistoric peopling of Southeast Asia. Science. 2018;361: 88–92. pmid:29976827
  3. 3. De Barros Damgaard P, Marchi N, Rasmussen S, Peyrot M, Renaud G, Korneliussen T, et al. 137 ancient human genomes from across the Eurasian steppes. Nature. 2018;557: 369–374. pmid:29743675
  4. 4. Kanzawa-Kiriyama H, Jinam TA, Kawai Y, Sato T, Hosomichi K, Tajima A, et al. Late jomon male and female genome sequences from the funadomari site in Hokkaido, Japan. Anthropol Sci. 2019;127: 83–108.
  5. 5. Gakuhari T, Nakagome S, Rasmussen S, Allentoft ME, Sato T, Korneliussen T, et al. Ancient Jomon genome sequence analysis sheds light on migration patterns of early East Asian populations. Commun Biol. 2020;3: 1–10. pmid:32843717
  6. 6. Wang CC, Yeh HY, Popov AN, Zhang HQ, Matsumura H, Sirak K, et al. Genomic insights into the formation of human populations in East Asia. Nature. 2021;591: 413–419. pmid:33618348
  7. 7. Wibowo MC, Yang Z, Borry M, Hübner A, Huang KD, Tierney BT, et al. Reconstruction of ancient microbial genomes from the human gut. Nature. 2021;594: 234–239. pmid:33981035
  8. 8. Weyrich LS, Duchene S, Soubrier J, Arriola L, Llamas B, Breen J, et al. Neanderthal behaviour, diet, and disease inferred from ancient DNA in dental calculus. Nature. 2017;544: 357–361. pmid:28273061
  9. 9. Adler CJ, Dobney K, Weyrich LS, Kaidonis J, Walker AW, Haak W, et al. Sequencing ancient calcified dental plaque shows changes in oral microbiota with dietary shifts of the Neolithic and Industrial revolutions. Nat Genet. 2013;45: 450–455. pmid:23416520
  10. 10. Krause-Kyora B, Susat J, Key FM, Kühnert D, Bosse E, Immel A, et al. Neolithic and medieval virus genomes reveal complex evolution of hepatitis B. Elife. 2018;7: 1–15. pmid:29745896
  11. 11. Bos KI, Schuenemann VJ, Golding GB, Burbano HA, Waglechner N, Coombes BK, et al. A draft genome of Yersinia pestis from victims of the Black Death. Nature. 2011;478: 506–510. pmid:21993626
  12. 12. Nishimura L, Fujito N, Sugimoto R, Inoue I. Detection of Ancient Viruses and Long-Term Viral Evolution. Viruses. 2022;14: 1336. pmid:35746807
  13. 13. Nishimura L, Sugimoto R, Inoue J, Nakaoka H, Kanzawa-Kiriyama H, Shinoda K ichi, et al. Identification of ancient viruses from metagenomic data of the Jomon people. J Hum Genet. 2021;66: 287–296. pmid:32994538
  14. 14. Habu J. The Ancient Jomon of Japan. Cambridge: Cambridge University Press; 2004.
  15. 15. Torihama Kaiduka research group. Torihama Kaiduka 1. Fukui-ken Kyoiku Iinkai [Board of Education of Fukui Prefecture] Fukui Prefectural Wakasa History Museum; 1979.
  16. 16. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25: 1754–1760. pmid:19451168
  17. 17. O’Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, et al. Reference sequence (RefSeq) database at NCBI: Current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44: D733–D745. pmid:26553804
  18. 18. Brister JR, Ako-Adjei D, Bao Y, Blinkova O. NCBI viral Genomes resource. Nucleic Acids Res. 2015;43: D571–D577. pmid:25428358
  19. 19. Cook R, Brown N, Redgwell T, Rihtman B, Barnes M, Clokie M, et al. INfrastructure for a PHAge REference Database: Identification of Large-Scale Biases in the Current Collection of Cultured Phage Genomes. PHAGE Ther Appl Res. 2021;2: 214–223. pmid:36159887
  20. 20. Gregory AC, Zablocki O, Zayed AA, Howell A, Bolduc B, Sullivan MB. The Gut Virome Database Reveals Age-Dependent Patterns of Virome Diversity in the Human Gut. Cell Host Microbe. 2020;28: 724–740.e8. pmid:32841606
  21. 21. Sugimoto R, Nishimura L, Nguyen PT, Ito J, Parrish NF, Mori H, et al. Comprehensive discovery of CRISPR-targeted terminally redundant sequences in the human gut metagenome: Viruses, plasmids, and more. Borenstein E, editor. PLOS Comput Biol. 2021;17: e1009428. pmid:34673779
  22. 22. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25: 2078–2079. pmid:19505943
  23. 23. Quinlan AR, Hall IM. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26: 841–842. pmid:20110278
  24. 24. Skoglund P, Northoff BH, Shunkov M V., Derevianko AP, Pääbo S, Krause J, et al. Separating endogenous ancient DNA from modern day contamination in a Siberian Neandertal. Proc Natl Acad Sci U S A. 2014;111: 2229–2234. pmid:24469802
  25. 25. Walker PJ, Siddell SG, Lefkowitz EJ, Mushegian AR, Adriaenssens EM, Alfenas-Zerbini P, et al. Recent changes to virus taxonomy ratified by the International Committee on Taxonomy of Viruses (2022). Arch Virol. 2022;167: 2429–2440. pmid:35999326
  26. 26. Almeida A, Nayfach S, Boland M, Strozzi F, Beracochea M, Shi ZJ, et al. A unified catalog of 204,938 reference genomes from the human gut microbiome. Nat Biotechnol. 2021;39: 105–114. pmid:32690973
  27. 27. Letunic I, Bork P. Interactive tree of life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 2021;49: W293–W296. pmid:33885785
  28. 28. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, et al. BLAST+: Architecture and applications. BMC Bioinformatics. 2009;10: 1–9. pmid:20003500
  29. 29. Coughlan S, Das A, O’Herlihy E, Shanahan F, O’Toole PW, Jeffery IB. The gut virome in Irritable Bowel Syndrome differs from that of controls. Gut Microbes. 2021;13: 1–15. pmid:33602058
  30. 30. Clooney AG, Sutton TDS, Shkoporov AN, Holohan RK, Daly KM, O’Regan O, et al. Whole-Virome Analysis Sheds Light on Viral Dark Matter in Inflammatory Bowel Disease. Cell Host Microbe. 2019;26: 764–778.e5. pmid:31757768
  31. 31. La Scola B, Audic S, Robert C, Jungang L, De Lamballerie X, Drancourt M, et al. A giant virus in amoebae. Science. 2003;299: 2033. pmid:12663918
  32. 32. Brandes N, Linial M. Giant viruses-big surprises. Viruses. 2019. p. 404. pmid:31052218
  33. 33. Aherfi S, Colson P, La Scola B, Raoult D. Giant viruses of amoebas: An update. Front Microbiol. 2016;7: 1–14. pmid:27047465
  34. 34. Moniruzzaman M, Martinez-Gutierrez CA, Weinheimer AR, Aylward FO. Dynamic genome evolution and complex virocell metabolism of globally-distributed giant viruses. Nat Commun. 2020;11: 1–11. pmid:32249765
  35. 35. Arumugam M, Raes J, Pelletier E, Paslier D Le, Yamada T, Mende DR, et al. Enterotypes of the human gut microbiome. Nature. 2011;473: 174–180. pmid:21508958
  36. 36. Craig OE, Saul H, Lucquin A, Nishida Y, Taché K, Clarke L, et al. Earliest evidence for the use of pottery. Nature. 2013;496: 351–354. pmid:23575637