Enriched Opportunistic Pathogens Revealed by Metagenomic Sequencing Hint Potential Linkages between Pharyngeal Microbiota and COVID-19

Dongyan Xiong; Caroline Muema; Xiaoxu Zhang; Xinming Pan; Jin Xiong; Hang Yang; Junping Yu; Hongping Wei

doi:10.1007/s12250-021-00391-x

October 2021

Citation: Dongyan Xiong, Caroline Muema, Xiaoxu Zhang, Xinming Pan, Jin Xiong, Hang Yang, Junping Yu, Hongping Wei. Enriched Opportunistic Pathogens Revealed by Metagenomic Sequencing Hint Potential Linkages between Pharyngeal Microbiota and COVID-19 .VIROLOGICA SINICA, 2021, 36(5) : 924-933. http://dx.doi.org/10.1007/s12250-021-00391-x

Enriched Opportunistic Pathogens Revealed by Metagenomic Sequencing Hint Potential Linkages between Pharyngeal Microbiota and COVID-19

Dongyan Xiong ^1,2 ,
Caroline Muema ^1,2 ,
Xiaoxu Zhang ^1,2 ,
Xinming Pan ³ ,
Jin Xiong ¹ ,
Hang Yang ¹ ,
Junping Yu ^{1
,,} ,
Hongping Wei ^{1
,,}

1.
CAS Key Laboratory of Special Pathogens and Biosafety, Centre for Biosafety Mega-Sciences, Wuhan Institute of Virology, Chinese Academy of Sciences, Wuhan, 430071, China
2.
University of Chinese Academy of Sciences, Beijing, 100049, China
3.
Jiangxia District Center for Disease Control and Prevention, Wuhan, 430200, China

Corresponding author: Junping Yu, yujp@wh.iov.cn, ORCID: 0000-0001-9024-0793
Hongping Wei, hpwei@wh.iov.cn, ORCID: 0000-0002-9948-8880
Supplementary Information The online version contains supplementary material available at https://doi.org/10.1007/s12250-021-00391-x.
Received Date: 15 December 2020
Accepted Date: 15 March 2021
Published Date: 12 May 2021
Available online: 01 October 2021

Abstract

As a respiratory tract virus, SARS-CoV-2 infected people through contacting with the upper respiratory tract first. Previous studies indicated that microbiota could modulate immune response against pathogen infection. In the present study, we performed metagenomic sequencing of pharyngeal swabs from eleven patients with COVID-19 and eleven Non-COVID-19 patients who had similar symptoms such as fever and cough. Through metagenomic analysis of the above two groups and a healthy group from the public data, there are 6502 species identified in the samples. Specifically, the Pielou index indicated a lower evenness of the microbiota in the COVID-19 group than that in the Non-COVID-19 group. Combined with the linear discriminant analysis (LDA) and the generalized linear model, eighty-one bacterial species were found with increased abundance in the COVID-19 group, where 51 species were enriched more than 8 folds. The top three enriched genera were Streptococcus, Prevotella and Campylobacter containing some opportunistic pathogens. More interestingly, through experiments, we found that two Streptococcus strains, S. suis and S. agalactiae, could stimulate the expression of ACE2 of Vero cells in vitro, which may promote SARS-CoV-2 infection. Therefore, these enriched pathogens in the pharynxes of COVID-19 patients may involve in the virus-host interactions to affect SARS-CoV-2 infection and cause potential secondary bacterial infections through changing the expression of the viral receptor ACE2 and/or modulate the host's immune system.
- COVID-19
- , Metagenome sequencing
- , ACE2
- , Streptococcus
- , Prevotella
- , Campylobacter

Electronic Supplementary Material

10.1007s12250-021-00391-x-ESM1.pdf
10.1007s12250-021-00391-x-ESM2.xls
10.1007s12250-021-00391-x-ESM3.xls

References
1. Arashiro T, Furukawa K, Nakamura A (2020) COVID-19 in 2 persons with mild upper respiratory tract symptoms on a cruise ship, Japan. Emerg Infect Dis 26: 1345-1348
  doi: 10.3201/eid2606.200452
2. Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA (2012) SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol J Comput Mol Cell Biol 19: 455-477
  doi: 10.1089/cmb.2012.0021
3. Bolger AM, Lohse M, Usadel B (2014) Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30: 2114-2120
  doi: 10.1093/bioinformatics/btu170
4. Carmody RN, Gerber GK, Luevano JM Jr, Gatti DM, Somes L, Svenson KL, Turnbaugh PJ (2015) Diet dominates host genotype in shaping the murine gut microbiota. Cell Host Microbe 17: 72-84
  doi: 10.1016/j.chom.2014.11.010
5. De Boeck I, van den Broek MFL, Allonsius CN, Spacova I, Wittouck S, Martens K, Wuyts S, Cauwenberghs E, Jokicevic K, Vandenheuvel D, Eilers T, Lemarcq M, De Rudder C, Thys S, Timmermans JP, Vroegop AV, Verplaetse A, Van de Wiele T, Kiekens F, Hellings PW, Vanderveken OM, Lebeer S (2020) Lactobacilli have a niche in the human nose. Cell Rep 31: 107674
  doi: 10.1016/j.celrep.2020.107674
6. Dhar D, Mohanty A (2020) Gut microbiota and Covid-19-possible link and implications. Virus Res 285: 198018
  doi: 10.1016/j.virusres.2020.198018
7. Han M, Yang P, Zhong C, Ning K (2018) The human gut virome in hypertension. Front Microbiol 9: 3150
  doi: 10.3389/fmicb.2018.03150
8. Hanada S, Pirzadeh M, Carver KY, Deng JC (2018) Respiratory viral infection-induced microbiome alterations and secondary bacterial pneumonia. Front Immunol 9: 2640
  doi: 10.3389/fimmu.2018.02640
9. Huang C, Wang Y, Li X, Ren L, Zhao J, Hu Y, Zhang L, Fan G, Xu J, Gu X, Cheng Z, Yu T, Xia J, Wei Y, Wu W, Xie X, Yin W, Li H, Liu M, Xiao Y, Gao H, Guo L, Xie J, Wang G, Jiang R, Gao Z, Jin Q, Wang J, Cao B (2020) Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395: 497-506
10. Jonsson V, Österlund T, Nerman O, Kristiansson E (2016) Statistical evaluation of methods for identification of differentially abundant genes in comparative metagenomics. BMC Genomics 17: 78
  doi: 10.1186/s12864-016-2386-y
11. Kamada N, Seo SU, Chen GY, Núñez G (2013) Role of the gut microbiota in immunity and inflammatory disease. Nat Rev Immunol 13: 321-335
  doi: 10.1038/nri3430
12. Katoh K, Standley DM (2013) MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol 30: 772-780
  doi: 10.1093/molbev/mst010
13. Khan AA, Khan Z (2020) COVID-2019 associated overexpressed Prevotella proteins mediated host-pathogen interactions and their role in coronavirus outbreak. Bioinformatics 36: 4065−4069
  doi: 10.1093/bioinformatics/btaa285
14. Konkel ME, Monteville MR, Rivera-Amill V, Joens LA (2001) The pathogenesis of Campylobacter jejuni-mediated enteritis. Curr Issues Intest Microbiol 2: 55-71
15. Langmead B, Salzberg SL (2012) Fast gapped-read alignment with Bowtie 2. Nat Methods 9: 357-359
  doi: 10.1038/nmeth.1923
16. Letunic I, Bork P (2019) Interactive Tree Of Life (iTOL) v4: recent updates and new developments. Nucleic Acids Res 47: W256-W259
  doi: 10.1093/nar/gkz239
17. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R, Genome Project Data Processing S (2009) The sequence alignment/map format and SAMtools: Genome Project Data Processing S. Bioinformatics (Oxford, England) 25: 2078-2079
  doi: 10.1093/bioinformatics/btp352
18. Li D, Liu CM, Luo R, Sadakane K, Lam TW (2015) MEGAHIT: an ultra-fast single-node solution for large and complex metagenomics assembly via succinct de Bruijn graph. Bioinformatics 31: 1674-1676
  doi: 10.1093/bioinformatics/btv033
19. Li D, Chen Y, Liu H, Jia Y, Li F, Wang W, Wu J, Wan Z, Cao Y, Zeng R (2020) Immune dysfunction leads to mortality and organ injury in patients with COVID-19 in China: insights from ERS-COVID-19 study. Signal Transduct Target Ther 5: 62
  doi: 10.1038/s41392-020-0163-5
20. Lippi G, Simundic AM, Plebani M (2020) Potential preanalytical and analytical vulnerabilities in the laboratory diagnosis of coronavirus disease 2019 (COVID-19). Clin Chem Lab Med 58: 1070-1076
  doi: 10.1515/cclm-2020-0285
21. Livak KJ, Schmittgen TD (2001) Analysis of relative gene expression data using real-time quantitative PCR and the 2(−Delta Delta C(T)) method. Methods 25: 402-408
  doi: 10.1006/meth.2001.1262
22. Lopez CA, Skaar EP (2018) The impact of dietary transition metals on host-bacterial interactions. Cell Host Microbe 23: 737-748
  doi: 10.1016/j.chom.2018.05.008
23. Narasimhan V, Danecek P, Scally A, Xue Y, Tyler-Smith C, Durbin R (2016) BCFtools/RoH: a hidden Markov model approach for detecting autozygosity from next-generation sequencing data. Bioinformatics (Oxford, England) 32: 1749-1751
  doi: 10.1093/bioinformatics/btw044
24. Oksanen J (2014) vegan: Community Ecology Package. R package version 2.0-10 edn.
25. Park SE (2020) Epidemiology, virology, and clinical features of severe acute respiratory syndrome-coronavirus-2 (SARS-CoV-2; Coronavirus Disease-19). Clin Exp Pediatr 63: 119-124
  doi: 10.3345/cep.2020.00493
26. Price MN, Dehal PS, Arkin AP (2009) FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol 26: 1641-1650
  doi: 10.1093/molbev/msp077
27. Rio DC, Ares M Jr, Hannon GJ, Nilsen TW (2010) Purification of RNA using TRIzol (TRI reagent). Cold Spring Harb Protoc 2010: pdb. prot5439
28. Ritchie ME, Phipson B, Wu D, Hu Y, Law CW, Shi W, Smyth GK (2015) Limma powers differential expression analyses for RNA-sequencing and microarray studies. Nucleic Acids Res 43: e47
  doi: 10.1093/nar/gkv007
29. Robinson MD, McCarthy DJ, Smyth GK (2010) edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26: 139-140
  doi: 10.1093/bioinformatics/btp616
30. Segata N, Izard J, Waldron L, Gevers D, Miropolsky L, Garrett WS, Huttenhower C (2011) Metagenomic biomarker discovery and explanation. Genome Biol 12: R60
  doi: 10.1186/gb-2011-12-6-r60
31. Shen B, Yi X, Sun Y, Bi X, Du J, Zhang C, Quan S, Zhang F, Sun R, Qian L, Ge W, Liu W, Liang S, Chen H, Zhang Y, Li J, Xu J, He Z, Chen B, Wang J, Yan H, Zheng Y, Wang D, Zhu J, Kong Z, Kang Z, Liang X, Ding X, Ruan G, Xiang N, Cai X, Gao H, Li L, Li S, Xiao Q, Lu T, Zhu Y, Liu H, Chen H, Guo T (2020a) Proteomic and metabolomic characterization of COVID-19 patient sera. Cell 182: 59−72. e15
  doi: 10.1016/j.cell.2020.05.032
32. Shen Z, Xiao Y, Kang L, Ma W, Shi L, Zhang L, Zhou Z, Yang J, Zhong J, Yang D, Guo L, Zhang G, Li H, Xu Y, Chen M, Gao Z, Wang J, Ren L, Li M (2020b) Genomic diversity of SARS-CoV-2 in coronavirus disease 2019 patients. Clin Infect Dis 71: 713-720
  doi: 10.1093/cid/ciaa203
33. Sodhi CP, Nguyen J, Yamaguchi Y, Werts AD, Lu P, Ladd MR, Fulton WB, Kovler ML, Wang S, Prindle T Jr, Zhang Y, Lazartigues ED, Holtzman MJ, Alcorn JF, Hackam DJ, Jia H (2019) A Dynamic variation of pulmonary ACE2 is required to modulate neutrophilic inflammation in response to pseudomonas aeruginosa lung infection in mice. J Immunol 203: 3000-3012
  doi: 10.4049/jimmunol.1900579
34. Takahashi Y, Watanabe N, Kamio N, Kobayashi R, Iinuma T, Imai K (2020) Aspiration of periodontopathic bacteria due to poor oral hygiene potentially contributes to the aggravation of COVID-19. J Oral Sci 63: 1-3
35. Tsang TK, Lee KH, Foxman B, Balmaseda A, Gresh L, Sanchez N, Ojeda S, Lopez R, Yang Y, Kuan G, Gordon A (2020) Association between the respiratory microbiome and susceptibility to influenza virus infection. Clin Infect Dis 71: 1195−1203
  doi: 10.1093/cid/ciz968
36. van Vliet AH, Ketley JM (2001) Pathogenesis of enteric Campylobacter infection. Symp Ser Soc Appl Microbiol 2001: 455-565
37. Vesterbacka J, Rivera J, Noyan K, Parera M, Neogi U, Calle M, Paredes R, Sönnerborg A, Noguera-Julian M, Nowak P (2017) Richer gut microbiota with distinct metabolic profile in HIV infected Elite Controllers. Sci Rep 7: 6269
  doi: 10.1038/s41598-017-06675-1
38. Wang Z, Xu X (2020) scRNA-seq profiling of human testes reveals the presence of the ACE2 receptor, a target for SARS-CoV-2 infection in spermatogonia, leydig and sertoli cells. Cells 9: 920
  doi: 10.3390/cells9040920
39. Wood DE, Salzberg SL (2014) Kraken: ultrafast metagenomic sequence classification using exact alignments. Genome Biol 15: R46
  doi: 10.1186/gb-2014-15-3-r46
40. World Health Organization (WHO) (2020a) Coronavirus disease (COVID-2019) pandemic. https://www.who.int/emergencies/diseases/novel-coronavirus-2019
41. World Health Organization (WHO) (2020b) Modes of transmission of virus causing COVID-19: implications for IPC precaution recommendations: scientific brief, 29 March 2020. World Health Organization, Geneva
42. Xu X, Chen P, Wang J, Feng J, Zhou H, Li X, Zhong W, Hao P (2020) Evolution of the novel coronavirus from the ongoing Wuhan outbreak and modeling of its spike protein for risk of human transmission. Sci China Life Sci 63: 457-460
  doi: 10.1007/s11427-020-1637-5
43. Young KT, Davis LM, Dirita VJ (2007) Campylobacter jejuni: molecular biology and pathogenesis. Nat Rev Microbiol 5: 665-679
  doi: 10.1038/nrmicro1718
44. Zhang X, Nieuwdorp M, Groen AK, Zwinderman AH (2019) Statistical evaluation of diet-microbe associations. BMC Microbiol 19: 90
  doi: 10.1186/s12866-019-1464-0
45. Zhang W, Du RH, Li B, Zheng XS, Yang XL, Hu B, Wang YY, Xiao GF, Yan B, Shi ZL, Zhou P (2020) Molecular and serological investigation of 2019-nCoV infected patients: implication of multiple shedding routes. Emerg Microbes Infect 9: 386-389
  doi: 10.1080/22221751.2020.1729071
46. Zuo T, Zhang F, Lui GCY, Yeoh YK, Li AYL, Zhan H, Wan Y, Chung A, Cheung CP, Chen N, Lai CKC, Chen Z, Tso EYK, Fung KSC, Chan V, Ling L, Joynt G, Hui DSC, Chan FKL, Chan PKS, Ng SC (2020) Alterations in gut microbiota of patients with COVID-19 during time of hospitalization. Gastroenterology 159: 944-955
  doi: 10.1053/j.gastro.2020.05.048
Proportional views

Figures(4) / Tables(1)

PDF

Article Metrics

Article views(5241) PDF downloads(3) Cited by()

Proportional views

HTML

Introduction

Globally, as of 30 November 2020, there have been 62,195,274 confirmed cases of COVID-19 caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), including 1,453,355 deaths, according to the WHO (WHO 2020a). SARS-CoV-2 nucleic acid could be detected in pharynx, bronchoalveolar lavage fluid and feces of patients (Zhang et al. 2020), which is associated with angiotensin-converting enzyme 2 (ACE2), the receptor of SARS-CoV-2, existing in multiple organs of the host (Wang and Xu 2020; Xu et al. 2020).

One significant characteristic of COVID-19 different from other coronavirus infections is that the symptoms vary from patients to patients (Arashiro et al. 2020). As reported in a clinical analysis (Park 2020), fever and respiratory symptoms started from 3 to 7 days after exposure to SARS-CoV-2. Among the patients, 80% had no or mild to moderate pneumonia, and approximately 20% had severe pneumonia. Different studies have tried to find the factors contributing to various COVID-19 symptoms. For example, Li et al. (Li et al. 2020) found that the dead COVID-19 patients had lower percentages in counts of CD3⁺, CD4⁺, and CD8⁺ lymphocytes than that of the survivors, which has strong predictive values for in-hospital mortality, organ injury, and severe pneumonia. In order to identify the factors determining the transition of symptoms from mild to severe, used proteomics and metabolomics to determine the markers in patients with COVID-19. There were 22 proteins and 7 metabolites screened from serum through a machine learning prediction. With them, COVID-19 patients may have a high probability to develop into severe cases (Shen et al. 2020a).

Many studies have proven that microbiota is an important component of human body (Kamada et al. 2013; Lopez and Skaar 2018). Some evidences indicate that microbiome plays an important role in modulating immune response against pathogen infections (Vesterbacka et al. 2017; Hanada et al. 2018). Vice versa, during viral pathogen infections, such as influenza and AIDS, the secondary bacterial infections might happen and disrupt the balance of the original microbiota (Vesterbacka et al. 2017; Hanada et al. 2018; Tsang et al. 2020). Recent studies have shown that the microbiome of the upper respiratory tract also affects human health (De Boeck et al. 2020). Some scientists have also noticed the relationship between microbiota and COVID-19 infections (Khan and Khan 2020; Shen et al. 2020b; Zuo et al. 2020). For example, Tao et al. performed gut shotgun metagenome analysis of fecal samples from patients with COVID-19 in Hong Kong (Zuo et al. 2020). They found that the patients with COVID-19 had significant alterations in fecal microbiomes compared with the controls, characterized by enrichment of opportunistic pathogens and depletion of beneficial commensals at all time points during hospitalization. According to their results, the severity of the disease may be reduced by altering gut microbes. But it is not clear whether the similar conclusion could be extended in other organs within the patients. Furthermore, the enriched opportunistic pathogens identified may contribute to controlling the secondary bacterial infection, which is also a risk for COVID-19 critical patients. It has been reported that the secondary bacterial infection rate of the COVID-19 patients was between 1% and 10% (Huang et al. 2020).

In the current study, microbiome difference between SARS-CoV-2 RT-qPCR (quantitative reverse-transcription polymerase chain reaction) positive and negative pharyngeal swabs from suspected patients were analyzed. Pharyngeal swabs are chosen because SARS-CoV-2 is transmitted through contacting with the upper respiratory tract and the swabs are easily available since they are normally collected for the detection of SARS-CoV-2 by RT-qPCR due to that they contain relatively high viral loads after onset of symptoms (Lippi et al. 2020; WHO 2020b).

Materials and Methods

Sample Collection and SARS-CoV-2 Detection

Pharyngeal swabs in viral transportation medium (VTM) were collected from 22 suspected COVID-19 patients with the symptoms of fever and cough at the First People's Hospital of Jiangxia District in Wuhan during the period from 25 January 2020 to 10 February 2020. All the patients had the typical symptoms of COVID-19 such as fever and cough. The pharyngeal swab samples were inactivated at 56 ℃ for 1 h. The samples were shaken and mixed with a vortex before nucleic acid extraction. RNA extraction was performed according to the instructions of QIAamp Viral RNA Mini Kit (Qiagen, Hilden, Germany). The quality and quantity of the RNA were determined by Nanodrop (ND-2000, Thermo Fisher, Waltham, MA, USA). All samples were tested by RT-qPCR targeting RBD sequence of SARS-CoV-2 spike protein. The primers and probes used were as follows: forward primer: 5′-CAATGGTTTAACAGGCACAGG-3′, reverse primer: 5′-CTCAAGTGTCTGTGGATCACG-3′, probe: 5′- FAM-ACAGCATCAGTAGTGTCAGCAATGTCTC-BHQ1-3′. The primers and the probe were synthesized by Sangon Biotech Co., Ltd. (Shanghai, China). Quantitative reverse transcription PCR (RT-qPCR) was performed using LightCycler® Multiplex RNA Virus Master (Roche Diagnostics GmbH, Mannheim, Germany). Briefly, the 5× RT-qPCR reaction mixture (4 μL each) was mixed with 2 μL of 10× primer and probe mixture, 0.1 μL of RT enzyme solution, 8.9 μL of water, and 5 μL of RNA template and performed in a CFX96 Touch™ Real-time PCR detection System (CFX96, Bio-Rad Laboratories, Hercules, CA, USA) with an initial reverse transcription step at 50 ℃ for 10 min, next denaturation step at 95 ℃ for 2 min, then 40 cycles of denaturation at 95 ℃ for 10 s, annealing at 60 ℃ for 45 s with fluorescent signals acquisition and the last step at 40 ℃ for 30 s.

DNA and RNA Library Construction

The transcriptome sequencing was performed to confirm the molecular diagnosis results. The cDNA was synthesized firstly using TaKaRa PrimeScript IV 1st strand cDNA Synthesis kit (Takara Bio Inc., Kusatsu, Shiga, Japan) and ds-cDNA Synthesis Module 2 (Vazyme Biotech Co., Ltd, Nanjing, China). The sequencing libraries were prepared using VAHTS Universal Plus DNA Library Prep Kit. Paired-end sequencing was performed on the MGI-seq2000 platform (MGI Tech Co. Ltd, Shenzhen, China). The cDNA synthesis, library preparation and sequencing were performed by the Frasergen Bioinformatics Co., Ltd, Wuhan, China. For metagenomic sequencing, total DNA was extracted with QIAamp DNA Mini Kit (Qiagen, Hilden, Germany) according to the instruction of the manufacture. Library preparation and sequencing were performed as described above producing 150 bp pair-end reads.

SARS-CoV-2 Genome Assembling

Transcriptome data and metagenomic data were processed by filtering the host genome sequences first using the fastq screen software (version 0.14.0, with the options chose—aligner bowtie2—nohits, http://www.bioinformatics.babraham.ac.uk/projects/fastq_screen/), respectively. The filtered data were then trimmed by Trimmomatic software (Bolger et al. 2014) (In the version 0.36, the options of LEADING: 20 TRAILING: 20 SLIDINGWINDOW: 4:20 MINLEN: 40 were chosen) to remove low quality sequences.

For the transcriptome data, we mainly focused on the SARS-CoV-2 genome assembly. Ninety-seven SARS-CoV-2 genomes were downloaded from NCBI GenBank as the references. The clean reads were mapped to these references through samtools software (version 1.9) (Li et al. 2009). The reads which were aligned to the references were extracted and transformed to fastq format by bcftools software (version 1.9, with the default options) (Narasimhan et al. 2016). The de novo assembly was performed by SPAdes (version 3.13.0, with the default options) (Bankevich et al. 2012). SARS-CoV-2 genomes or draft genomes assembled from the SARS-CoV-2 positive samples together with the references were used to do multi-sequences alignment with MAFFT (version 7.4.07, with the default options) (Katoh and Standley 2013). Since the genome fragments assembled from the samples were of uneven quality, only the genomes with high quality were used for phylogenetic tree construction. The phylogenetic analysis for SARS-CoV-2 was performed by FastTree (Price et al. 2009) with the options of -gtr -nt -fastest and 1000 bootstrap value. The phylogenetic tree was visualized by an online tool iTOL (Letunic and Bork 2019).

Metagenomic Data Processing

To compare with pharyngeal samples from healthy people, seven next-generation shotgun sequencing microbiome data of pharyngeal samples from healthy people were downloaded from the public database: www.hmpdacc.org/hmp/resources/. The open access number of the healthy control group is listed in Supplementary Table S1. All the clean data were assembled using the megahit software (version 1.1.3, with the default options) (Li et al. 2015). After that, clean reads were mapped to the contigs by bowtie2 (Langmead and Salzberg 2012). In order to minimize the errors due to the sequencing differences between samples, the RPKMs (Reads Per Kilobase per Million mapped reads) value, which was used to describe the relative abundance of each microorganism or virus, was calculated and normalized based on the lengths and the numbers of the contigs of the mapped reads in each sample (Li et al. 2009; Han et al. 2018). Besides, the species annotation of each contig was processed by kraken2 (Wood and Salzberg 2014).

The downstream analysis was mainly processed in the R language program (version 3.5.3; https://cran.r-project.org/bin/windows/base/old/3.5.3/). The principal coordinates analysis (PCoA) based on the Bray-Curtis distances was calculated with the vegan package (Oksanen 2014). For the analysis of the differential species (including SARS-CoV-2) abundance, owing to the lack of an optimal single method in current studies (Zhang et al. 2019), the analysis of the abundance difference between groups was performed by two diverse methods. One method was the combination of Linear Discriminant Analysis (LDA) together with the Kruskal-Wallis (KW) sum-rank test, which was completed by the software LEfSe (with the default options) (Segata et al. 2011) commonly used for differential species identified. Another approach was edgeR (Robinson et al. 2010) in conjunction with the LIMMA package (Ritchie et al. 2015), a generalized linear model, which has been found to have the best overall performance in a metagenomic profile (Jonsson et al. 2016; Zhang et al. 2019). Here, when determining the differential abundance species via edgeR, the abundance level of species with an absolute value log2 fold change ≥ 2 and P value ≤ 0.05 would be filtered as differentially abundant species.

Growth of Bacteria

Streptococcus suis and S. agalactiae were grown on Todd Hewitt agar (THA). E. coli, Acinetobacter baumannii, and Bacillus cereus were grown on Lysogeny broth agar (LBA). A single colony of each bacterial strain was inoculated into respective media and incubated overnight at 37 ℃. Colony forming unit (CFU) was then determined by plating series of dilutions on respective agar.

Cell Culture

Vero cells (African green monkey kidney cell line) were cultured in DMEM containing 10% FBS, 100 units/mL penicillin and 100 µg/mL streptomycin at 37 ℃ in a 5% CO₂ atmosphere. Vero cells were seeded in 24 well plates with a density of 1 × 10⁵ cells/well and grown to 80% confluence in DMEM with 2% FBS and without antibiotics. Each well was then challenged with 10⁶ CFU of each bacterium, taking bacterial free cells as controls. Total cellular RNA was isolated from cells at 0 h, 4 h and 8 h post incubation using TRIzol (Invitrogen-Gibco, Grand Island, NY) assay as described previously (Rio et al. 2010). RNA concentration and purity were then determined using Nanodrop 1000 Spectrophotometer at 260 nm. Each of these tests was done in triplicate throughout the study.

RT-qPCR Detection of the Expression Level of ACE2

The expression level of ACE2 in Vero cells cocultured with bacteria was determined by RT-qPCR using a HiScript® II One Step RT-qPCR SYBR® Green kit (Vazyme, Nanjing, China) following the manufacturer's instructions, with the primers ACE2-F (5′-CGAAGCCGAAGACCTGTTCTA-3′) and ACE2-R (5′-GGGCAAGTGTGGACTGTTCC-3′). Briefly, the amplification was performed as follows: 50 ℃ for 15 min, 98 ℃ for 2 min, followed by 40 cycles of 98 ℃ for 10 s, 60 ℃ for 15 s, and 68 ℃ for 30 s), and a default melt curve step in ABI Stepone machine. The housekeeping gene GAPDH was used as the reference gene with the primers GAPDH-F (5′-AACTCTGGTAAAGTGGAT-3′) and GAPDH-R (5′-TACTCAGCGCCAGCATCG-3′). The relative expression level of ACE2 was determined using the 2^−ΔΔCt method as described previously (Livak and Schmittgen 2001) and compared to that of the control group using the Student's t test by R program (version 3.5.3; https://cran.r-project.org/bin/windows/base/old/3.5.3/).

Statistical Analysis

All the statistical analyses were performed in the R program (version 3.5.3; https://cran.r-project.org/bin/windows/base/old/3.5.3/). The two independent groups were analyzed via Student's t-test. Data are showed as means ± SD. Statistical differences were considered significant at *P < 0.05, **P < 0.01 and ***P < 0.01.

Discussion

It is well known that genetic and environmental factors play important roles in shaping human microbiota and immunity (Carmody et al. 2015). Microbiota may play certain roles in immune response to COVID-19 (Dhar and Mohanty 2020). Because SARS-CoV-2 is transmitted through contacting with the upper respiratory tract, pharyngeal swabs were used in the current study to find the microbiome difference between COVID-19 patients and Non-COVID-19 patients. Unlike previous studies which mainly focused on gut microbiome of hospitalized COVID-19 patients (Khan and Khan 2020; Shen et al. 2020b; Zuo et al. 2020), the microbiome in throat may be helpful for identifying risk factors for SARS-CoV-2 infection and the development of severe COVID-9 illness. Another advantage is that pharyngeal swabs can be collected easily and are compatible with the current RT-qPCR detection of SARS-CoV-2.

Based on the metagenomic sequencing data, we identified 6502 microbial taxa in the pharyngeal swabs, which showed that there exist diversified microbes in human throats. Both alpha diversity and beta diversity analysis indicated that the species diversities of the COVID-19 group and the Non-COVID-19 group were significantly lower than that of the healthy people group. The linear discriminant analysis and the generalized linear model indicated that there were 81 species having significantly increased abundance in the COVID-19 group compared with that in the Non-COVID-19 group (Fig. 3B), which supports the results of the Pielou index analysis shown in Fig. 2C. Among the species enriched in the COVID-19 group (Supplementary Table S5), the top three enriched genera were found to be Streptococcus, Prevotella, and Campylobacter. These results are consistent with the findings of a recent study (Zuo et al. 2020), showing that the opportunistic pathogens such as Clostridium hathewayi, Streptococcus parasanguinis, Actinomyces odontolyticus were enriched in the guts of the hospitalized COVID-19 patients. Further investigation on the relationship between the enriched opportunistic pathogens and the secondary infections in the COVID-19 patients would provide helpful guides for better treatment outcomes of COVID-19 patients.

It is also clearly a question to be answered whether these enriched opportunistic pathogens would affect SARS-CoV-2 infection. If so, how do they affect? In this study, we found that two Streptococcus, i.e. S. suis and S. agalactiae, can promote the ACE2 expression level in Vero cells, which indicates that they may promote SARS-CoV-2 infection. Other studies have found that the Campylobacter jejuni which is one of the enriched opportunistic pathogens identified in COVID-19 patients can cause significant inflammation, enteritis and diarrhea in humans (Konkel et al. 2001; van Vliet and Ketley 2001; Young et al. 2007). It has been reported that about 3% of COVID-19 patients had symptoms of diarrhea (Huang et al. 2020). It may be worth to do further study to see if diarrhea after SARS-CoV-2 infection would be caused by Campylobacter. While for Prevotella, Abdul et al. have found that the over-expressed Prevotella proteins can promote viral infection (Khan and Khan 2020). The Prevotella proteins are also found to be involved in multiple interactions with NF-κB which is related to the clinical severity of COVID-19 (Khan and Khan 2020). Therefore, all these studies indicated that the enriched pathogens in the COVID-19 patient group may play certain roles in SARS-CoV-2 infections. They could affect the expression of the ACE2 and/or modulate the immune system involved in the virus-host interactions.

Overall, our study reveals for the first time that there are several enriched opportunistic pathogens in the pharyngeal tracts of the COVID-19 patients compared with the uninfected people. These bacteria might be involved in the virus-host interactions to affect SARS-CoV-2 infection and cause potential secondary bacterial infections through changing the expression of the ACE2 and/or modulate the host's immune system. It also provides new clues for further investigations on elucidating whether and how microbiota in human throat would play certain roles in SARS-CoV-2 infections.

Acknowledgements

We would like to thank the financial supports of CAS (project No 153B42KYSB20170004) and the Wuhan National Biosafety Lab for this study. We are grateful to Ms. Lei Zhang from the Core Facility and Technical Support of Wuhan Institute of Virology for the sequencing support.

Author contributions

JY and HW generated the idea, designed the experiment and wrote the manuscript. DX performed the bioinformatic analysis and wrote the manuscript. CM and XZ performed the sequence experiments and revised the manuscript. XP and JX collected samples and extracted nucleic acids to construct libraries for sequencing. CM and HY performed the experiments of bacteria on ACE2 expression and revised the manuscript. JY and HW finalized the manuscript. All authors read and approved the final manuscript.

Compliance with Ethical Standards

Conflict of interest

The authors declare that they have no conflict of interests.

Animal and Human Rights Statement

Additional informed consent was obtained from all patients for which identifying information is included in this article. The study and use of all samples were approved by the Ethics Committee of Wuhan Institute of Virology, CAS (No. WIVH1720 2001).

Figure (4) Table (1) Reference (46) Relative (20)

Sample	Seq length (bp)	Number of contigs	Number of reads	Ct value	Avg depth
S1	30,062	1	200,476	17.5	886
S2	29,977	1	295,310	17.1	316
S3	30,026	1	225,212	23.1	1,009
S4	29,531	7	3632	26.2	11
S5	29,660	1	292,054	23.6	35
S6	393	1	82	28.4	30
S7	27,958	10	3268	26.3	10
S8	29,889	23	2836	28.6	11
S9	1061	2	12	31.2	2
S10	2553	5	148,226	26.4	103
S11	29,296	3	8050	25.7	36

Enriched Opportunistic Pathogens Revealed by Metagenomic Sequencing Hint Potential Linkages between Pharyngeal Microbiota and COVID-19

Abstract

Electronic Supplementary Material

References

Proportional views

Article Metrics

Related

Proportional views