Genome‐wide analysis of SARS‐CoV‐2 virus strains circulating worldwide implicates heterogeneity

Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), a novel evolutionary divergent RNA virus, is responsible for the present devastating COVID-19 pandemic.To explore the genomic signatures, we comprehensively analyzed 2,492 complete and/or near-complete genome sequences of SARS-CoV-2 strai...

Full description

Autores:
Tipo de recurso:
Article of investigation
Fecha de publicación:
2020
Institución:
Universidad de Bogotá Jorge Tadeo Lozano
Repositorio:
Expeditio: repositorio UTadeo
Idioma:
eng
OAI Identifier:
oai:expeditiorepositorio.utadeo.edu.co:20.500.12010/13392
Acceso en línea:
https://doi.org/10.1038/s41598-020-70812-6
http://hdl.handle.net/20.500.12010/13392
Palabra clave:
Genome‑wide analysis
SARS‑CoV‑2
Virus strains
Implicates heterogeneity
Síndrome respiratorio agudo grave
COVID-19
SARS-CoV-2
Coronavirus
Rights
License
Abierto (Texto Completo)
Description
Summary:Severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2), a novel evolutionary divergent RNA virus, is responsible for the present devastating COVID-19 pandemic.To explore the genomic signatures, we comprehensively analyzed 2,492 complete and/or near-complete genome sequences of SARS-CoV-2 strains reported from across the globe to theGISAID database up to 30 March 2020.Genome-wide annotations revealed 1,516 nucleotide-level variations at diferent positions throughout the entire genome of SARS-CoV-2. Moreover, nucleotide (nt) deletion analysis found twelve deletion sites throughout the genome other than previously reported deletions at coding sequence of the ORF8 (open reading frame), spike, and ORF7a proteins, specifcally in polyprotein ORF1ab (n= 9), ORF10 (n= 1), and 3´-UTR (n= 2). Evidence from the systematic gene-level mutational and protein profle analyses revealed a large number of amino acid (aa) substitutions (n= 744), demonstrating the viral proteins heterogeneous. Notably, residues of receptor-binding domain (RBD) showing crucial interactions with angiotensin-converting enzyme 2 (ACE2) and cross-reacting neutralizing antibody were found to be conserved among the analyzed virus strains, except for replacement of lysine with arginine at 378th position of the cryptic epitope of a Shanghai isolate, hCoV-19/Shanghai/SH0007/2020 (EPI_ISL_416320). Furthermore, our results of the preliminary epidemiological data on SARS-CoV-2 infections revealed that frequency of aa mutations were relatively higher in the SARS-CoV-2 genome sequences of Europe (43.07%) followed byAsia (38.09%), and NorthAmerica (29.64%) while case fatality rates remained higher in the European temperate countries, such as Italy, Spain, Netherlands, France, England and Belgium.Thus, the present method of genome annotation employed at this early pandemic stage could be a promising tool for monitoring and tracking the continuously evolving pandemic situation, the associated genetic variants, and their implications for the development of efective control and prophylaxis strategies.