Department of Biochemistry and Molecular Pharmacology; Graduate School of Biomedical Sciences
Bioinformatics | Computational Biology | Genomics | Immunology and Infectious Disease | Infectious Disease | Microbiology | Nucleic Acids, Nucleotides, and Nucleosides | Population Biology | Structural Biology | Virus Diseases
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) pandemic has motivated a widespread effort to understand its epidemiology and pathogenic mechanisms. Modern high-throughput sequencing technology has led to the deposition of vast numbers of SARS-CoV-2 genome sequences in curated repositories, which have been useful in mapping the spread of the virus around the globe. They also provide a unique opportunity to observe virus evolution in real time. Here, we evaluate two sets of SARS-CoV-2 genomic sequences to identify emerging variants within structured cis-regulatory elements of the SARS-CoV-2 genome. Overall, 20 variants are present at a minor allele frequency of at least 0.5%. Several enhance the stability of Stem Loop 1 in the 5' untranslated region (UTR), including a group of co-occurring variants that extend its length. One appears to modulate the stability of the frameshifting pseudoknot between ORF1a and ORF1b, and another perturbs a bi-ss molecular switch in the 3'UTR. Finally, 5 variants destabilize structured elements within the 3'UTR hypervariable region, including the S2M (stem loop 2 m) selfish genetic element, raising questions as to the functional relevance of these structures in viral replication. Two of the most abundant variants appear to be caused by RNA editing, suggesting host-viral defense contributes to SARS-CoV-2 genome heterogeneity. Our analysis has implications for the development of therapeutics that target viral cis-regulatory RNA structures or sequences.
COVID-19, RNA structure, SARS, coronavirus, phylogeny
Rights and Permissions
Copyright © The Author(s) 2021. This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
DOI of Published Version
Ryder SP, Morgan BR, Coskun P, Antkowiak K, Massi F. Analysis of Emerging Variants in Structured Regions of the SARS-CoV-2 Genome. Evol Bioinform Online. 2021 May 5;17:11769343211014167. doi: 10.1177/11769343211014167. PMID: 34017166; PMCID: PMC8114311. Link to article on publisher's site
Evolutionary bioinformatics online
Ryder SP, Morgan BR, Coskun P, Antkowiak K, Massi F. (2021). Analysis of Emerging Variants in Structured Regions of the SARS-CoV-2 Genome. COVID-19 Publications by UMass Chan Authors. https://doi.org/10.1177/11769343211014167. Retrieved from https://escholarship.umassmed.edu/covid19/250
Creative Commons License
This work is licensed under a Creative Commons Attribution-Noncommercial 4.0 License
Bioinformatics Commons, Computational Biology Commons, Genomics Commons, Immunology and Infectious Disease Commons, Infectious Disease Commons, Microbiology Commons, Nucleic Acids, Nucleotides, and Nucleosides Commons, Population Biology Commons, Structural Biology Commons, Virus Diseases Commons