Unraveling the Genomic Mysteries of COVID-19: Using the Power of Phylogeny to Predict the Variants Before They Come Into Existence

Authors

  • Sana Abbas St. Andrews Episcopal Upper School

DOI:

https://doi.org/10.47611/jsrhs.v13i3.7735

Keywords:

Phylogenetics, COVID-19, Genomic, Variants, Predict, Existence

Abstract

There is a notable gap in the taxonomic research of COVID-19. This investigation aimed to address and fill this gap by evaluating the accuracy of different phylogenetic tree methods in displaying the evolutionary history of the virus  SARS - CoV 2. This research was motivated by the potential to prevent the recurrent spread of the disease world wide and making the vaccine creation process easier by gaining the ability to predict future virus mutations through the use of phylogenetic analysis. This investigation determined which phylogenetic mechanism (Parsimony, Maximum Likelihood, Neighbor-Joining, and UPGMA) was most accurate in portraying the evolutionary history of COVID-19. The hypothesis that was inferred before beginning this investigation was that the likelihood method of phylogeny would display the evolutionary history of COVID-19 most accurately due its reputation as a phylogenetic method that conducts more thorough studies than other statistical approaches. This investigation was carried out by equally sampling COVID-19 DNA from around the globe and inputting them into software that produces different types of phylogenetic trees. The bootstrap values of the clades of each of the four types of trees produced were then observed to reach a conclusion as to which of them displayed COVID-19 most accurately. The hypothesis ended up being unsupported as it turned out that of the 4 methods tested, the neighbor-joining method proved to be the most accurate. This discovery had significant implications for future research and could be used by scientists to possibly predict new COVID-19 variants before they come into existence. 

Downloads

Download data is not yet available.

References or Bibliography

Bartolucci, F., & Scrucca, L. (2010). Point Estimation Methods with Applications to Item Response Theory Models. In International Encyclopedia of Education (3rd ed., pp. 366–373). essay, Elsevier. https://doi.org/10.1016/B978-0-08-044894-7.01376-2.

Berkeley Edu. (n.d.). Phylogenetic systematics (evolutionary trees). Understanding Evolution . https://evolution.berkeley.edu/evolibrary/article/phylogenetics_08#:~:text=What%20is%20parsimony%3F,requires%20the%20fewest%20evolutionary%20changes

Cunningham, M. (2014). How can you tell if a phylogenetic tree is accurate? . Researchgate. https://www.researchgate.net/post/How-can-you-tell-if-a-phylogenetic-tree-is-accurate

Goloboff, P.A., Catalano, S.A., Marcos Mirande, J., Szumik, C.A., Salvador Arias, J., Källersjö, M. and Farris, J.S. (2009), Phylogenetic analysis of 73 060 taxa corroborates major eukaryotic groups. Cladistics, 25: 211-230. https://doi.org/10.1111/j.1096-0031.2009.00255.x

Hall, B. G. (2018b). Phylogenetic trees made easy: A how-to Manual. Sinauer Associates, imprint of Oxford University Press.

Huelsenbeck, J. P., & Rannala, B. (2004, December 1). Frequentist properties of Bayesian posterior probabilities of phylogenetic trees under simple and complex substitution models. OUP Academic. https://academic.oup.com/sysbio/article/53/6/904/1651356

Katoh, K. (2006). Multiple alignment program for amino acid or nucleotide sequences. MAFFT 7.526. https://mafft.cbrc.jp/alignment/software/

Lemey, P., Salemi, M., & Vandamme, A.-M. (2009). The phylogenetic handbook: A practical approach to DNA and protein phylogeny. Cambridge University Press.

Maddison, W. P., & Maddison, D. R. (2023). Mesquite: A modular system for evolutionary analysis. Version 3.81. Mesquite Project. http://www.mesquiteproject.org/

McLennan, D. A. (2010, September 29). How to read a phylogenetic tree - evolution: Education and outreach. BioMed Central. https://evolution-outreach.biomedcentral.com/articles/10.1007/s12052-010-0273-6

Miller, R. E., McDonald, J. A., & Manos, P. S. (2004). Systematics of Ipomoea subgenus Quamoclit (Convolvulaceae) based on ITS sequence data and a Bayesian phylogenetic analysis. American journal of botany, 91(8), 1208–1218. https://doi.org/10.3732/ajb.91.8.1208

Munar, M. P. (2021, May 29). How to analyze phylogenetic trees | interpret bootstrap values and sequence divergence. YouTube. https://m.youtube.com/watch?v=QlMwSqNbKA8

Nei, M. (1993). Molecular Evolutionary Genetics Analysis. MEGA. http://www.megasoftware.net/

Rambaut, D. (2018). Tracer v1.7.2 . Tracer | BEAST Documentation. https://beast.community/tracer

Swofford, D. (2003). PAUP* (* phylogenetic analysis using PAUP). PAUP Phylogenetic Analysis Using PAUP. https://paup.phylosolutions.com/

U.S. National Library of Medicine. (1982). GenBank Overview. National Center for Biotechnology Information. http://www.ncbi.nlm.nih.gov/genbank/

Published

08-31-2024

How to Cite

Abbas, S. (2024). Unraveling the Genomic Mysteries of COVID-19: Using the Power of Phylogeny to Predict the Variants Before They Come Into Existence . Journal of Student Research, 13(3). https://doi.org/10.47611/jsrhs.v13i3.7735

Issue

Section

HS Research Projects