Assessing AlphaFold AI’s Protease Enzyme Structure Prediction Accuracy

Authors

  • Iris Zhang Conestoga High School
  • Janet Wolfe

DOI:

https://doi.org/10.47611/jsrhs.v13i3.7703

Keywords:

Artificial Intelligence, Protein, Protease, Enzyme, AlphaFold, Protein Prediction

Abstract

This study analyzed AlphaFold AI’s ability in accurately predicting protease enzyme structures. AlphaFold uses machine learning, taking amino acid sequences and using physical and scientific knowledge of protein structures to generate a protein structure prediction. Past studies have comfirmed AlphaFold’s general abilities but have identified limitations in certain factors, like post translational modifications, ligands, and other environmental factors. However, there have not been studies assessing AlphaFold in predicting protease enzyme structures specifically. Quantitative data was collected using ex-post facto and correlational methods, which compared the RMSD score between AlphaFold and Protein Data Bank structures of the same protease enzyme. Furthermore, correlational trends were searched for between protein complexity and length with the RMSD score. 77% of the 30 protease enzymes assessed were found to be accurate, with more complex structures lowering in accuracy. Protein length was not a factor in AlphaFold’s prediction accuracy. By utilizing these findings, researchers in the pharmaceutical industry can consider the weak points of AlphaFold, conduct further studies identifying more factors that contribute to AlphaFold’s accuracy, and work on improving the program based on the results. 

Downloads

Download data is not yet available.

References or Bibliography

Bagdonas, H., Fogarty, C.A., Fadda, E. et al. The case for post-predictional modifications in the AlphaFold Protein Structure Database. Nat Struct Mol Biol 28, 869–870 (2021). https://doi.org/10.1038/s41594-021-00680-9

Berman, H. M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T. N., Weissig, H., Shindyalov, I. N., & Bourne, P. E. (2000). The Protein Data Bank. Nucleic acids research, 28(1), 235–242. https://doi.org/10.1093/nar/28.1.235

Buttazzo G. (2023). Rise of artificial general intelligence: risks and opportunities. Frontiers in artificial intelligence, 6, 1226990. https://doi.org/10.3389/frai.2023.1226990

Chakravarty D, Porter LL (2022). AlphaFold2 fails to predict protein fold switching. Protein Science. 31(6):e4353. https://doi.org/10.1002/pro.4353

Castro-Alvarez, A., Costa, A. M., & Vilarrasa, J. (2017). The Performance of Several Docking Programs at Reproducing Protein-Macrolide-Like Crystal Structures. Molecules (Basel, Switzerland), 22(1), 136. https://doi.org/10.3390/molecules22010136

Dhakal, A., McKay, C., Tanner, J. J., & Cheng, J. (2021). Artificial intelligence in the prediction of protein–ligand interactions: recent advances and future directions. Briefings in Bioinformatics, 23(1). https://doi.org/10.1093/bib/bbab476

Goddard, T. D., Huang, C. C., Meng, E. C., Pettersen, E. F., Couch, G. S., Morris, J. H., & Ferrin, T. E. (2018). UCSF ChimeraX: Meeting modern challenges in visualization and analysis. Protein science : a publication of the Protein Society, 27(1), 14–25. https://doi.org/10.1002/pro.3235

Jumper, J., Evans, R., Pritzel, A. et al. Highly accurate protein structure prediction with AlphaFold. Nature 596, 583–589 (2021). https://doi.org/10.1038/s41586-021-03819-2

Kolluri, S., Lin, J., Liu, R., Zhang, Y., & Zhang, W. (2022). Machine Learning and Artificial Intelligence in Pharmaceutical Research and Development: a Review. The AAPS Journal, 24(1). https://doi.org/10.1208/s12248-021-00644-3

López-Otín, C., & Bond, J. S. (2008). Proteases: multifunctional enzymes in life and disease. The Journal of biological chemistry, 283(45), 30433–30437. https://doi.org/10.1074/jbc.R800035200

Metz, C. (2022, July 28). A.I. Predicts the Shape of Nearly Every Protein Known to Science. The New York Times. https://www.nytimes.com/2022/07/28/science/ai-deepmind-proteins.html

Sahu, A., Mishra, J., & Kushwaha, N. (2021). Artificial Intelligence (AI) in Drugs and Pharmaceuticals. Combinatorial Chemistry & High Throughput Screening, 25. https://doi.org/10.2174/1386207325666211207153943

Shanehsazzadeh, A. (2024). Unlocking de novo antibody design with generative artificial intelligence [Review of Unlocking de novo antibody design with generative artificial intelligence]. Cold Spring Harbor Laboratory.

Terwilliger, Thomas C et al. “AlphaFold predictions are valuable hypotheses and accelerate but do not replace experimental structure determination.” Nature methods vol. 21,1 (2024): 110-116. doi:10.1038/s41592-023-02087-4

Thornton, J.M., Laskowski, R.A. & Borkakoti, N. AlphaFold heralds a data-driven revolution in biology and medicine. Nat Med 27, 1666–1669 (2021). https://doi.org/10.1038/s41591-021-01533-0

Published

08-31-2024

How to Cite

Zhang, I., & Wolfe, J. (2024). Assessing AlphaFold AI’s Protease Enzyme Structure Prediction Accuracy . Journal of Student Research, 13(3). https://doi.org/10.47611/jsrhs.v13i3.7703

Issue

Section

AP Capstone™ Research