A Machine Learning Approach for Plant-based Drug Discovery: High-Throughput Prediction of Biological Activities and Enzyme Commission Numbers from Phytochemicals and Amino Acid Sequences of Plants

Authors

  • Leoni Kim Chadwick International School Songdo
  • Ryan Oh Chadwick International School Songdo
  • Christopher Koester Chadwick International School Songdo

DOI:

https://doi.org/10.47611/jsrhs.v14i1.8785

Keywords:

Phytochemical, Enzyme Commission Number, Machine Learning

Abstract

The success of many plant-based drugs and the acknowledgement of the limitations of synthetic drugs has again sparked interest in plant-derived natural products (NP) as a valuable source for novel drug development. Researchers have traditionally used the knowledge-based approach, which relies on traditional medicines to identify candidate plants and extracts. However, NP-based drug development comes with many limitations during the screening stage. First, NP extracts are mostly incompatible with target-based or high-throughput screening. Furthermore, in the case of phenotypic assays, the deconvolution of the mechanism of action of the compound is costly and time-consuming. This study proposes a novel machine learning framework for the high-throughput identification and characterization of plant-derived NPs. This framework consists of two independent models. The first model is a neural network designed to predict phytochemicals’ bioactivities through multi-label classification in four categories: antioxidant activity, anti-inflammatory, neurotoxicity, and lipid metabolism. The second model is a convolutional neural network (CNN) that predicts the Enzyme Commission (EC) numbers of enzymes present in the plant. The proposed framework showed robust performance with the Bioactivity Prediction Model achieving 97.62% accuracy and the EC-number Prediction Model achieving 81.97% accuracy. The framework facilitates a more efficient NP-based drug development by providing important insights applicable to the screening, isolation, and deconvolution of NPs.

Downloads

Download data is not yet available.

References or Bibliography

A. G. Atanasov et al., “Discovery and resupply of pharmacologically active plant-derived natural products: A review,” Biotechnology Advances, vol. 33, no. 8, pp. 1582–1614, Dec. 2015, doi: 10.1016/j.biotechadv.2015.08.001.

A. G. Atanasov, S. B. Zotchev, V. M. Dirsch, and C. T. Supuran, “Natural products in drug discovery: advances and opportunities,” Nature Reviews Drug Discovery, vol. 20, no. 3, pp. 200–216, Jan. 2021, doi: 10.1038/s41573-020-00114-z.

AI Hub. (2024, Aug 21). “Plant functionality prediction genomic data”: AI Hub. https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=data&dataSetSn=71316

Bano, Asghari, et al. “Bioactive Metabolites of Plants and Microbes and Their Role in Agricultural Sustainability and Mitigation of Plant Stress.” South African Journal of Botany, vol. 159, Aug. 2023, pp. 98–109, doi:10.1016/j.sajb.2023.05.049.

BRENDA, “All enzymes,” BRENDA Enzyme Database. Accessed: Sep. 22, 2024. [Online]. Available: https://www.brenda-enzymes.org/all_enzymes.php

Bruce, Stella Omokhefe. “Secondary Metabolites from Natural Products.” IntechOpen, 16 Feb. 2022, https://www.intechopen.com/chapters/80477.

D. J. Newman and G. M. Cragg, “Natural Products as Sources of New Drugs over the Nearly Four Decades from 01/1981 to 09/2019,” Journal of Natural Products, vol. 83, no. 3, pp. 770–803, Mar. 2020, doi: 10.1021/acs.jnatprod.9b01285.

E. Patridge, P. Gareiss, M. S. Kinch, and D. Hoyer, “An analysis of FDA-approved drugs: natural products and their derivatives,” Drug Discovery Today, vol. 21, no. 2, pp. 204–207, Feb. 2016, doi: 10.1016/j.drudis.2015.01.009.

IBM, “Recurrent neural network (RNN),” IBM. Accessed: Sep. 22, 2024. [Online]. Available: https://www.ibm.com/topics/recurrent-neural-networks

Licciardi, P. V., & Underwood, J. R. (2011). Plant-derived medicines: a novel class of immunological adjuvants. International immunopharmacology, 11(3), 390-398.

Mushtaq, S., Abbasi, B. H., Uzair, B., & Abbasi, R. (2018). Natural products as reservoirs of novel therapeutic agents. EXCLI journal, 17, 420.

Nasim, N., Sandeep, I. S., & Mohanty, S. (2022). Plant-derived natural products for drug discovery: current approaches and prospects. The Nucleus, 65(3), 399-411.

Nielsen, J. (2022). Bioactive metabolites: The double-edged sword in your food. Cell, 185(24), 4469-4471.

Petrovska, B. B. (2012). Historical review of medicinal plants’ usage. Pharmacognosy reviews, 6(11), 1.

Toropov, A. A., Toropova, A. P., Mukhamedzhanoval, D. V., & Gutman, I. (2005). Simplified molecular input line entry system (SMILES) as an alternative for constructing quantitative structure-property relationships (QSPR).

T. W. Corson and C. M. Crews, “Molecular Understanding and Modern Application of Traditional Medicines: Triumphs and Trials,” Cell, vol. 130, no. 5, pp. 769–774, Sep. 2007, doi: 10.1016/j.cell.2007.08.021.

Published

02-28-2025

How to Cite

Kim, L., Oh, R., & Koester, C. (2025). A Machine Learning Approach for Plant-based Drug Discovery: High-Throughput Prediction of Biological Activities and Enzyme Commission Numbers from Phytochemicals and Amino Acid Sequences of Plants. Journal of Student Research, 14(1). https://doi.org/10.47611/jsrhs.v14i1.8785

Issue

Section

HS Research Projects