A Machine Learning Approach for Plant-based Drug Discovery: High-Throughput Prediction of Biological Activities and Enzyme Commission Numbers from Phytochemicals and Amino Acid Sequences of Plants
DOI:
https://doi.org/10.47611/jsrhs.v14i1.8785Keywords:
Phytochemical, Enzyme Commission Number, Machine LearningAbstract
The success of many plant-based drugs and the acknowledgement of the limitations of synthetic drugs has again sparked interest in plant-derived natural products (NP) as a valuable source for novel drug development. Researchers have traditionally used the knowledge-based approach, which relies on traditional medicines to identify candidate plants and extracts. However, NP-based drug development comes with many limitations during the screening stage. First, NP extracts are mostly incompatible with target-based or high-throughput screening. Furthermore, in the case of phenotypic assays, the deconvolution of the mechanism of action of the compound is costly and time-consuming. This study proposes a novel machine learning framework for the high-throughput identification and characterization of plant-derived NPs. This framework consists of two independent models. The first model is a neural network designed to predict phytochemicals’ bioactivities through multi-label classification in four categories: antioxidant activity, anti-inflammatory, neurotoxicity, and lipid metabolism. The second model is a convolutional neural network (CNN) that predicts the Enzyme Commission (EC) numbers of enzymes present in the plant. The proposed framework showed robust performance with the Bioactivity Prediction Model achieving 97.62% accuracy and the EC-number Prediction Model achieving 81.97% accuracy. The framework facilitates a more efficient NP-based drug development by providing important insights applicable to the screening, isolation, and deconvolution of NPs.
Downloads
References or Bibliography
A. G. Atanasov et al., “Discovery and resupply of pharmacologically active plant-derived natural products: A review,” Biotechnology Advances, vol. 33, no. 8, pp. 1582–1614, Dec. 2015, doi: 10.1016/j.biotechadv.2015.08.001.
A. G. Atanasov, S. B. Zotchev, V. M. Dirsch, and C. T. Supuran, “Natural products in drug discovery: advances and opportunities,” Nature Reviews Drug Discovery, vol. 20, no. 3, pp. 200–216, Jan. 2021, doi: 10.1038/s41573-020-00114-z.
AI Hub. (2024, Aug 21). “Plant functionality prediction genomic data”: AI Hub. https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=data&dataSetSn=71316
Bano, Asghari, et al. “Bioactive Metabolites of Plants and Microbes and Their Role in Agricultural Sustainability and Mitigation of Plant Stress.” South African Journal of Botany, vol. 159, Aug. 2023, pp. 98–109, doi:10.1016/j.sajb.2023.05.049.
BRENDA, “All enzymes,” BRENDA Enzyme Database. Accessed: Sep. 22, 2024. [Online]. Available: https://www.brenda-enzymes.org/all_enzymes.php
Bruce, Stella Omokhefe. “Secondary Metabolites from Natural Products.” IntechOpen, 16 Feb. 2022, https://www.intechopen.com/chapters/80477.
D. J. Newman and G. M. Cragg, “Natural Products as Sources of New Drugs over the Nearly Four Decades from 01/1981 to 09/2019,” Journal of Natural Products, vol. 83, no. 3, pp. 770–803, Mar. 2020, doi: 10.1021/acs.jnatprod.9b01285.
E. Patridge, P. Gareiss, M. S. Kinch, and D. Hoyer, “An analysis of FDA-approved drugs: natural products and their derivatives,” Drug Discovery Today, vol. 21, no. 2, pp. 204–207, Feb. 2016, doi: 10.1016/j.drudis.2015.01.009.
IBM, “Recurrent neural network (RNN),” IBM. Accessed: Sep. 22, 2024. [Online]. Available: https://www.ibm.com/topics/recurrent-neural-networks
Licciardi, P. V., & Underwood, J. R. (2011). Plant-derived medicines: a novel class of immunological adjuvants. International immunopharmacology, 11(3), 390-398.
Mushtaq, S., Abbasi, B. H., Uzair, B., & Abbasi, R. (2018). Natural products as reservoirs of novel therapeutic agents. EXCLI journal, 17, 420.
Nasim, N., Sandeep, I. S., & Mohanty, S. (2022). Plant-derived natural products for drug discovery: current approaches and prospects. The Nucleus, 65(3), 399-411.
Nielsen, J. (2022). Bioactive metabolites: The double-edged sword in your food. Cell, 185(24), 4469-4471.
Petrovska, B. B. (2012). Historical review of medicinal plants’ usage. Pharmacognosy reviews, 6(11), 1.
Toropov, A. A., Toropova, A. P., Mukhamedzhanoval, D. V., & Gutman, I. (2005). Simplified molecular input line entry system (SMILES) as an alternative for constructing quantitative structure-property relationships (QSPR).
T. W. Corson and C. M. Crews, “Molecular Understanding and Modern Application of Traditional Medicines: Triumphs and Trials,” Cell, vol. 130, no. 5, pp. 769–774, Sep. 2007, doi: 10.1016/j.cell.2007.08.021.
Published
How to Cite
Issue
Section
Copyright (c) 2025 Leoni Kim, Ryan Oh; Christopher Koester

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.


