Machine Learning-Based Prediction of Biological Activity in Natural Product Phytochemicals

Authors

  • Jean Lee Sunny Hills High School
  • Sue Jung Kim

DOI:

https://doi.org/10.47611/jsrhs.v14i1.8485

Keywords:

Biological Activity, Machine Learning, Molecular Structural Analysis

Abstract

Natural product phytochemicals hold significant promise for drug discovery due to their diverse biological activities and potential to offer novel therapeutic solutions. These compounds are derived from plants, fungi, and other natural sources, providing a rich source of chemical diversity that is often not found in synthetic libraries. Their complex structures and unique mechanisms of action can lead to the development of drugs with new and innovative therapeutic properties. However, discovering and developing new drugs from these compounds presents challenges due to the reliance on knowledge-based methods, which can be time-consuming and inefficient. To address this issue, we propose a novel bioactivity classification network designed to predict the biological activities of phytochemical samples. This study utilized a dataset categorized into four groups: Antioxidant, Toxicity, Anti-inflammatory and Immune, and Lipid Metabolism. The performance of the network was assessed using key metrics including Accuracy, Recall, Precision, and F1-Score. The results revealed that a model with a depth of 4 layers achieved the highest performance. The proposed network achieved an accuracy of 0.7926 and an F1-Score of 0.7248 on the public dataset.

Downloads

Download data is not yet available.

References or Bibliography

Achary, P. G. R., Toropova, A. P., & Toropov, A. A. (2019). Combinations of graph invariants and attributes of simplified molecular input-line entry system (SMILES) to build up models for sweetness. Food Research International, 122, 40-46.

Ben-Shabat, S., Yarmolinsky, L., Porat, D., & Dahan, A. (2020). Antiviral effect of phytochemicals from medicinal plants: Applications and drug delivery strategies. Drug delivery and translational research, 10, 354-367.

Cereto-Massagué, A., Ojeda, M. J., Valls, C., Mulero, M., Garcia-Vallvé, S., & Pujadas, G. (2015). Molecular

fingerprint similarity search in virtual screening. Methods, 71, 58-63.

Hossin, M., & Sulaiman, M. N. (2015). A review on evaluation metrics for data classification evaluations. International journal of data mining & knowledge management process, 5(2), 1.

Rao, V. S., & Srinivas, K. (2011). Modern drug discovery process: An in silico approach. Journal of bioinformatics and sequence analysis, 2(5), 89-94.

Yadav, R. N. S., & Agarwala, M. (2011). Phytochemical analysis of some medicinal plants. Journal of phytology, 3(12).

Published

02-28-2025

How to Cite

Lee, J., & Kim, S. J. (2025). Machine Learning-Based Prediction of Biological Activity in Natural Product Phytochemicals. Journal of Student Research, 14(1). https://doi.org/10.47611/jsrhs.v14i1.8485

Issue

Section

HS Research Articles