Applying Machine Learning Techniques to Mitigate Impact of COVID-19 Pandemic


  • Sidarth Krishna Acton-Boxborough Regional High School
  • Rajagopal Appavu
  • Jothsna Kethar



Machine Learning, R, Programming, High School, COVID-19, Data Science, Data Analysis, Regression


Since March 2020, COVID-19 has played a very influential role in our lives. Totaling over 300 million cases and 5.5 million deaths worldwide it has been one of the most transmittal viruses humans have seen in recent generations. Even after the mass distribution of vaccines, COVID-19 shows no signs of stopping. This is because many communities that are especially struggling during this time period have not been identified and are not being helped adequately enough. By better understanding how different factors in communities such as ethnic percentages, poverty rates and much more can help us determine which communities need to be addressed to slow the spread of COVID-19. To identify the most significant of these demographic factors an in depth data analysis using machine learning models and regression analysis were carried out on various datasets. The results highlighted that for COVID-19 cases the most influential factor was Population Density. For deaths, the most significant factors were poverty rates in communities as well as education level. From this analysis and results, in order to mitigate the impact of the COVID-19 pandemic in the future it is of utmost importance to address the needs of underprivileged communities by providing access to low cost and high quality medical resources for all.


Download data is not yet available.

References or Bibliography

US Department of Agriculture Data. (n.d.). Retrieved from

Ford, T. N., Reber, S., & Reeves, R. V. (2020, June 17). Race gaps in COVID-19 deaths are even bigger than they appear. Retrieved from


Gao, Y., Ding, M., Dong, X., Zhang, J., Azkur, A. K., Azkur, D., . . . Akdis, C. A. (2020, December 04). Risk factors for severe and critically ill COVID 19 patients: A review. Retrieved from

Ludvigsson, Jonas. (2020, June). Systematic review of COVID-19 in children shows milder cases and a better prognosis than adults. Retrieved from

Kopel, J., Perisetti, A., Roghani, A., Aziz, M., Gajendran, M., & Goyal, H. (2020, July 28). Racial and Gender-Based Differences in COVID-19. Retrieved from

Magesh, S. (2021, November 11). Disparities in COVID-19 Outcomes by Race, Ethnicity, and Socioeconomic Status. Retrieved from

Monita Karmakar, P. (2021, January 29). Association of Sociodemographic Factors With COVID-19 Incidence and Death Rates in the US. Retrieved from

O’Driscoll, M., Ribeiro Dos Santos, G., Wang, L., Cummings, D. A., Azman, A. S., Paireau, J., . . . Salje, H. (2020, November 02). Age-specific mortality and immunity patterns of SARS-CoV-2. Retrieved from

Roy, S., & Ghosh, P. (n.d.). Factors affecting COVID-19 infected and death rates inform lockdown-related policymaking. Retrieved from

World Health Organization. (n.d.). WHO Coronavirus (COVID-19) Dashboard. World Health Organization. Retrieved from

Monod, Melodie, Blenkinsop, Alexandra, Xi, X., Hebert, D., Bershan, S., Tietze, S., . . . Ratmann, Oliver*, (2021, February 02). Age groups that sustain resurging COVID-19 epidemics in the United States. Retrieved from

T.M. Therneau. A short introduction to recursive partitioning. Orion Technical Report 21, Stanford University, Department of Statistics, 1983. Retrieved from

T.M Therneau and E.J Atkinson. An introduction to recursive partitioning using the rpart routines. Division of Biostatistics 61, Mayo Clinic, 1997.Retrieved from


Link, B. G., and Phelan, J. Social conditions as fundamental causes of disease. J. Health Sociol. Behav. 80–94. doi: 10.2307/2626958, 1995. Retrieved from



How to Cite

Krishna, S., Appavu, R., & Kethar, J. (2022). Applying Machine Learning Techniques to Mitigate Impact of COVID-19 Pandemic. Journal of Student Research, 11(2).



HS Research Projects