Preprint / Version 1

Machine Learning Analysis of Gun Violence

##article.authors##

  • Aakash Akkineni School for the Talented and Gifted

Keywords:

Machine Learning, Gun Violence

Abstract

In the U.S, gun violence is a substantial and divisive issue that legislators have spent decades trying to solve by passing various laws, restrictions, and bans using data analysis and research to support their ideas. This paper aims to use machine learning models such as logistic regression, random forest classifiers, gradient boosting classifiers, (Support Vector Machine) SVC classifiers, and Multi-layer Perceptron (MLP) classifiers to analyze different pieces of shooting incidents. Using the feature weights of the best-performing model, we aim to find which factors have the largest effect on shooting incident lethality - whether anybody died or not during the incident. Based on this, lawmakers can focus on the factors that most contribute to lethality. The first models in the paper aim to predict whether a shooting was lethal or not based on the age and gender of the shooter(s), state poverty rate, percentage of people in that state with only a high school education rate, shooting location, year, etc. The other models aim to predict the shooter’s gender, the state that the shooting was in, and the political trifecta of that state based on similar factors. The precision, recall, and f-score of these models, as well as the feature weights of the random forest classifiers and gradient boosting classifiers, are reported. We used tree-based models to analyze the feature weights because of their high performance and because of Sk-learn’s limitations in finding feature weights for other model types. Additionally, we include an analysis and possible reasoning for our results.

References or Bibliography

Stark, David E., and Nigam H. Shah. “Funding and Publication of Research on Gun Violence

and Other Leading Causes of Death.” JAMA, vol. 317, no. 1, Jan. 2017, pp. 84–85. Silverchair,

https://doi.org/10.1001/jama.2016.16215.

Ghoneim, Salma. “Accuracy, Recall, Precision, F-Score amp; Specificity, Which to Optimize on?”

Medium, Towards Data Science, 8 Apr. 2019, https://towardsdatascience.com/accuracy-recall-precisionf-score-specificity-which-to-optimize-on-867d3f11124.

Firearm Violence Prevention —Violence Prevention—Injury Center—CDC. 5 May 2021, https://

www.cdc.gov/

Amid a Series of Mass Shootings in the U.S., Gun Policy Remains Deeply Divisive.” Pew Re-search

Center - U.S. Politics Policy, 20 Apr. 2021, https://www.pewresearch.org/politics/2021/04/20/amid-aseries-of-mass-shootings-in-the-u-s-gun-policy-remains-deeply-divisive/.

Gun Violence Data. https://kaggle.com/jameslko/gun-violence-data.

“Poverty.” USDA ERS - Data Products, https://data.ers.usda.gov/reports.aspx?ID=17826.

“Education.” USDA ERS - Data Products, https://data.ers.usda.gov/reports.aspx?ID=17829.

“State Government Trifectas.” Ballotpedia, https://ballotpedia.org/State government trifectas. Accessed 19 Sept. 2021.

Swalin, Alvira. “How to Make Your Machine Learning Models Robust to Outliers.” Medium,

Heartbeat, 24 Sept. 2021, https://heartbeat.comet.ml/how-to-make-your-machine-learning-modelsrobust-to-outliers-44d404067d07.

Downloads

Posted

10-26-2021