Preprint / Version 1

Using Machine Learning to Predict Stroke Risk


  • Arnav Goel, South Brunswick High School


Keywords: Stroke, Patient, Prediction, Model, Regression, Machine Learning, Logistic


Many factors are believed to cause strokes, but those that actually increase the risk of having a stroke can be identified using logistic regression and machine learning. Knowing these factors offers more insight into stroke prevention.
