Predicting Flood Streamflow with Auto Regressive Integrated Moving Average Models

Authors

  • Everest Yang High School Student
  • Yong Zhuang University of Massachusetts Boston

DOI:

https://doi.org/10.47611/jsrhs.v11i3.3072

Keywords:

Artificial Intelligence, Machine Learning, ARIMA, Time Series, Flood Prediction, Statistical Modeling

Abstract

Flooding is the most common natural disaster and continues to increase in frequency and intensity due to climate changes [7]. Currently, there is a lack of efficient tools to predict flooding. This research is aimed to create a Time Series Machine Learning (ML) program using Auto Regressive Moving Average (ARIMA) models to forecast streamflow, one of the most prominent factors in flood prediction. A streamflow dataset from the Ganges River, Bangladesh was used to plot several graphs of the river Log Volume to observe possible trends. Another plot was graphed to check and quantify how much the distribution of the stream volume changed over the course of 10 years using KL Divergence. The plot analyses and Partial Autocorrelation Function (PACF) and Autocorrelation Function (ACF) tests were used to help obtain the ARIMA parameters of (p, d, q) as (1, 1, 1). However, the forecasted streamflow of the ARIMA function was not accurate when compared with previously recorded data because of heavy seasonality. As a result, the final program was redesigned with Seasonal ARIMA (SARIMA) to account for the inaccuracy. The SARIMA model was used to forecast the streamflow of subsequent years and was close to the actual recorded data. Such accuracy indicates that this method can be a useful tool in navigating and preparing for floods.

Downloads

Download data is not yet available.

Author Biography

Yong Zhuang, University of Massachusetts Boston

Ph.D. in applied machine learning. Specializes in Deep learning, Spatio-temporal analysis, time series forecasting, and feature selection. 

References or Bibliography

Brewster, Signe. "Flickr tags could help predict floods." Science.org, 7 Mar. 2017, www.science.org/content/article/flickr-tags-could-help-predict-floods. Accessed 4 July 2022.

Brownlee, Jason. "A Gentle Introduction to SARIMA for Time Series Forecasting in Python." Machine Learning Mastery.com, 17 Aug. 2018, machinelearningmastery.com/sarima-for-time-series-forecasting-in-python/#:~:text=to%20use%20SARIMA.-,What%20is%20SARIMA%3F,data%20with%20a%20seasonal%20component. Accessed 4 Feb. 2022.

---. "How to Calculate the KL Divergence for Machine Learning." Machine Learning Mastery, 19 Oct. 2018, machinelearningmastery.com/divergence-between-probability-distributions/. Accessed 4 Feb. 2022.

---. "How to Create an ARIMA Model for Time Series Forecasting in Python" ["How to Create an ARIMA Model for Time Series Forecasting in Python"]. Machine Learning Mastery, edited by Jason Brownlee, 9 Jan. 2017, machinelearningmastery.com/arima-for-time-series-forecasting-with-python/. Accessed 21 Oct. 2021.

"CSV, Comma Separated Values (RFC 4180)." loc.gov, Library of Congress, 11 Feb. 2020, www.loc.gov/preservation/digital/formats/fdd/fdd000323.shtml. Accessed 4 July 2022.

"8.1 Stationarity and Differencing." Otexts, otexts.com/fpp2/stationarity.html. Accessed 4 Feb. 2022.

"Floods." World Health Organization, WHO, www.who.int/health-topics/floods#tab=tab_1. Accessed 21 Oct. 2021.

Hasnet, Saif, and Mike Ives. "Bangladesh Floods Cause Death and Destruction in Sylhet." The New York Times, New York Times Company, 24 June 2022, www.nytimes.com/2022/06/24/world/asia/sylhet-bangladesh-floods.html. Accessed 4 July 2022.

Masum, Mohammad. "Time Series Analysis: Identifying AR and MA using ACF and PACF Plots." Towards Data Science, 13 Aug. 2020, towardsdatascience.com/identifying-ar-and-ma-terms-using-acf-and-pacf-plots-in-time-series-forecasting-ccb9fd073db8. Accessed 4 Feb. 2022.

Peixeiro, Marco. "The Complete Guide to Time Series Analysis and Forecasting" ["The Complete Guide to Time Series Analysis and Forecasting"]. Towards Data Science, edited by Maxo Peixiero, 7 Aug. 2019, towardsdatascience.com/the-complete-guide-to-time-series-analysis-and-forecasting-70d476bfe775?gi=b05b1ecae15a. Accessed 21 Oct. 2021.

Prabhakaran, Selva. "ARIMA Model – Complete Guide to Time Series Forecasting in Python" ["ARIMA Model – Complete Guide to Time Series Forecasting in Python"]. Machine Learning +, edited by Selva Prabhakaran, 21 Aug. 2021, www.machinelearningplus.com/time-series/arima-model-time-series-forecasting-python/. Accessed 21 Oct. 2021.

---. "Augmented Dickey Fuller Test (ADF Test) – Must Read Guide" ["Augmented Dickey Fuller Test (ADF Test) – Must Read Guide"]. Machine Learning Plus, 2 Nov. 2019, www.machinelearningplus.com/time-series/augmented-dickey-fuller-test/. Accessed 9 Dec. 2021.

"Rivers and Flooding Case Study: Bangladesh." BBC, www.bbc.co.uk/bitesize/guides/zgycwmn/revision/4#:~:text=Causes%20of%20flooding%20in%20Bangladesh&text=Melt%20water%20from%20the%20Himalayas,Increasing%20urban%20areas. Accessed 4 July 2022.

Saeed, Mehreen. "Kernel Density Estimation in Python Using Scikit-Learn." StackAbuse.com, stackabuse.com/kernel-density-estimation-in-python-using-scikit-learn/. Accessed 4 Feb. 2022.

"Streamflow and the Water Cycle." The U.S. Geological Survey (USGS), Water Science School, 12 June 2019, www.usgs.gov/special-topics/water-science-school/science/streamflow-and-water-cycle. Accessed 4 July 2022.

"What is Machine Learning?" Micro Forum, www.microfocus.com/en-us/what-is/machine-learning. Accessed 4 July 2022.

"Why You Should be Using Jupyter Notebooks." odsc.medium, ODSC - Open Data Science, 15 July 2020, odsc.medium.com/why-you-should-be-using-jupyter-notebooks-ea2e568c59f2#:~:text=The%20Jupyter%20Notebook%20is%20an,text%20in%20a%20single%20document. Accessed 4 July 2022.

Yiu, Tony. "Understanding SARIMA (More Time Series Modeling)." Towards Datascience.com, 30 Apr. 2020, towardsdatascience.com/understanding-sarima-955fe217bc77. Accessed 4 Feb. 2022.

Published

08-31-2022

How to Cite

Yang, E., & Zhuang, Y. (2022). Predicting Flood Streamflow with Auto Regressive Integrated Moving Average Models. Journal of Student Research, 11(3). https://doi.org/10.47611/jsrhs.v11i3.3072

Issue

Section

HS Research Projects