Overcoming data limitation challenges in predicting tropical storm surge with interpretable machine learning methods

Stanton, Carly

dc.contributor.advisor	King, Scott
dc.contributor.author	Stanton, Carly
dc.contributor.committeeMember	Tissot, Philippe
dc.contributor.committeeMember	Wang, Wenlu
dc.creator.orcid	https://orcid.org/0009-0003-5257-4655
dc.date.accessioned	2023-10-24T20:51:18Z
dc.date.available	2023-10-24T20:51:18Z
dc.date.issued	2023-08
dc.description	A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Computer Science	en_US
dc.description.abstract	The impacts of climate change have increased the risk of storm surge flooding in coastal areas. Tropical islands are especially vulnerable to the effects of sea level rise and the increase in frequency and intensity of tropical cyclones (TCs). Typically, storm surge prediction is performed using a combination of numerical forecasting models, synoptic forecasting, and statistical methods. Machine learning techniques, particularly convolutional neural networks (CNNs), have shown promise in accurately predicting storm surge levels in the short term. However, deep learning methods are computationally expensive and require large amounts of training data. Often, researchers must train neural network models on synthetic data generated by numerical models. The goal of this work is to study the effectiveness of simpler, interpretable models, including random forest (RF) regression, multiple linear regression (MLR), and support vector machine regression (SVR), to predict storm surge in San Juan Bay, Puerto Rico using limited local meteorological and tidal data and hurricane reanalysis data from actual storm events over the last few decades. These algorithms were used to predict surge at five lead times ranging from 1 to 24 hours and were trained on three different feature sets with two different types of training data windows. Models were trained using a leave-one-out cross-validation (LOOCV) approach, in which the data for one TC were held out as the validation dataset for each model. The performance of the models and different training methods was compared in terms of root mean square error (RMSE), normalized RMSE, and error at peak surge.
It was found that an RF model trained on data from only eight TCs predicted the peak surge of Hurricane Irma to within about 0.03 m and the time of peak surge to within three hours at lead times of up to 12 hours, provided that one extreme TC event, in this case Hurricane Maria, was included in the training data. However, all models failed to accurately predict surge for Hurricane Maria, even when other high-surge storms were included in the training data. Other training methods achieved lower RMSE when validated against a peak surge window spanning 12 hours before to 12 hours after peak surge, but could not approach the accuracy of the RF model at predicting the time of peak surge.	en_US
dc.description.college	College of Engineering and Computer Science	en_US
dc.description.department	Computer Science	en_US
dc.format.extent	90 pages	en_US
dc.identifier.uri	https://hdl.handle.net/1969.6/97611
dc.language.iso	en_US	en_US
dc.rights.uri	https://creativecommons.org/licenses/by-nd/4.0/deed.en	*
dc.subject	machine learning	en_US
dc.subject	predictive analytics	en_US
dc.subject	random forests	en_US
dc.subject	storm surge	en_US
dc.subject	tropical cyclone	en_US
dc.title	Overcoming data limitation challenges in predicting tropical storm surge with interpretable machine learning methods	en_US
dc.type	Text	en_US
dc.type.genre	Thesis	en_US
thesis.degree.discipline	Computer Science	en_US
thesis.degree.grantor	Texas A & M University--Corpus Christi	en_US
thesis.degree.level	Masters	en_US
thesis.degree.name	Master of Science	en_US
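
The leave-one-out validation scheme and the error metrics named in the abstract can be illustrated with a minimal Python sketch. It is not taken from the thesis. The function names, the placeholder storm labels, the range-based normalization of RMSE, and the reading of "error at peak surge" as the prediction error at the time of the observed peak are all assumptions made for illustration, and a training-set mean stands in for the RF, MLR, and SVR models.

```python
import math
from typing import Iterator, List, Sequence, Tuple


def leave_one_storm_out(
    storm_ids: Sequence[str],
) -> Iterator[Tuple[str, List[int], List[int]]]:
    """Yield (held_out_storm, train_indices, validation_indices) once per storm.

    Every sample belonging to the held-out storm goes to the validation set,
    so no storm contributes to both training and validation in a given fold.
    """
    for held_out in sorted(set(storm_ids)):
        train = [i for i, s in enumerate(storm_ids) if s != held_out]
        valid = [i for i, s in enumerate(storm_ids) if s == held_out]
        yield held_out, train, valid


def rmse(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """Root mean square error between two equal-length series."""
    if len(observed) != len(predicted) or len(observed) == 0:
        raise ValueError("series must be non-empty and of equal length")
    squared = [(o - p) ** 2 for o, p in zip(observed, predicted)]
    return math.sqrt(sum(squared) / len(squared))


def normalized_rmse(observed: Sequence[float], predicted: Sequence[float]) -> float:
    """RMSE divided by the observed range (assumed normalization)."""
    span = max(observed) - min(observed)
    if span == 0:
        raise ValueError("observed series has zero range")
    return rmse(observed, predicted) / span


def error_at_observed_peak(
    observed: Sequence[float], predicted: Sequence[float]
) -> float:
    """Predicted minus observed surge at the time step of the observed peak
    (assumed definition of 'error at peak surge')."""
    peak_index = max(range(len(observed)), key=lambda i: observed[i])
    return predicted[peak_index] - observed[peak_index]


if __name__ == "__main__":
    # Hypothetical toy data: two surge samples (in metres) per placeholder storm.
    storms = ["storm_a", "storm_a", "storm_b", "storm_b", "storm_c", "storm_c"]
    surge = [0.10, 0.30, 0.20, 0.60, 0.15, 0.25]
    for held_out, train_idx, valid_idx in leave_one_storm_out(storms):
        # Stand-in model: predict the mean surge of the training storms.
        baseline = sum(surge[i] for i in train_idx) / len(train_idx)
        observed = [surge[i] for i in valid_idx]
        predicted = [baseline] * len(valid_idx)
        print(held_out, round(rmse(observed, predicted), 3))
```

Splitting by storm rather than by individual sample keeps highly correlated hourly observations from the same event out of both sides of a fold, which is the point of holding out one TC at a time.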

Files

Original bundle

Name: Stanton_Carly_Thesis.pdf
Size: 12.54 MB
Format: Adobe Portable Document Format

Collections

Theses