TY - JOUR
T1 - A prediction scheme of tropical cyclone frequency based on lasso and random forest
AU - Tan, Jinkai
AU - Liu, Hexiang
AU - Li, Mengya
AU - Wang, Jun
N1 - Publisher Copyright:
© 2017, Springer-Verlag GmbH Austria.
PY - 2018/8/1
Y1 - 2018/8/1
N2 - This study aims to propose a novel prediction scheme of tropical cyclone frequency (TCF) over the Western North Pacific (WNP). We concerned the large-scale meteorological factors inclusive of the sea surface temperature, sea level pressure, the Niño-3.4 index, the wind shear, the vorticity, the subtropical high, and the sea ice cover, since the chronic change of these factors in the context of climate change would cause a gradual variation of the annual TCF. Specifically, we focus on the correlation between the year-to-year increment of these factors and TCF. The least absolute shrinkage and selection operator (Lasso) method was used for variable selection and dimension reduction from 11 initial predictors. Then, a prediction model based on random forest (RF) was established by using the training samples (1978–2011) for calibration and the testing samples (2012–2016) for validation. The RF model presents a major variation and trend of TCF in the period of calibration, and also fitted well with the observed TCF in the period of validation though there were some deviations. The leave-one-out cross validation of the model exhibited most of the predicted TCF are in consistence with the observed TCF with a high correlation coefficient. A comparison between results of the RF model and the multiple linear regression (MLR) model suggested the RF is more practical and capable of giving reliable results of TCF prediction over the WNP.
AB - This study aims to propose a novel prediction scheme of tropical cyclone frequency (TCF) over the Western North Pacific (WNP). We concerned the large-scale meteorological factors inclusive of the sea surface temperature, sea level pressure, the Niño-3.4 index, the wind shear, the vorticity, the subtropical high, and the sea ice cover, since the chronic change of these factors in the context of climate change would cause a gradual variation of the annual TCF. Specifically, we focus on the correlation between the year-to-year increment of these factors and TCF. The least absolute shrinkage and selection operator (Lasso) method was used for variable selection and dimension reduction from 11 initial predictors. Then, a prediction model based on random forest (RF) was established by using the training samples (1978–2011) for calibration and the testing samples (2012–2016) for validation. The RF model presents a major variation and trend of TCF in the period of calibration, and also fitted well with the observed TCF in the period of validation though there were some deviations. The leave-one-out cross validation of the model exhibited most of the predicted TCF are in consistence with the observed TCF with a high correlation coefficient. A comparison between results of the RF model and the multiple linear regression (MLR) model suggested the RF is more practical and capable of giving reliable results of TCF prediction over the WNP.
KW - Lasso
KW - Prediction
KW - Random Forest
KW - Tropical cyclone
KW - Year-to-year increment
UR - https://www.scopus.com/pages/publications/85026522691
U2 - 10.1007/s00704-017-2233-3
DO - 10.1007/s00704-017-2233-3
M3 - 文章
AN - SCOPUS:85026522691
SN - 0177-798X
VL - 133
SP - 973
EP - 983
JO - Theoretical and Applied Climatology
JF - Theoretical and Applied Climatology
IS - 3-4
ER -