Applying different resampling strategies in machine learning models to predict head-cut gully erosion susceptibility

Fengjie Wang, Mehebub Sahana, Bahareh Pahlevanzadeh, Subodh Chandra Pal, Pravat Kumar Shit, Md Jalil Piran, Saeid Janizadeh, Shahab S. Band, Amir Mosavi

Research output: Contribution to journalArticlepeer-review

37 Citations (Scopus)

Abstract

Gully erosion is one of the advanced forms of water erosion. Identifying the effective factors and gully erosion predicting is one of the important tools to control and manage such phenomenon. The main purpose of this study is to evaluate the effect of four different resampling algorithms including cross-validation (5-fold and 10-fold) and bootstrapping (Bootstrap and Optimism bootstrap) on boosted regression tree (BRT), support vector machine (SVM), and random forest (RF) models in spatial modeling and evaluation of head-cut gully erosion in Konduran watershed. For this purpose, based on an extensive field survey, the points of the head-cut of the gully erosion were identified first, and a map of the distribution of head-cut gully erosion in the study area was prepared. Then 18 variable identify and prepare as factors affecting the occurrence of head-cut gully erosion. To assess the efficiency of the models, receiver operating characteristics (ROC) and area under the curve (AUC) were used. <Through the assessment result we indicate that…>The results of the assessment indicated that the use of resampling algorithms increases the efficiency of the models. The integrated optimism-bootstrap-BRT, optimism-bootstrap-SVM, and Optimism-Bootstrap-RF models with AUC 0.85, 0.823 and 0.89 respectively, outperformed the cross-validation 5fold (BRT, SVM, RF), Cross-validation 10fold (BRT, SVM, RF) and Bootstrap (BRT, SVM, RF) integrated algorithms.

Original languageEnglish
Pages (from-to)5813-5829
Number of pages17
JournalAlexandria Engineering Journal
Volume60
Issue number6
DOIs
Publication statusPublished - Dec 2021
Externally publishedYes

Keywords

  • Boosted regression tree
  • Bootstrap
  • Head-cut gully erosion
  • K-fold cross validation
  • Machine learning
  • Random forest
  • Resampling algorithms
  • Support vector machine

Fingerprint

Dive into the research topics of 'Applying different resampling strategies in machine learning models to predict head-cut gully erosion susceptibility'. Together they form a unique fingerprint.

Cite this