Application of Transformer Models to Landslide Susceptibility Mapping.

Journal: Sensors (Basel, Switzerland)
Published Date:

Abstract

Landslide susceptibility mapping (LSM) is of great significance for the identification and prevention of geological hazards. LSM is based on convolutional neural networks (CNNs); CNNs use fixed convolutional kernels, focus more on local information and do not retain spatial information. This is a property of the CNN itself, resulting in low accuracy of LSM. Based on the above problems, we use Vision Transformer (ViT) and its derivative model Swin Transformer (Swin) to conduct LSM for the selected study area. Machine learning and a CNN model are used for comparison. Fourier transform amplitude, feature similarity and other indicators were used to compare and analyze the difference in the results. The results show that the Swin model has the best accuracy, F1-score and AUC. The results of LSM are combined with landslide points, faults and other data analysis; the ViT model results are the most consistent with the actual situation, showing the strongest generalization ability. In this paper, we believe that the advantages of ViT and its derived models in global feature extraction ensure that ViT is more accurate than CNN and machine learning in predicting landslide probability in the study area.

Authors

  • Shuai Bao
    School of Geomatics, Liaoning Technical University, Fuxin 123000, China.
  • Jiping Liu
    Chinese Academy of Surveying and Mapping, Beijing 100036, China.
  • Liang Wang
    Information Department, Dazhou Central Hospital, Dazhou 635000, China.
  • Xizhi Zhao
    Chinese Academy of Surveying and Mapping, Beijing 100036, China.