Understanding the spatial dimension of natural language by measuring the spatial semantic similarity of words through a scalable geospatial context window.

Journal: PloS one
Published Date:

Abstract

Measuring the semantic similarity between words is important for natural language processing tasks. The traditional models of semantic similarity perform well in most cases, but when dealing with words that involve geographical context, spatial semantics of implied spatial information are rarely preserved. Geographic information retrieval (GIR) methods have focused on this issue; however, they sometimes fail to solve the problem because the spatial and textual similarities of words are considered and calculated separately. In this paper, from the perspective of spatial context, we consider the two parts as a whole-spatial context semantics, and we propose a method that measures spatial semantic similarity using a sliding geospatial context window for geo-tagged words. The proposed method was first validated with a set of simulated data and then applied to a real-world dataset from Flickr. As a result, a spatial semantic similarity model at different scales is presented. We believe this model is a necessary supplement for traditional textual-language semantic analyses of words obtained by word-embedding technologies. This study has the potential to improve the quality of recommendation systems by considering relevant spatial context semantics, and benefits linguistic semantic research by emphasising the spatial cognition among words.

Authors

  • Bozhi Wang
    School of Resource and Environmental Sciences, Wuhan University, Wuhan, China.
  • Teng Fei
    School of Resource and Environmental Sciences, Wuhan University, Wuhan, China.
  • Yuhao Kang
    Geospatial Data Science Lab, Department of Geography, University of Wisconsin, Madison, WI, United States of America.
  • Meng Li
    Co-Innovation Center for the Sustainable Forestry in Southern China; Cerasus Research Center; College of Biology and the Environment, Nanjing Forestry University, Nanjing, China.
  • Qingyun Du
    School of Resources and Environmental Science, Wuhan University, Wuhan 430079, China. qydu@whu.edu.cn.
  • Meng Han
    School of Economics, Ocean University of China, Qingdao, China.
  • Ning Dong
    Jinling Clinical Medical College, Nanjing Medical University,Nanjing,Jiangsu 210002,China.