Semi-Supervised Text Classification Framework: An Overview of Dengue Landscape Factors and Satellite Earth Observation.

Journal: International journal of environmental research and public health
PMID:

Abstract

In recent years there has been an increasing use of satellite Earth observation (EO) data in dengue research, in particular the identification of landscape factors affecting dengue transmission. Summarizing landscape factors and satellite EO data sources, and making the information public are helpful for guiding future research and improving health decision-making. In this case, a review of the literature would appear to be an appropriate tool. However, this is not an easy-to-use tool. The review process mainly includes defining the topic, searching, screening at both title/abstract and full-text levels and data extraction that needs consistent knowledge from experts and is time-consuming and labor intensive. In this context, this study integrates the review process, text scoring, active learning (AL) mechanism, and bidirectional long short-term memory (BiLSTM) networks, and proposes a semi-supervised text classification framework that enables the efficient and accurate selection of the relevant articles. Specifically, text scoring and BiLSTM-based active learning were used to replace the title/abstract screening and full-text screening, respectively, which greatly reduces the human workload. In this study, 101 relevant articles were selected from 4 bibliographic databases, and a catalogue of essential dengue landscape factors was identified and divided into four categories: land use (LU), land cover (LC), topography and continuous land surface features. Moreover, various satellite EO sensors and products used for identifying landscape factors were tabulated. Finally, possible future directions of applying satellite EO data in dengue research in terms of landscape patterns, satellite sensors and deep learning were proposed. The proposed semi-supervised text classification framework was successfully applied in research evidence synthesis that could be easily applied to other topics, particularly in an interdisciplinary context.

Authors

  • Zhichao Li
    School of Political Science and Public Administration, East China University of Political Science and Law, Shanghai 201620, China. 2863@ecupl.edu.cn.
  • Helen Gurgel
    Department of Geography, University of Brasilia (UnB), Brasilia CEP 70910-900, Brazil.
  • Nadine Dessay
    International Joint Laboratory Sentinela, FIOCRUZ, UnB, IRD, Rio de Janeiro RJ-21040-900, Brazil.
  • Luojia Hu
    Qian Xuesen Laboratory of Space Technology, China Academy of Space Technology, Beijing 100094, China.
  • Lei Xu
    Key Laboratory of Biomedical Information Engineering of the Ministry of Education, Department of Biomedical Engineering, School of Life Science and Technology, Xi'an Jiaotong University, Xi'an, China.
  • Peng Gong
    Ministry of Education Key Laboratory for Earth System Modeling, Department of Earth System, Science, Tsinghua University, Beijing 100084, China.