Improving SDG Classification Precision Using Combinatorial Fusion.

Journal: Sensors (Basel, Switzerland)
PMID:

Abstract

Combinatorial fusion algorithm (CFA) is a machine learning and artificial intelligence (ML/AI) framework for combining multiple scoring systems using the rank-score characteristic (RSC) function and cognitive diversity (CD). When measuring the relevance of a publication or document with respect to the 17 Sustainable Development Goals (SDGs) of the United Nations, a classification scheme is used. However, this classification process is a challenging task due to the overlapping goals and contextual differences of those diverse SDGs. In this paper, we use CFA to combine a topic model classifier (Model A) and a semantic link classifier (Model B) to improve the precision of the classification process. We characterize and analyze each of the individual models using the RSC function and CD between Models A and B. We evaluate the classification results from combining the models using a score combination and a rank combination, when compared to the results obtained from human experts. In summary, we demonstrate that the combination of Models A and B can improve classification precision only if these individual models perform well and are diverse.

Authors

  • D Frank Hsu
    Laboratory of Informatics and Data Mining (LIDM), Department of Computer and Information Science, Fordham University, 113 West 60th Street, New York, New York 10023, United States.
  • Marcelo T LaFleur
    Department of Economic and Social Affairs, United Nations, New York, NY 10017, USA.
  • Ilyas Orazbek
    Laboratory of Informatics and Data Mining, Department of Computer and Information Science, Fordham University, New York, NY 10023, USA.