More refined superbag: Distantly supervised relation extraction with deep clustering.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Distant supervision (DS) can automatically generate annotated data for relation extraction (RE) with knowledge bases and corpora. The existing DS methods that train on bags selected by attention mechanism are susceptible to noisy bags and neglect useful information in noisy bags. In this paper, we propose DCSR, a novel DS method which utilizes deep clustering to obtain refined superbag representations for solving the wrong labeling problem. we substitute deep clustering for selective attention to construct superbags, capturing helpful information between spatially-close bags, including noisy bags. Moreover, we implement data augmentation on the input sentences to handle the long-tail problem. Experiments on the NYT2010 and NYT-H datasets show that our method can effectively improve RE and significantly outperforms state-of-the-art methods.

Authors

  • Suizhu Yang
    School of Software Engineering, South China University of Technology, China.
  • Yanxia Liu
    Key Laboratory of Big Data and Intelligent Robot (South China University of Technology), Ministry of Education,; School of Software Engineering, South China University of Technology, Guangzhou, China. Electronic address: cslyx@scut.edu.cn.
  • Yuantong Jiang
    School of Software Engineering, South China University of Technology, China.
  • Zhiqiang Liu
    Shenzhen Key Laboratory of Reproductive Immunology for Peri-implantation, Shenzhen Zhongshan Institute for Reproductive Medicine and Genetics, Shenzhen, China.