Multi-granularity heterogeneous graph attention networks for extractive document summarization.

Journal: Neural networks : the official journal of the International Neural Network Society
Published Date:

Abstract

Extractive document summarization is a fundamental task in natural language processing (NLP). Recently, several Graph Neural Networks (GNNs) are proposed for this task. However, most existing GNN-based models can neither effectively encode semantic nodes of multiple granularity level apart from sentences nor substantially capture different cross-sentence meta-paths. To address these issues, we propose MHgatSum, a novel Multi-granularity Heterogeneous Graph ATtention networks for extractive document SUMmarization. Specifically, we first build a multi-granularity heterogeneous graph (HetG) for each document, which is better to represent the semantic meaning of the document. The HetG contains not only sentence nodes but also multiple other granularity effective semantic units with different semantic levels, including keyphrases and topics. These additional nodes act as the intermediary between sentences to build the meta-paths involved in sentence node (i.e., Sentence-Keyphrase-Sentence and Sentence-Topic-Sentence). Then, we propose a heterogeneous graph attention networks to embed the constructed HetG for extractive summarization, which enjoys multi-granularity semantic representations. The model is based on a hierarchical attention mechanism, including node-level and semantic-level attentions. The node-level attention can learn the importance between a node and its meta-path based neighbors, while the semantic-level attention is able to learn the importance of different meta-paths. Moreover, to better integrate sentence global knowledge, we further incorporate sentence node global importance in local node-level attention. We conduct empirical experiments on two benchmark datasets, which demonstrates the superiority of MHgatSum over previous SOTA models on the task of extractive summarization.

Authors

  • Yu Zhao
    College of Agriculture, Shanxi Agricultural University, Taigu, Shanxi, China.
  • Leilei Wang
    Key Laboratory of Modern Power System Simulation and Control & Renewable Energy Technology, Ministry of Education (Northeast Electric Power University), Jilin City, Jilin, China.
  • Cui Wang
    Fintech Innovation Center, Financial Intelligence and Financial Engineering Key Laboratory, Southwestern University of Finance and Economics (SWUFE), Chengdu, 611130, China.
  • Huaming Du
    School of Business Administration, Faculty of Business Administration, SWUFE, China.
  • Shaopeng Wei
    State Key Laboratory for Zoonotic Diseases, College of Veterinary Medicine, Jilin University, Changchun, China.
  • Huali Feng
    Fintech Innovation Center, Financial Intelligence and Financial Engineering Key Laboratory, Southwestern University of Finance and Economics (SWUFE), Chengdu, 611130, China.
  • Zongjian Yu
    Fintech Innovation Center, Financial Intelligence and Financial Engineering Key Laboratory, Southwestern University of Finance and Economics (SWUFE), Chengdu, 611130, China. Electronic address: yuzj@swufe.edu.cn.
  • Qing Li
    Department of Internal Medicine, University of Michigan Ann Arbor, MI 48109, USA.