A Novel Keyword Generation Model Based on Topic-Aware and Title-Guide.

Journal: Computational intelligence and neuroscience
Published Date:

Abstract

Keywords are usually one or more words or phrases that describe the subject information of the document. The traditional automatic keywords extraction methods cannot obtain the keywords which do not appear in the document, and the semantic information is not considered in the extraction process. In this paper, we introduce a novel Keyword Generation Model based on Topic-aware and Title-guide (KGM-TT). In the KGM-TT, the neural topic model is used to identify the latent topic words, and a hierarchical encoder technology with attention mechanism is able to encode the title and its content, respectively. The keywords are generated by the recurrent neural network with attention and replication mechanism in our model. This model can not only generate the keywords which do not appeared in the source document but also use the topic information and the highly summative word meaning in the title to assist the generation of keywords. The experimental results show that the 1 value of this model is about 10% higher than that of CopyRNN and CopyCNN.

Authors

  • Jialin Ma
    Department of Intensive Care Unit, China-Japan Union Hospital of Jilin University, Changchun, China.
  • Jieyi Cheng
    College of Computer Science, Huaiyin Institute of Technology, Huaian 223003, China.
  • Yue Zhang
    Department of Ophthalmology, Beijing Hospital, National Center of Gerontology, Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, Beijing, China.