基于句法依存卷积神经网络的句子相似度计算
DOI:
作者:
作者单位:

1.重庆大学;2.重庆工商大学

作者简介:

通讯作者:

中图分类号:

基金项目:


Sentence Similarity Computation on Syntactic Dependency Convolutional Neural Network
Author:
Affiliation:

1.Chongqing University;2.Chongqing Technology and Business University

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    句子相似度计算是自然语言处理的一项基础任务,其准确性直接影响机器翻译、问题回答等下游任务的性能。传统机器学习方法主要依靠词形、词序及结构等浅层特征计算句子相似度,而深度学习方法能够融入深层语义特征,从而取得了更好效果。深度学习方法如卷积神经网络在提取文本特征时存在感受野狭小、长距离依赖信息不足的缺点。因此本文设计了DCNN模型,该模型利用词语之间的依存关系来解决该不足。DCNN模型首先通过依存句法分析得到句子中词语之间的依存关系,而后根据与当前词存在一跳或者两跳关系的词语形成二元和三元的词语组合,再将这两部分信息作为原句信息的补充,输入到卷积神经网络中,以此来获取词语之间长距离依赖信息。实验结果表明,加入依存句法信息得到的长距离依赖能有效提升模型性能。在MSRP数据集上,模型准确度和F1值分别为80.33%和85.91,在SICK 数据集上模型的皮尔森相关系数能达到87.5, 在MSRvid 数据集上模型的皮尔森相关系数能达到92.2。

    Abstract:

    Sentence Similarity Computation lays the basis of many natural language processing tasks, the accuracy of which has a direct impact on the performance of language related systems, especially in machine translation, plagiarism detection, query ranking and question answering. Therefore, it is in demand to raise the accuracy of sentence similarity computation by directing traditional methods that rely on shallow features like morphology, word sequence and grammar structure to employ deep learning methods, or integrating these two approaches together. However, deep learning methods using convolutional neural networks are required to overcome disadvantages such as narrow receptive field and insufficient long-distance information dependence when extracting text features. This paper carries out dependency-based syntactic analysis to retrieve information over longer distance. We make text parsing employing Stanford NLP for syntactic analysis, and then retrieve mutual relationship between two words in a binary combination or triplet. As lexical supplement information embedded in these word combinations, the dependency information, in addition to the original sentence, is added up as Convolutional Neural Network input, thus constructing a Dependency CNN. Experiment results reveal that the long distance dependency information effectively improve the similarity computation performance in our proposed dependency model on MSRP dataset, reaching an accuracy and F1 value of 80.33% and 85.91 respectively. Notably, Pearson correlation coefficient of this long distance dependency model reaches 87.5 on SICK dataset, and a much higher 92.2 on MSRvid dataset.

    参考文献
    相似文献
    引证文献
引用本文
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2019-08-15
  • 最后修改日期:2019-11-02
  • 录用日期:2019-11-04
  • 在线发布日期:
  • 出版日期: