《厦门大学学报（自然科学版）》

为有效解决数据的稀疏性问题,并考虑句法预测的内在层次性,提出了一个基于双向长短时记忆(bidirectional long short term memory,BLSTM)神经网络模型的渐步性句法分析模型.该模型将树形概率计算方法应用到对句法标签分类的研究中,利用句法结构和标签之间的层次关系,提出一种从句法结构到句法标签的渐步性句法分析方法,再使用句法分析树来生成句法标签的特征表示,并输入到BLSTM神经网络模型里进行句法标签的分类.在清华大学语义依存语料库上进行实验的结果表明,与链式概率计算方法以及其他依存句法分析器比较,依存准确率提升了0～1个百分点,表明新方法是可行、有效的.

In order to effectively solve the problem of data sparseness and inherent level of syntactic prediction,an incremental stepwise dependency parsing model based on bidirectional long short term memory(BLSTM)is proposed.This paper applying the tree-like probability calculation method to the study of syntactic tag classification,using the hierarchical relationship between syntactic structure and tag,proposes a step-by-step syntactic analysis method from syntactic structure to syntax tag,using syntactic analysis tree to generate the characteristics of the syntactic tag which are input into the BLSTM model to classify syntactic tags.Compared with other syntactic analysis methods and chained probability calculation method on the Semantic Dependency Corpus dataset of Tsinghua University,the dependency accuracy rate is improved by 0-1 percent.It shows that the new method is feasible and effective.

引言
1 基于树形概率的句法标签分类方法
2 BLSTM模型实现及训练
3 实验
4 结果分析
5 结论

图1 链式概率计算方法示意图<br/>Fig.1 A schematic diagram of chained probability calculation method

图1 链式概率计算方法示意图
Fig.1 A schematic diagram of chained probability calculation method

图2 树形概率计算方法<br/>Fig.2 Tree-like probability calculation method

图2 树形概率计算方法
Fig.2 Tree-like probability calculation method

图3 概率计算方法示例<br/>Fig.3 Example of probability calculation method

图3 概率计算方法示例
Fig.3 Example of probability calculation method

图4 神经网络结构<br/>Fig.4 Neural network structure

图4 神经网络结构
Fig.4 Neural network structure

表1 模型超参数设定<br/>Tab.1 Super parameter setting of the model

表1 模型超参数设定
Tab.1 Super parameter setting of the model

表2 语料库数据集示例<br/>Tab.2 Corpus dataset example

表2 语料库数据集示例
Tab.2 Corpus dataset example

表3 句法标签实验结果<br/>Tab.3 Experimental results of syntactic label%

表3 句法标签实验结果
Tab.3 Experimental results of syntactic label%

表4 依存分析实验结果<br/>Tab.4 Dependency parsing experimental results %

表4 依存分析实验结果
Tab.4 Dependency parsing experimental results %

[1] 刘海涛.依存语法和机器翻译[J].语言文字应用,1997,23(3):89-93.
[2] HAYS D G.Dependency theory:a formalism and some observations[J].Language,1964,40(4):511-525.
[3] GAIFMAN H.Dependency systems and phrase-structure systems[J].Information and Control,1965,8(3):304-337.
[4] YAMADA H.Statistical dependency analysis with support vector machines[C]∥Proceedings of the 8th International Workshop on Parsing Technologies.Nancy:International Workshop on Parsing Technologies,2003:195-206.
[5] LAI T B Y,HUANG C,ZHOU M,et al.Span-based statistical dependency parsing of chinese[C]∥Proceedings of the 6th Natural Language Processsing Pacific Rim Syposium(NLPRS2001).Tokyo:National Center of Sciences,2001:677-684.
[6] COLLOBERT R.Deep learning for efficient discriminative parsing[C]∥International Conference on Artificical Intelligence and Statistics.Lauderdate:AISTATS,2011:224-232.
[7] CHEN D,MANNING C D.A fast and accurate dependency parser using neural networks[C]∥Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing.Doha:Association for Computational Linguistics,2014:740-750.
[8] DURRETT G,KLEIN D.Neural CRF Parsing[EB/OL].[2018-03-01].http:∥arxiv.org/abs/1507.03641.
[9] MA X,HOVY E.Neural probabilistic model for non-projective MST parsing[C]∥Proceeding of the 8th International Point Conference on Natural Language Processing.Taipei:IJCNLP,2017:59-69.
[10] 王衡军,司念文,宋玉龙,单义栋.结合全局向量特征的神经网络依存句法分析模型[J].通信学报,2018,39(2):53-64.
[11] WANG W,CHANG B.Improved graph-based dependency parsing via hierarchical LSTM networks[C]∥China National Conference on Chinese Computational Linguistics.Yantai:Springer International Publishing,2016:25-32.
[12] ZHANG X,CHENG J,LAPATA M.Dependency parsing as head selection[EB/OL].[2018-03-01].http:∥arxiv.org/pdf/1606.01280.pdf.
[13] 张丹,周俏丽,张桂平.引入层次成分分析的依存句法分析[J].沈阳航空航天大学学报,2017,34(1):76-82.

备注

引言

1 基于树形概率的句法标签分类方法

2 BLSTM模型实现及训练

3 实验

4 结果分析

5 结论

学报简介

备注

引言