《厦门大学学报（自然科学版）》

与传统的机器译文评价方法不同,译文质量估计技术旨在无参考译文的情况下对机器译文质量进行评价.针对目前流行的基于深度学习的译文质量估计方法因数据匮乏和模型限制导致所提取的深度学习特征不充分的现状,提出一种多特征融合的方法.该方法将词预测特征、语境化词嵌入特征、依存句法特征和基线特征等从不同模型中提取到的特征分别输入到基于循环神经网络的下游模型中,进一步学习后采用不同的特征融合方式进行融合,以此来提高译文质量估计的准确性.通过对比实验表明,本文所提出的多特征融合策略相比于单个特征能更好地对双语信息进行表达,且进一步提高了译文质量估计的皮尔逊相关系数等评价指标.

Unlike traditional machine translation evaluation methods,translation quality estimation technique aims to evaluate the quality of machine translations without references.At present,the deep learning-based features extracted by translation quality estimation methods are not sufficient due to the lack of data and the limitation of models.Focusing on this problem,we propose a multi-feature fusion method.In this method,features extracted from different aspects such as word prediction features,contextualized word embedding features,dependent syntactic features and baseline features are input to the downstream model based on the recurrent neural network.Then different strategies are adopted to combine these features.Comparative experiments show that the proposed method can better express the bilingual information compared with the single-feature method,and can improve the Pearson correlation coefficient as well as other evaluation metrics of sentence-level translation quality estimation.

引言
1 特征提取
2 多特征融合的译文质量估计模型
3 实验
4 总结与展望

图1 词预测特征提取过程<br/>Fig.1 Word prediction feature extraction process

图1 词预测特征提取过程
Fig.1 Word prediction feature extraction process

图2 语境化词嵌入特征提取过程<br/>Fig.2 Contextualized word embedding feature extraction process

图2 语境化词嵌入特征提取过程
Fig.2 Contextualized word embedding feature extraction process

图3 多特征融合的译文质量估计模型整体架构<br/>Fig.3 Architecture of translation quality estimation model with multi-feature fusion

图3 多特征融合的译文质量估计模型整体架构
Fig.3 Architecture of translation quality estimation model with multi-feature fusion

表1 数据集信息<br/>Tab.1 Information of data sets

表1 数据集信息
Tab.1 Information of data sets

表2 参数设置
Tab.2 Parameter settings

表3 不同融合策略的系统性能<br/>Tab.3 System performance of different fusion strategies

表3 不同融合策略的系统性能
Tab.3 System performance of different fusion strategies

表4 算术平均融合策略各系统性能<br/>Tab.4 System performance of arithmetic mean strategy

表4 算术平均融合策略各系统性能
Tab.4 System performance of arithmetic mean strategy

[1] SPECIA L,SHAH K,DE SOUZA J G C,et al.QuEst:a translation quality estimation framework[C]∥Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics:System Demonstrations.Stroudsburg:ACL,2013:79-84.
[2] FELICE M,SPECIA L.Linguistic features for quality estimation[C]∥Proceedings of the Seventh Workshop on Statistical Machine Translation.Stroudsburg:ACL,2012:96-103.
[3] 尹宝生,苗雪雷,季铎,等.大规模无参考译文质量自动评测技术的研究[J].沈阳航空航天大学学报,2012,29(1):70-74.
[4] HAN A L F,LU Y,WONG D F,et al.Quality estimation for machine translation using the joint method of evaluation criteria and statistical modeling[C]∥Proceedings of the Eighth Workshop on Statistical Machine Translation.Stroudsburg:ACL,2013:365-372.
[5] SHAH K,COHN T,SPECIA L.A bayesian non-linear method for feature selection in machine translation quality estimation[J].Machine Translation,2015,29(2):101-125.
[6] ALMAGHOUT H,SPECIA L.A CCG-based quality estimation metric for statistical machine translation[C]∥Proceedings of MT Summit XIV.Langhorne:AMTA,2013:223-230.
[7] KREUTZER J,SCHAMONI S,RIEZLER S.Quality estimation from ScraTCH(QUETCH):deep learning for word-level translation quality estimation[C]∥Proceedings of the Tenth Workshop on Statistical Machine Translation.Stroudsburg:ACL,2015:316-322.
[8] PATEL R N,SASIKUMAR M.Translation quality estimation using recurrent neural network[C]∥Proceedings of the First Conference on Machine Translation:Volume 2,Shared Task Papers.Stroudsburg:ACL,2016:819-824.
[9] KIM H,LEE J H.A recurrent neural networks approach for estimating the quality of machine translation output[C]∥Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.Stroudsburg:NAACL,2016:494-498.
[10] 孙潇,朱聪慧,赵铁军.融合翻译知识的机器翻译质量估计算法[J].智能计算机与应用,2019,9(2):279-283.
[11] FAN K,WANG J,LI B,et al.“Bilingual expert” can find translation errors[C]∥Proceedings of the AAAI Conference on Artificial Intelligence.Hawaii:AAAI,2019:6367-6374.
[12] MIKOLOV T,CHEN K,CORRADO G,et al.Efficient estimation of word representations in vector space[EB/OL].[2019-08-01].http:∥arxiv.org/pdf/1301.3781.pdf.
[13] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[C]∥Proceedings of the 31st International Conference on Neural Information Processing Systems.Los Angeles:NIPS,2017:6000-6010.
[14] 陈志明,李茂西,王明文.基于神经网络特征的句子级别译文质量估计[J].计算机研究与发展,2017,54(8):1804-1812.
[15] PENNINGTON J,SOCHER R,MANNING C.Glove:global vectors for word representation[C]∥Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing.Stroudsburg:EMNLP,2014:1532-1543.
[16] SCHWENK H.Continuous space translation models for phrase-based statistical machine translation[C]∥Proceedings of COLING 2012:Posters.Stroudsburg:COLING,2012:1071-1080.
[17] BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[EB/OL].[2019-08-01].http:∥arxiv.org/pdf/:1409.0473.pdf.
[18] DEVLIN J,CHANG M W,LEE K,et al.BERT:pre-training of deep bidirectional transformers for language understanding[C]∥Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.Minneapolis:NAACL,2019:4171-4186.
[19] HOCHREITER S,SCHMIDHUBER J.Long short-term memory[J].Neural Computation,1997,9(8):1735-1780.
[20] SNOVER M,DORR B,SCHWARTZ R,et al.A study of translation edit rate with targeted human annotation[C]∥Proceedings of the 7th Conference of the Association for Machine Translation in the Americas.Cambridge:AMTA,2006:223-231.
[21] KIM H,JUNG H Y,KWON H,et al.Predictor-estimator:neural quality estimation based on target word prediction for machine translation[J].ACM Transactions on Asian and Low-Resource Language Information Processing(TALLIP),2017,17(1):3.

备注

引言

1 特征提取

2 多特征融合的译文质量估计模型

3 实验

4 总结与展望

学报简介

备注

引言

1 特征提取

2 多特征融合的译文质量估计模型

3 实 验

4 总结与展望

学报简介

3 实验