《厦门大学学报（自然科学版）》

传统的机器翻译评价方法往往需要参考译文,利用机器双语互译评估(BLEU)值等方法比较翻译结果与参考译文之间的相似性.但是,在现实生活中却很难为每一句待翻译的句子找到参考答案,因此,不使用参考译文的译文质量估计(quality estimation,QE)方法有着更加广泛的应用场景.在该文中,基于多语言的预训练语言模型,利用联合编码的策略完成句子级的QE任务,在WMT 2018的QE任务德语→英语语言方向上的评测数据集上取得了最佳的实验结果.同时,对比了微调过程中不同网络结构对于该任务的影响,并探究了平行语料联合编码二次预训练在句子级跨语言任务上的效果.

In recent years,neural machine translation has advanced greatly.Traditional machine translation evaluation methods generally require references,such as BLEU(bilingual evaluation understudy).These methods aim to compare similarities between candidate and reference.However,in practice,it is difficult for us to find a reference for each source sentence.Therefore,the quality estimation(QE)application scenario is more extensive.In this paper,we use the multi-language pre-trained language model,with the joint-encoding strategy to complete the sentence-level QE task.Experiments show that our model can obtain outstanding results in WMT 2018 QE Shared Task German→English language direction.At the same time,we also compare the impact of different network structures on the task.Finally,we explore the effect of the secondary pre-trained of parallel corpus on the cross-lingual sentence tasks.

引言
1 基于Multi-BERT跨语言联合编码的预训练
2 基于预训练语言模型的QE方法
3 实验与分析
4 结论

图1 使用平行语料的Multi-BERT二次训练方法<br/>Fig.1 Pretraining method based on Multi-BERT with parallel sentences

图1 使用平行语料的Multi-BERT二次训练方法
Fig.1 Pretraining method based on Multi-BERT with parallel sentences

图2 基于Multi-BERT的BERT+Bi-GRU模型示意图<br/>Fig.2 BERT+Bi-GRU model based on Multi-BERT

图2 基于Multi-BERT的BERT+Bi-GRU模型示意图
Fig.2 BERT+Bi-GRU model based on Multi-BERT

图3 基于Multi-BERT的BERT+Bi-GRU+LASER+Baseline模型示意图<br/>Fig.3 BERT+Bi-GRU+LASER+Baseline model based on Multi-BERT

图3 基于Multi-BERT的BERT+Bi-GRU+LASER+Baseline模型示意图
Fig.3 BERT+Bi-GRU+LASER+Baseline model based on Multi-BERT

表1 WMT 2018 QE Shared Task德语→英语测试集实验结果<br/>Tab.1 The official result of WMT 2018 QE Shared Task German→English test dataset

表1 WMT 2018 QE Shared Task德语→英语测试集实验结果
Tab.1 The official result of WMT 2018 QE Shared Task German→English test dataset

表2 神经网络结构对WMT 2018 QE任务中德语→英语验证集上实验结果的影响<br/>Tab.2 The influence of neural network architecture on WMT 2018 QE Shared Task German→English valid dataset's result

表2 神经网络结构对WMT 2018 QE任务中德语→英语验证集上实验结果的影响
Tab.2 The influence of neural network architecture on WMT 2018 QE Shared Task German→English valid dataset's result

表3 使用平行语料训练WMT 2018 QE任务中德语→英语语言方向验证集上的结果<br/>Tab.3 The results of parallel data training on WMT 2018 QE Shared Task German→English valid dataset

表3 使用平行语料训练WMT 2018 QE任务中德语→英语语言方向验证集上的结果
Tab.3 The results of parallel data training on WMT 2018 QE Shared Task German→English valid dataset

表4 前5最相似词表中单词翻译对的数量和准确性<br/>Tab.4 The number and accuracy of word translations at top-5 most similar words list

表4 前5最相似词表中单词翻译对的数量和准确性
Tab.4 The number and accuracy of word translations at top-5 most similar words list

图4 不同语言模型的跨语言注意力权重可视化<br/>Fig.4 Cross-lingual attention weight visualization in different language models

图4 不同语言模型的跨语言注意力权重可视化
Fig.4 Cross-lingual attention weight visualization in different language models

[1] ZHOU L,ZHANG J J,ZONG C Q.Synchronous bidirectional neural machine translation[J].Transactions of the Association for Computational Linguistics,2019,7:91-105.
[2] ZHOU L,ZHANG J J,ZONG C Q,et al.Sequence generation:from both sides to the middle[EB/OL].[2019-11-29].https:∥arxiv.org/pdf/1906.09601.pdf.
[3] BAHDANAU D,CHO K,BENGIO Y.Neural machine translation by jointly learning to align and translate[EB/OL].[2019-11-29].https:∥arxiv.org/pdf/1409.0473.pdf.
[4] GEHRING J,AULI M,GRANGIER D,et al.Convo-lutional sequence to sequence learning[C]∥Proceedings of the 34th International Conference on Machine Learning-Volume 70.[S.l.]:PMLR,2017:1243-1252.
[5] VASWANI A,SHAZEER N,PARMAR N,et al.Attention is all you need[C]∥Proceedings of the 31st Annual Conference on Neural Information Processing Systems.California:NIPS,2017:5998-6008.
[6] ZHANG J J, ZONG C Q.Deep neural networks in machine translation:an overview[J].IEEE Intelligent Systems,2015,30(5):16-25.
[7] BLATZ J,FITZGERALD E,FOSTER G,et al.Confidence estimation for machine translation[C]∥COLING'04:Proceedings of the 20th international conference on computational linguistics.Geneva:ACL,2004:315-321.
[8] BOJAR O,BUCK C,FEDERMANN C,et al.Findings of the 2014 workshop on statistical machine translation[C]∥Proceedings of Ninth Workshop on Statistical Machine Translation.Baltimore:ACL,2014:12-58.
[9] SPECIA L,PAETZOLD G,SCARTON C.Multi-level translation quality prediction with QuEst++[C]∥Proceedings of ACL-IJCNLP 2015 System Demonstrations.Beijing:ACL,2015:115-120.
[10] ZHANG J J,LIU S J,LI M,et al.Bilingually-constrained phrase embeddings for machine translation[C] ∥Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics.Baltimore:ACL,2014:111-121.
[11] KIM H,JUNG H Y,KWON H,et al.Predictor-estimator:neural quality estimation based on target word prediction for machine translation[J].ACM Transactions on Asian and Low-Resource Language Information Processing(TALLIP),2017,17(1):3.doi:10.1145/3109480.
[12] LI M,XIANG Q,CHEN Z,et al.A unified neural network for quality estimation of machine translation[J].IEICE Transactions on Information and Systems,2018(9):2417-2421.
[13] FAN K,WANG J Y,LI B et al. “Bilingual Expert” can find translation errors[C] ∥Proceedings of the AAAI Conference on Artificial Intelligence.Hawaii:AAAI,2019,33(1):6367-6374.
[14] IVE J,BLAIN F,SPECIA L.DeepQuEst:a framework for neural-based quality estimation[C] ∥Proceedings of the 27th International Conference on Computational Linguistics.New Mexico:ACL,2018:3146-3157.
[15] SNOVER M G,MADNANI N,DORR B,et al.TER-Plus:paraphrase,semantic,and alignment enhancements to translation edit rate[J].Machine Translation,2009,23(2/3):117-127.
[16] PETERS M E,NEUMANN M,IYYER M,et al.Deep contextualized word representations[C] ∥Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics:Human Language Technologies.New Orleans:ACL,2018:2227-2237.
[17] RADFORD A,NARASIMHAN K,SALIMANS T,et al.Improving language understanding with unsupervised learning[EB/OL].[2019-11-29].https:∥openai.com/blog/language-unsupervised.
[18] DEVLIN J,CHANG M W,LEE K,et al.BERT:pre-training of deep bidirectional transformers for language understanding[EB/OL].[2019-11-29].https:∥arxiv.org/pdf/1810.04805.pdf.
[19] LAMPLE G,CONNEAU A.Cross-lingual language model pretraining[EB/OL].[2019-11-29].https:∥arxiv.org/pdf/1901.07291v1.pdf.
[20] CHUNG J Y,GULCEHRE C,CHO K,et al.Empirical evaluation of gated recurrent neural networks on sequence modeling[EB/OL].[2019-11-29].https:∥arxiv.org/pdf/1412.3555.pdf.
[21] KIPF T N,WELLING M.Semi-supervised classification with graph convolutional networks[EB/OL].[2019-11-29].https:∥arxiv.org/pdf/1609.02907.pdf.
[22] ARTETXE M,SCHWENK H.Massively multilingual sentence embeddings for zero-shot cross-lingual transfer and beyond[J].Transactions of the Association for Computational Linguistics,2019,7:597-610.
[23] CONNEAU A,LAMPLE G,RANZATO M A,et al.Word translation without parallel data[EB/OL].[2019-11-29].https:∥arxiv.org/pdf/1710.04087.pdf.

备注

引言

1 基于Multi-BERT跨语言联合编码的预训练

2 基于预训练语言模型的QE方法

3 实验与分析

4 结论

学报简介

备注

引言

1 基于Multi-BERT跨语言联合编码的预训练

2 基于预训练语言模型的QE方法

3 实验与分析

4 结 论

学报简介

4 结论