Prosodic Annotation Enriched Statistical Machine Translation

Abstract: 

A growing range of linguistic information, such as part of speech, syntactic structure, and discourse context, has been used to improve the performance of machine translation. Conventional approaches, however, typically ignore key information beyond the text, such as prosody. In this paper, we exploit three prosodic features: pronunciation (phonetic alphabet and tone), prosodic boundaries, and emphasis. A conditional random fields (CRF) sequence tagger, trained on the annotated data, labels Chinese sentences with prosodic tags, and three methods are presented to integrate these features: (1) factored translation models, in which the prosodic features are incorporated as factors; (2) a word lattice decoding model, in which prosodic boundaries are treated as an alternative to tokenization boundaries; and (3) re-ranking models, in which the prosodic features are integrated into the language model to re-score the n-best translation candidates. We evaluate the proposed methods using the bilingual evaluation understudy (BLEU) score in both the English-to-Chinese (E2C) and Chinese-to-English (C2E) translation directions. Experiments show that, with prosodic features, the re-ranking model achieves a significant improvement, while the word lattice decoding and factored translation models also improve performance.
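
The re-ranking idea in method (3) can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a toy add-one-smoothed bigram language model, a hypothetical `word|boundary` token format (for example `ni|B1`), and a hypothetical `lm_weight` parameter for combining the decoder score with the language model score. The abstract does not specify the language model, the tag format, or the interpolation used.

```python
import math
from collections import Counter


def train_bigram_lm(sentences):
    """Count bigrams over prosody-annotated tokens (hypothetical 'word|tag' format)."""
    histories, bigrams = Counter(), Counter()
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        histories.update(padded[:-1])                 # counts of each history token
        bigrams.update(zip(padded[:-1], padded[1:]))  # counts of adjacent pairs
    vocab = {tok for s in sentences for tok in s} | {"<s>", "</s>"}
    return histories, bigrams, len(vocab)


def lm_logprob(tokens, lm):
    """Add-one-smoothed bigram log-probability of one annotated sentence."""
    histories, bigrams, vocab_size = lm
    padded = ["<s>"] + tokens + ["</s>"]
    total = 0.0
    for prev, cur in zip(padded[:-1], padded[1:]):
        total += math.log((bigrams[(prev, cur)] + 1) / (histories[prev] + vocab_size))
    return total


def rerank(nbest, lm, lm_weight=1.0):
    """Re-score (tokens, decoder_score) pairs and return token lists, best first.

    lm_weight is an assumed interpolation weight, not a value from the paper.
    """
    scored = [(dec + lm_weight * lm_logprob(toks, lm), toks) for toks, dec in nbest]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [toks for _, toks in scored]


if __name__ == "__main__":
    # Toy training data: each token carries a made-up prosodic boundary tag.
    train = [["wo|B0", "ai|B0", "ni|B1"], ["wo|B0", "ai|B0", "ta|B1"]]
    lm = train_bigram_lm(train)
    # Toy n-best list: the decoder slightly prefers the first candidate.
    nbest = [(["ai|B0", "wo|B0", "ni|B1"], -1.0),
             (["wo|B0", "ai|B0", "ni|B1"], -1.2)]
    for candidate in rerank(nbest, lm):
        print(" ".join(candidate))
```

Because the prosodic tags are part of each token, the language model scores word order and prosodic annotation jointly, so a candidate the decoder ranked lower can move to the top if its annotated sequence is more probable under the model.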


Paper Details

Authors: Peidong Guo, Heyan Huang, Ping Jian, Yuhang Guo
Submitted On: 15 October 2016 - 12:10pm
Short Link:
Type: Presentation Slides
Event:
Presenter's Name: Peidong Guo
Paper Code: S2-3
Document Year: 2016
Cite

Document Files

Prosodic Annotation Enriched Statistical Machine Translation(161014).pdf



[1] Peidong Guo, Heyan Huang, Ping Jian, Yuhang Guo, "Prosodic Annotation Enriched Statistical Machine Translation", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1250. Accessed: May 28, 2017.