Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation

Abstract: 

This paper presents initial English-Tigrinya statistical machine translation (SMT) research. Tigrinya is a highly inflected Semitic language spoken in Eritrea and Ethiopia. Translation involving morphologically complex languages is challenged by factors including data sparseness and source-target word alignment. We try to address these problems through morphological segmentation of Tigrinya words. After segmentation the difference in token count dropped significantly from 37.7% to 0.1%. The out-of-vocabulary rate was reduced by 46%. We analyzed phrase-based translation with unsegmented corpus and segmented corpus to study the effect of segmentation on translation quality. Preliminary results demonstrate promising performance improvement from a relatively small parallel corpus.

up
0 users have voted:

Paper Details

Authors:
Yemane Tedla and Kazuhide Yamamoto
Submitted On:
21 November 2016 - 8:31am
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Kazuhide Yamamoto
Document Year:
2016
Cite

Document Files

IALP-tig.pdf

(346)

Subscribe

[1] Yemane Tedla and Kazuhide Yamamoto, "The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1281. Accessed: Jul. 20, 2019.
@article{1281-16,
url = {http://sigport.org/1281},
author = {Yemane Tedla and Kazuhide Yamamoto },
publisher = {IEEE SigPort},
title = {The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation},
year = {2016} }
TY - EJOUR
T1 - The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation
AU - Yemane Tedla and Kazuhide Yamamoto
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1281
ER -
Yemane Tedla and Kazuhide Yamamoto. (2016). The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation. IEEE SigPort. http://sigport.org/1281
Yemane Tedla and Kazuhide Yamamoto, 2016. The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation. Available at: http://sigport.org/1281.
Yemane Tedla and Kazuhide Yamamoto. (2016). "The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation." Web.
1. Yemane Tedla and Kazuhide Yamamoto. The Effect of Shallow Segmentation for English-Tigrinya Statistical Machine Translation [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1281