Sorry, you need to enable JavaScript to visit this website.

Exploring Tonal Information for Lhasa Dialect Acoustic Modeling

Citation Author(s):
Jian Li, Hongcui Wang, Longbiao Wang, Jianwu Dang, Kuntharrgyal khuru, Gyaltsen Lobsang
Submitted by:
Jian Li
Last updated:
14 October 2016 - 11:52am
Document Type:
Poster
Document Year:
2016
Event:
Presenters:
Jian Li
 

Detailed analysis of tonal features for Tibetan Lhasa dialect is an important task for Tibetan automatic speech recognition (ASR) applications. However, it is difficult to utilize tonal information because it remains controversial how many tonal patterns the Lhasa dialect has. Therefore, few studies have focused on modeling the tonal information of the Lhasa dialect for speech recognition purpose. For this reason, we investigated influences of the tonal information on the performance of Lhasa Tibetan speech recognition. Since Lhasa Tibetan has no conclusive tonal pattern yet, in this study, we used a four-tone pattern and designed a phone set based on the four contour contrasts scheme. Speech recognition performance was examined using the acoustic model with and without the pitch-related features. The experimental results showed that the character error rate (CER) was improved 11% after applying the tone based phone set and pitch-related features to DNN-HMM based speech recognition by comparing to that without tonal information. This preliminary study revealed that the tonal information plays an important role in speech recognition of Tibetan Lhasa dialect.

up
0 users have voted: