Sorry, you need to enable JavaScript to visit this website.

Sell-corpus: an Open Source Multiple Accented Chinese-english Speech Corpus for L2 English Learning Assessment

Citation Author(s):
Yu Chen, Jun Hu, Xinyu Zhang
Submitted by:
Yu Chen
Last updated:
9 May 2019 - 11:28pm
Document Type:
Poster
Document Year:
2019
Event:
Presenters:
Yu Chen
Paper Code:
1738
Keywords:
 

We present SELL-CORPUS, a multiple accented speech corpus for L2 English learning in China, aiming at the potential research of multiple accented acoustic model, mispronunciation detection and pronunciation assessment for future nationwide oral English tests. Our corpus contains 31.6 hour speech recordings contributed by 389 volunteer speakers, including 186 males and 203 females. Our corpus covers seven major regional dialects and provides a baseline for Chinese multiple accented automatic speech recognition system. We released our speech corpus to the public for academic research. To the best of our knowledge, it is the first open-source English speech corpus that accounts for the accents of all major Chinese regional dialects.

up
0 users have voted: