Sorry, you need to enable JavaScript to visit this website.

ISCSLP 2016

Welcome to ISCSLP 2016 - October 17-20, 2016, Tianjin, China

The ISCSLP will be hosted by Tianjin University. Tianjin has a reputation throughout China for being extremely friendly, safe and a place of delicious food. Welcome to Tianjin to attend the ISCSLP2016. The 10th International Symposium on Chinese Spoken Language Processing (ISCSLP 2016) will be held on October 17-20, 2016 in Tianjin. ISCSLP is a biennial conference for scientists, researchers, and practitioners to report and discuss the latest progress in all theoretical and technological aspects of spoken language processing. While the ISCSLP is focused primarily on Chinese languages, works on other languages that may be applied to Chinese speech and language are also encouraged. The working language of ISCSLP is English.

 

Relatively little research has addressed the role of L1 in the
perception of English speech contrasts by Chinese learners of
English as L3. The present study investigates the role of L1 in
the perception of the English alveolar-velar nasal coda contrast
(/n/ vs. /ŋ/) after the vowels /i ʌ æ/ by bilingual Changsha
Chinese speakers, whose L1 is Changsha Chinese and L2 is
Standard Mandarin. Changsha Chinese only permits an
alveolar nasal coda /n/, while Standard Mandarin permits both
final /n/ and /ŋ/. We examined whether or not monolingual

Categories:
15 Views

Assuming that linguistic specifications and information
planning contribute to different levels of prosodic organization
that cumulatively constitute output prosody, quantitative
analysis of respective contributions can be derived through
normalization procedures that remove levels of interactions
involved. The current study attempts to account for how L2
prosody departs from the L1 norm in the two levels mentioned
and whether an account can be offered. F0 patterns of word
English stress categories (primary, secondary and tertiary) and

Categories:
5 Views

This is oral presentation at ISCSLP, for more information, please refer to paper:

Jun-Hua Liu, Zhen-Hua Ling, Si Wei, Guo-Ping Hu, Li-Rong Dai, "Cluster-Based Senone Selection for the Efficient Calculation of Deep Neural Network Acoustic Models", ISCSLP, 2016.

Categories:
7 Views

Directions into Velocities of Articulators (DIVA) model is a kind of self-adaptive neural network model which controls movements of a simulated vocal tract to produce words, syllables or phonemes. However, DIVA model lacks of emotion functions. To implement the emotion function in DIVA model, we investigate the process of affective speech production based on the combination of fundamental frequency (F0) and formant frequencies, as well as the relations between F0 and formants of emotional speech.

Categories:
12 Views

Success in spoken word processing relies not only on accurate word recognition but also the veracity with which words are maintained in memory. However, research on word retention is still scarce, especially in tonal languages and phonologically impaired populations. To address these gaps, the present study administered an auditory order recall task to native Cantonese speakers with and without amusia. Stimuli intrinsic (segmental similarity, suprasegmental similarity, and lexicality) and extrinsic (cognitive load) factors were manipulated.

Categories:
1 Views

In conventional codebook-driven speech enhancement, only spectral envelopes of speech and noise are considered, and at the same time, the type of noise is the priori information when we enhance the noisy speech. In this paper, we propose a novel codebook-based speech enhancement method which exploits a priori information about binaural cues, including clean cue and pre-enhanced cue, stored in the trained codebook. This method includes two main parts: offline training of cues and online enhancement by means of cues.

Categories:
10 Views

The main object of this study is voice quality after total thyroidectomy (which involves complete removal of the thyroid gland) or isthmolobectomie (which involves removal of the half, right or left, portions of the gland). This often causes degradation of voice quality permanently or temporarily. Voice quality will be studied using aerodynamic cues. From an aerodynamic point of view, oral airflow (Oaf) and maximum phonation time (TMP) were observed.

Categories:
3 Views

Pages