Speech Processing

Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques

Aphasia is a type of acquired language impairment caused by brain injury. This paper presents an automatic speech recog- nition (ASR) based approach to objective assessment of apha- sia patients. A dedicated ASR system is developed to facilitate acoustical and linguistic analysis of Cantonese aphasia speech. The acoustic models and the language models are trained with domain- and style-matched speech data from unimpaired con- trol speakers. The speech recognition performance of this sys- tem is evaluated on natural oral discourses from patients with various types of aphasia.

conference-10.18.pdf

conference-10.18.pdf (739)

Categories:: Speech Processing

7 Views

Poster for Nonstationary Blind Super-resolution

Read more about Poster for Nonstationary Blind Super-resolution
Log in to post comments

ICASSP_poster_with_reference.pdf

ICASSP_poster_with_reference.pdf (803)

Categories:: Speech Processing

4 Views

Template based techniques for automatic segmentation of TTS unit database

Read more about Template based techniques for automatic segmentation of TTS unit database
Log in to post comments

Template based automatic segmentation of unit-database for TTS into phonetic and syllabic units.

Template Based Techniques For Automatic Segmentation Of TTS Unit Database.pdf

Template Based Techniques For Automatic Segmentation Of TTS Unit Database.pdf (97)

Categories:: Speech Processing

9 Views

Detecting The Instant of Emotion Change from Speech Using A Martingale Framework

Read more about Detecting The Instant of Emotion Change from Speech Using A Martingale Framework
Log in to post comments

Towards a better understanding of emotion in speech, it is important to understand how emotion changes and when it changes. Recognizing emotions using pre-segmented speech utterances results in a loss in continuity of emotions and does not provide insights into emotion changes. In this paper, we propose an investigation into emotion change detection from the perspective of exchangeability of data points observed sequentially using a martingale framework. Within the framework, a per-frame GMM likelihood based approach is proposed as a measure of strangeness from a particular emotion class.

ICASSP2016_Huang_25_03_2016_Upload.pdf

ICASSP2016_Huang_25_03_2016_Upload.pdf (778)

Categories:: Speech Processing

11 Views

Speech Processing

Pages