Sorry, you need to enable JavaScript to visit this website.

Speech Processing

Speaker Diarization System for Autism Children’s Real-Life Audio Data

Paper Details

Authors:
Submitted On:
15 October 2016 - 12:39pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

167.pdf

(417)

Subscribe

[1] , "Speaker Diarization System for Autism Children’s Real-Life Audio Data", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1251. Accessed: Sep. 17, 2019.
@article{1251-16,
url = {http://sigport.org/1251},
author = { },
publisher = {IEEE SigPort},
title = {Speaker Diarization System for Autism Children’s Real-Life Audio Data},
year = {2016} }
TY - EJOUR
T1 - Speaker Diarization System for Autism Children’s Real-Life Audio Data
AU -
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1251
ER -
. (2016). Speaker Diarization System for Autism Children’s Real-Life Audio Data. IEEE SigPort. http://sigport.org/1251
, 2016. Speaker Diarization System for Autism Children’s Real-Life Audio Data. Available at: http://sigport.org/1251.
. (2016). "Speaker Diarization System for Autism Children’s Real-Life Audio Data." Web.
1. . Speaker Diarization System for Autism Children’s Real-Life Audio Data [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1251

Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners


Natural and synthesized speech in L2 Mandarin produced by American English learners was evaluated by native Mandarin speakers to identify focus status and rate the naturalness of the speech. The results reveal that natural speech was recognized and rated better than synthesized speech, early learners’ speech better than late learners’ speech, focused sentences better than no-focus sentences, and initial focus and medial focus better than final focus. Tones of in-focus words interacted with focus status of the sentence and speaker group.

Paper Details

Authors:
Ying Chen, Li Liu, Xueqin Zhao
Submitted On:
14 October 2016 - 1:50pm
Short Link:
Type:
Event:

Document Files

ChenEtAl._ISCSLP2016_poster.pdf

(394)

Subscribe

[1] Ying Chen, Li Liu, Xueqin Zhao, "Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1211. Accessed: Sep. 17, 2019.
@article{1211-16,
url = {http://sigport.org/1211},
author = {Ying Chen; Li Liu; Xueqin Zhao },
publisher = {IEEE SigPort},
title = {Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners},
year = {2016} }
TY - EJOUR
T1 - Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners
AU - Ying Chen; Li Liu; Xueqin Zhao
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1211
ER -
Ying Chen, Li Liu, Xueqin Zhao. (2016). Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners. IEEE SigPort. http://sigport.org/1211
Ying Chen, Li Liu, Xueqin Zhao, 2016. Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners. Available at: http://sigport.org/1211.
Ying Chen, Li Liu, Xueqin Zhao. (2016). "Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners." Web.
1. Ying Chen, Li Liu, Xueqin Zhao. Perceptual Evaluation of Natural and Synthesized Speech with Prosodic Focus in Mandarin Production of American Learners [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1211

Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques


Aphasia is a type of acquired language impairment caused by brain injury. This paper presents an automatic speech recog- nition (ASR) based approach to objective assessment of apha- sia patients. A dedicated ASR system is developed to facilitate acoustical and linguistic analysis of Cantonese aphasia speech. The acoustic models and the language models are trained with domain- and style-matched speech data from unimpaired con- trol speakers. The speech recognition performance of this sys- tem is evaluated on natural oral discourses from patients with various types of aphasia.

Paper Details

Authors:
Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law
Submitted On:
14 October 2016 - 5:51am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

conference-10.18.pdf

(441)

Subscribe

[1] Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law, "Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1187. Accessed: Sep. 17, 2019.
@article{1187-16,
url = {http://sigport.org/1187},
author = {Ying Qin; Tan Lee; Anthony Pak Hin Kong; Sam Po Law },
publisher = {IEEE SigPort},
title = {Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques},
year = {2016} }
TY - EJOUR
T1 - Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques
AU - Ying Qin; Tan Lee; Anthony Pak Hin Kong; Sam Po Law
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1187
ER -
Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law. (2016). Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques. IEEE SigPort. http://sigport.org/1187
Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law, 2016. Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques. Available at: http://sigport.org/1187.
Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law. (2016). "Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques." Web.
1. Ying Qin, Tan Lee, Anthony Pak Hin Kong, Sam Po Law. Towards Automatic Assessment of Aphasia Speech Using Automatic Speech Recognition Techniques [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1187

Poster for Nonstationary Blind Super-resolution

Paper Details

Authors:
Dehui Yang, Gongguo Tang, Michael Wakin
Submitted On:
30 March 2016 - 3:34am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

ICASSP_poster_with_reference.pdf

(505)

Subscribe

[1] Dehui Yang, Gongguo Tang, Michael Wakin, "Poster for Nonstationary Blind Super-resolution", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1070. Accessed: Sep. 17, 2019.
@article{1070-16,
url = {http://sigport.org/1070},
author = {Dehui Yang; Gongguo Tang; Michael Wakin },
publisher = {IEEE SigPort},
title = {Poster for Nonstationary Blind Super-resolution},
year = {2016} }
TY - EJOUR
T1 - Poster for Nonstationary Blind Super-resolution
AU - Dehui Yang; Gongguo Tang; Michael Wakin
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1070
ER -
Dehui Yang, Gongguo Tang, Michael Wakin. (2016). Poster for Nonstationary Blind Super-resolution. IEEE SigPort. http://sigport.org/1070
Dehui Yang, Gongguo Tang, Michael Wakin, 2016. Poster for Nonstationary Blind Super-resolution. Available at: http://sigport.org/1070.
Dehui Yang, Gongguo Tang, Michael Wakin. (2016). "Poster for Nonstationary Blind Super-resolution." Web.
1. Dehui Yang, Gongguo Tang, Michael Wakin. Poster for Nonstationary Blind Super-resolution [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1070

Template based techniques for automatic segmentation of TTS unit database

Paper Details

Authors:
S. Adithya, Sunil Rao, C. Mahima, S. Vishnu
Submitted On:
24 March 2016 - 11:32pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

Template Based Techniques For Automatic Segmentation Of TTS Unit Database.pdf

(97)

Subscribe

[1] S. Adithya, Sunil Rao, C. Mahima, S. Vishnu, "Template based techniques for automatic segmentation of TTS unit database", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1041. Accessed: Sep. 17, 2019.
@article{1041-16,
url = {http://sigport.org/1041},
author = {S. Adithya; Sunil Rao; C. Mahima; S. Vishnu },
publisher = {IEEE SigPort},
title = {Template based techniques for automatic segmentation of TTS unit database},
year = {2016} }
TY - EJOUR
T1 - Template based techniques for automatic segmentation of TTS unit database
AU - S. Adithya; Sunil Rao; C. Mahima; S. Vishnu
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1041
ER -
S. Adithya, Sunil Rao, C. Mahima, S. Vishnu. (2016). Template based techniques for automatic segmentation of TTS unit database. IEEE SigPort. http://sigport.org/1041
S. Adithya, Sunil Rao, C. Mahima, S. Vishnu, 2016. Template based techniques for automatic segmentation of TTS unit database. Available at: http://sigport.org/1041.
S. Adithya, Sunil Rao, C. Mahima, S. Vishnu. (2016). "Template based techniques for automatic segmentation of TTS unit database." Web.
1. S. Adithya, Sunil Rao, C. Mahima, S. Vishnu. Template based techniques for automatic segmentation of TTS unit database [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1041

Detecting The Instant of Emotion Change from Speech Using A Martingale Framework


Towards a better understanding of emotion in speech, it is important to understand how emotion changes and when it changes. Recognizing emotions using pre-segmented speech utterances results in a loss in continuity of emotions and does not provide insights into emotion changes. In this paper, we propose an investigation into emotion change detection from the perspective of exchangeability of data points observed sequentially using a martingale framework. Within the framework, a per-frame GMM likelihood based approach is proposed as a measure of strangeness from a particular emotion class.

Paper Details

Authors:
Zhaocheng Huang, Julien Epps
Submitted On:
23 March 2016 - 2:53am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

ICASSP2016_Huang_25_03_2016_Upload.pdf

(443)

Subscribe

[1] Zhaocheng Huang, Julien Epps, "Detecting The Instant of Emotion Change from Speech Using A Martingale Framework", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/983. Accessed: Sep. 17, 2019.
@article{983-16,
url = {http://sigport.org/983},
author = {Zhaocheng Huang; Julien Epps },
publisher = {IEEE SigPort},
title = {Detecting The Instant of Emotion Change from Speech Using A Martingale Framework},
year = {2016} }
TY - EJOUR
T1 - Detecting The Instant of Emotion Change from Speech Using A Martingale Framework
AU - Zhaocheng Huang; Julien Epps
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/983
ER -
Zhaocheng Huang, Julien Epps. (2016). Detecting The Instant of Emotion Change from Speech Using A Martingale Framework. IEEE SigPort. http://sigport.org/983
Zhaocheng Huang, Julien Epps, 2016. Detecting The Instant of Emotion Change from Speech Using A Martingale Framework. Available at: http://sigport.org/983.
Zhaocheng Huang, Julien Epps. (2016). "Detecting The Instant of Emotion Change from Speech Using A Martingale Framework." Web.
1. Zhaocheng Huang, Julien Epps. Detecting The Instant of Emotion Change from Speech Using A Martingale Framework [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/983

Pages