ICASSP 2019 Paper #4001: INCREASE APPARENT PUBLIC SPEAKING FLUENCY BY SPEECH AUGMENTATION

Citation Author(s):
Nisha Gandhi, Tejas Naik, Roy Shilkrot
Submitted by:
Sagnik Das
Last updated:
12 May 2019 - 12:38pm
Document Type:
Poster
Document Year:
2019
Event:
Presenters:
SAGNIK DAS
Paper Code:
4001
 

Fluent, confident speech is desirable to every speaker, but professional speech delivery requires a great deal of experience and practice. In this paper, we propose a speech stream manipulation system that helps non-professional speakers produce fluent, professional-sounding speech, in turn contributing to better listener engagement and comprehension. We achieve this by manipulating the disfluencies in human speech, such as the sounds "uh" and "um", filler words, and awkward long silences. Given any unrehearsed speech, we segment and silence the filled pauses, then adjust the duration of the imposed silences as well as other long (disfluent) pauses using a predictive model learned from a dataset of professional speech. Finally, we output an audio stream in which the speaker sounds more fluent, confident, and practiced than in the original recording. According to our quantitative evaluation, we significantly increase the fluency of speech by reducing the rate of pauses and fillers.
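
The editing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the speech has already been segmented and labeled, and it replaces the learned pause-duration model with a fixed cap. The names `Segment`, `smooth_disfluencies`, and `max_pause` are hypothetical and chosen only for this illustration.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Segment:
    label: str       # assumed labels: "speech", "filler", or "silence"
    duration: float  # length in seconds


def smooth_disfluencies(segments: List[Segment], max_pause: float = 0.5) -> List[Segment]:
    """Silence fillers, merge adjacent silences, and shorten long pauses.

    max_pause is a fixed stand-in for the predictive pause-duration model
    mentioned in the abstract (an assumption of this sketch).
    """
    merged: List[Segment] = []
    for seg in segments:
        # A filled pause ("uh", "um") is treated as silence once muted.
        label = "silence" if seg.label == "filler" else seg.label
        if merged and merged[-1].label == "silence" and label == "silence":
            # Extend the previous silence instead of adding a new segment.
            merged[-1].duration += seg.duration
        else:
            merged.append(Segment(label, seg.duration))

    # Cap every pause at the target duration; speech is left untouched.
    for seg in merged:
        if seg.label == "silence":
            seg.duration = min(seg.duration, max_pause)
    return merged


talk = [
    Segment("speech", 2.0),
    Segment("filler", 0.4),
    Segment("silence", 1.5),
    Segment("speech", 3.0),
]
result = smooth_disfluencies(talk)
```

In this example the filler and the following silence are merged into a single pause and then shortened to `max_pause`, while both speech segments keep their original durations. A real system would apply the same timeline to the audio samples and would choose each pause length from a model rather than a constant.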
