Audio and Acoustic Signal Processing

Detection of Mood Disorder Using Speech Emotion Profiles and LSTM

Read more about Detection of Mood Disorder Using Speech Emotion Profiles and LSTM
Log in to post comments

In mood disorder diagnosis, bipolar disorder (BD) patients are often misdiagnosed as unipolar depression (UD) on initial presentation. It is crucial to establish an accurate distinction between BD and UD to make a correct and early diagnosis, leading to improvements in treatment and course of illness. To deal with this misdiagnosis problem, in this study, we experimented on eliciting subjects’ emotions by watching six eliciting emotional video clips. After watching each video clips, their speech responses were collected when they were interviewing with a clinician.

ISCSLP-2016-1014-1.pdf

ISCSLP-2016-1014-1.pdf (867)

Categories:: Audio and Acoustic Signal Processing

73 Views

The Correlation Between Signal Distance and Consonant Pronunciation in Mandarin Words

Read more about The Correlation Between Signal Distance and Consonant Pronunciation in Mandarin Words
Log in to post comments

In Mandarin language speaking, some consonant and vowel pairs are hard to be distinguished and pronounced clearly even for some native speakers. This study investigates the signal distance between consonants compared in pairs from the signal processing point of view to reveal the correlation of signal distance and consonant pronunciation. Some popular speech quality objective measures are innovatively applied to obtain the signal distance.

ISCSLP Poster_Correlation Between Signal Distance.pdf

ISCSLP Poster_Correlation Between Signal Distance.pdf (78)

Categories:: Audio and Acoustic Signal Processing

8 Views

Pronunciation Error Detection using DNN Articulatory Model based on Multi-lingual and Multi-task Learning

poster-v2.pdf

poster-v2.pdf (305)

Categories:: Audio and Acoustic Signal Processing

7 Views

Spatial Co-variation of Lip and Tongue at Strong and Weak Syllables

Read more about Spatial Co-variation of Lip and Tongue at Strong and Weak Syllables
Log in to post comments

Speech production requires control for coordination among different articulatory organs. During the natural speech, the articulatory co-variation is more common rather than compensation, but the studies supporting this view are few. In this study, the coordination of lip and tongue articulation was examined during speech using articulatory data. Native speakers of Chinese served as subjects. Speech materials consisted of short Chinese sentences, which include words having the cardinal vowels at different locations in sentences with and without emphasis.

ZJ_ISCSLP2016_kh.pdf

ISCSLP2016_POSTER (707)

Categories:: Audio and Acoustic Signal Processing

14 Views

The Examination of the Relationship between Perception and Production of Mandarin tone of Kazak Students

This study aims at examination on the relationship between the
perception and production of Mandarin tone by Kazak minor
learners from China. The eight-day perceptual training course
of Mandarin tone is designed. Perception is assessed by means
of identification test. Production data is collected both at
pretest and post-test, and evaluated by native speakers of
Mandarin Chinese. The results from the perception at pretest
and post-test reveal that training Kazak learners to perceive
Mandarin tones has been shown to be effective, with

ISCSLP168.pdf

iscslp168 (699)

Categories:: Audio and Acoustic Signal Processing

65 Views

Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis

Directions into Velocities of Articulators (DIVA) model is a kind of self-adaptive neural network model which controls movements of a simulated vocal tract to produce words, syllables or phonemes. However, DIVA model lacks of emotion functions. To implement the emotion function in DIVA model, we investigate the process of affective speech production based on the combination of fundamental frequency (F0) and formant frequencies, as well as the relations between F0 and formants of emotional speech.

ISCSLP_POSTER_20161010.pdf

poster (319)

Categories:: Audio and Acoustic Signal Processing

13 Views

A post-thyroidectomy voice quality study in patients suffering or not from Recurrent Laryngeal paralysis

The main object of this study is voice quality after total thyroidectomy (which involves complete removal of the thyroid gland) or isthmolobectomie (which involves removal of the half, right or left, portions of the gland). This often causes degradation of voice quality permanently or temporarily. Voice quality will be studied using aerodynamic cues. From an aerodynamic point of view, oral airflow (Oaf) and maximum phonation time (TMP) were observed.

Für Tianjin 2016.pdf

Für Tianjin 2016.pdf (70)

Categories:: Audio and Acoustic Signal Processing

7 Views

IEEE SP Cup 2016 Project Report by Team "10Hertz": Exploring Power Signals for Location Forensics of Media Recordings

Electric Network Frequency is the frequency of power distribution networks in power grids that fluctuates about a nominal value with respect to the changing loads.Its ubiquitous nature has made notable contributions to forensic analysis that has substantiated its use as a significant tool in this area. In this paper we have proposed a technique to identify the power grid in which the ENF containing signal was recorded, without the assistance of concurrent power references.

10Hertz_Paper.pdf

10Hertz_Paper.pdf (1284)

Categories:: Audio and Acoustic Signal Processing
Signal Processing Theory and Methods

126 Views

A HIGH PERFORMANCE BASEBAND INSTRUMENT

Read more about A HIGH PERFORMANCE BASEBAND INSTRUMENT
Log in to post comments

Testing complex digital signal processors (DSPs) requires a development platform with sufficient
signal bandwidth and system performance to fully exercise the DSP. Without a development plat-
form, verification of DSPs would be limited to monitoring test output signals for an indication of
performance and successful operation. In addition, a development platform with high-speed analog
input and output interfaces to the DSP system allows it to be used directly in many sophisticated