- Detection of Mood Disorder Using Speech Emotion Profiles and LSTM
In mood disorder diagnosis, patients with bipolar disorder (BD) are often misdiagnosed as having unipolar depression (UD) on initial presentation. Establishing an accurate distinction between BD and UD is crucial for a correct and early diagnosis, which leads to improvements in treatment and course of illness. To address this misdiagnosis problem, in this study we elicited subjects' emotions by having them watch six emotion-eliciting video clips. After each clip, their speech responses were collected during an interview with a clinician.
- The Correlation Between Signal Distance and Consonant Pronunciation in Mandarin Words
In spoken Mandarin, some consonant and vowel pairs are hard to distinguish and to pronounce clearly, even for some native speakers. This study investigates the signal distance between consonants compared in pairs, from a signal processing point of view, to reveal the correlation between signal distance and consonant pronunciation. Several popular objective speech quality measures are applied in a novel way to obtain the signal distance.
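To make the idea of a "signal distance" between two speech segments concrete, here is a minimal sketch of one common objective measure, the log-spectral distance. The abstract does not say which measures the authors used, so the choice of measure, the function names `power_spectrum` and `log_spectral_distance`, the `floor` parameter, and the synthetic test frames are all assumptions made for illustration.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum (non-negative frequency bins only)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        w = -2j * math.pi * k / n
        acc = sum(x * cmath.exp(w * i) for i, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def log_spectral_distance(frame_a, frame_b, floor=1e-10):
    """Root-mean-square difference of two log power spectra, in dB.

    `floor` (an assumed value) keeps the logarithm finite for empty bins.
    """
    pa, pb = power_spectrum(frame_a), power_spectrum(frame_b)
    diffs = [10 * math.log10((a + floor) / (b + floor)) for a, b in zip(pa, pb)]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


# Hypothetical stand-ins for two consonant frames: pure tones at 500 Hz and 1000 Hz.
fs = 8000
frame_a = [math.sin(2 * math.pi * 500 * n / fs) for n in range(64)]
frame_b = [math.sin(2 * math.pi * 1000 * n / fs) for n in range(64)]

d_ab = log_spectral_distance(frame_a, frame_b)
d_ba = log_spectral_distance(frame_b, frame_a)
```

In practice the frames would be windowed excerpts of recorded consonants rather than synthetic tones, and the distance would typically be averaged over many frames.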
- Pronunciation Error Detection using DNN Articulatory Model based on Multi-lingual and Multi-task Learning
poster-v2.pdf
- Spatial Co-variation of Lip and Tongue at Strong and Weak Syllables
Speech production requires coordinated control of different articulatory organs. During natural speech, articulatory co-variation is more common than compensation, but few studies support this view. In this study, the coordination of lip and tongue articulation was examined during speech using articulatory data. Native speakers of Chinese served as subjects. The speech materials consisted of short Chinese sentences containing words with the cardinal vowels at different positions in the sentence, with and without emphasis.
- The Examination of the Relationship between Perception and Production of Mandarin tone of Kazak Students
This study examines the relationship between the perception and production of Mandarin tones by Kazak minor learners from China. An eight-day perceptual training course on Mandarin tones is designed. Perception is assessed by means of an identification test. Production data are collected at both pretest and post-test and evaluated by native speakers of Mandarin Chinese. The perception results at pretest and post-test reveal that training Kazak learners to perceive Mandarin tones is effective, with
ISCSLP168.pdf
- Study on the Relation of Fundamental and Formant Frequencies for Affective Speech Synthesis
The Directions Into Velocities of Articulators (DIVA) model is a self-adaptive neural network model that controls the movements of a simulated vocal tract to produce words, syllables, or phonemes. However, the DIVA model lacks emotion functions. To implement an emotion function in the DIVA model, we investigate the process of affective speech production based on the combination of fundamental frequency (F0) and formant frequencies, as well as the relations between F0 and the formants of emotional speech.
- A post-thyroidectomy voice quality study in patients suffering or not from Recurrent Laryngeal paralysis
The main object of this study is voice quality after total thyroidectomy (complete removal of the thyroid gland) or isthmolobectomy (removal of one half, right or left, of the gland). Such surgery often causes permanent or temporary degradation of voice quality. Voice quality is studied using aerodynamic cues: oral airflow (Oaf) and maximum phonation time (TMP) were observed.
- IEEE SP Cup 2016 Project Report by Team "10Hertz": Exploring Power Signals for Location Forensics of Media Recordings
Electric network frequency (ENF) is the frequency of the power distribution networks in a power grid, which fluctuates about a nominal value as the load changes. Its ubiquity has made it a significant tool in forensic analysis. In this paper, we propose a technique to identify the power grid in which an ENF-containing signal was recorded, without the assistance of concurrent power references.
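As a rough illustration of what an ENF trace is, the sketch below estimates one frequency value per frame by searching for the strongest spectral component near the nominal value. This is not the technique proposed in the report, which the abstract does not detail; the function name `enf_trace`, the 50 Hz nominal value, the search range, the step size, and the frame length are all assumptions chosen for the example.

```python
import cmath
import math


def enf_trace(samples, fs, nominal_hz=50.0, search_hz=0.5, step_hz=0.01, frame_s=2.0):
    """Estimate one ENF value per frame by a grid search around the nominal frequency."""
    frame_len = int(frame_s * fs)
    n_steps = int(round(2 * search_hz / step_hz))
    candidates = [nominal_hz - search_hz + k * step_hz for k in range(n_steps + 1)]
    trace = []
    # Non-overlapping frames; a trailing partial frame is ignored.
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        best_f, best_power = nominal_hz, -1.0
        for f in candidates:
            # Single-bin DFT: correlate the frame with a complex exponential at f.
            w = -2j * math.pi * f / fs
            acc = sum(x * cmath.exp(w * n) for n, x in enumerate(frame))
            power = abs(acc) ** 2
            if power > best_power:
                best_f, best_power = f, power
        trace.append(best_f)
    return trace


# Synthetic check: a 50.12 Hz hum sampled at 400 Hz for 4 seconds.
fs = 400
hum = [math.sin(2 * math.pi * 50.12 * n / fs) for n in range(4 * fs)]
trace = enf_trace(hum, fs)
```

A grid such as this trades speed for simplicity; a real recording would usually be band-pass filtered around the nominal frequency first, and grids with a 60 Hz nominal value would need `nominal_hz` changed accordingly.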
- A HIGH PERFORMANCE BASEBAND INSTRUMENT
Testing complex digital signal processors (DSPs) requires a development platform with sufficient signal bandwidth and system performance to fully exercise the DSP. Without a development platform, verification of DSPs would be limited to monitoring test output signals for an indication of performance and successful operation. In addition, a development platform with high-speed analog input and output interfaces to the DSP system allows it to be used directly in many sophisticated