Sorry, you need to enable JavaScript to visit this website.

Speech Landmark Bigrams for Depression Detection from Naturalistic Smartphone Speech

Citation Author(s):
Zhaocheng Huang, Julien Epps, Dale Joachim
Submitted by:
Zhaocheng Huang
Last updated:
6 June 2019 - 4:42am
Document Type:
Presentation Slides
Document Year:
Zhaocheng Huang
Paper Code:


Detection of depression from speech has attracted significant research attention in recent years but remains a challenge, particularly for speech from diverse smartphones in natural environments. This paper proposes two sets of novel features based on speech landmark bigrams associated with abrupt speech articulatory events for depression detection from smartphone audio recordings. Combined with techniques adapted from natural language text processing, the proposed features further exploit landmark bigrams by discovering latent articulatory events. Experimental results on a large, naturalistic corpus containing various spoken tasks recorded from diverse smartphones suggest that speech landmark bigram features provide a 30.1% relative improvement in F1 (depressed) relative to an acoustic feature baseline system. As might be expected, a key finding was the importance of tailoring the choice of landmark bigrams to each elicitation task, revealing that different aspects of speech articulation are elicited by different tasks, which can be effectively captured by the landmark approaches.

1 user has voted: Zhaocheng Huang