Sorry, you need to enable JavaScript to visit this website.

Automatic Assessment of the Degree of Clinical Depression from Speech Using X-Vectors

Citation Author(s):
José Vicente Egas-López, Gábor Kiss, Sztahó David, Gábor Gosztolya,
Submitted by:
Jose Egas-Lopez
Last updated:
5 May 2022 - 6:00am
Document Type:
Research Manuscript
Document Year:
2022
Event:
Presenters:
Jose Egas-Lopez
Paper Code:
SPE-86.5
 

Depression is a frequent and curable psychiatric disorder, detrimentally affecting daily activities, harming both work-place productivity and personal relationships. Among many other symptoms, depression is associated with disordered
speech production, which might permit its automatic screening by means of the speech of the subject. However, the choice of actual features extracted from the recordings is not trivial. In this study, we employ x-vectors, a DNN-based
feature extractor technique, to detect depression from a Hungarian corpus. We experiment with training custom x-vector extractors, and we also explore the performance of an out-of-domain pre-trained one. Our findings confirm that x-vectors are able to capture meaningful speaker traits that contain information for depression discrimination. We also show that the language of the extractor is of secondary importance compared to the frame-level feature set: our best model, which achieved an AUC score of 0.940 and an RMSE score of 9.54, was trained on log-energies instead of MFCCs.

up
0 users have voted: