Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Automatic identification of speakers from head gestures in a narration

Abstract: 

In this work, we focus on quantifying speaker identity information encoded in the head gestures of speakers, while they narrate a story. We hypothesize that the head gestures over a long duration have speaker-specific patterns. To establish this, we consider a classification problem to identify speakers from head gestures. We represent every head orientation as a triplet of Euler angles and a sequence of head orientations as head gestures. We use a database having recordings from 24 speakers where the head movements are recorded using a motion capture device, with each subject narrating ten stories. We get the best speaker identification accuracy of 0.836 using head gestures over a duration of 40 seconds. Further, the accuracy increases by combining decisions from multiple 40 second windows when a recording is available with duration more than the window length. We achieve an average accuracy of 0.9875 on our database when the entire recording is used. Analysis of the speaker identification performance over 40 second windows across a recording reveals that the speaker-identity information is more prevalent in some parts of a story than others.

up
0 users have voted:

Paper Details

Authors:
Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh
Submitted On:
16 May 2020 - 10:52am
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Sanjeev Kadagathur Vadiraj
Document Year:
2020
Cite

Document Files

ICASSP_2020_Presentation.pdf

(40)

Subscribe

[1] Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh, "Automatic identification of speakers from head gestures in a narration", IEEE SigPort, 2020. [Online]. Available: http://sigport.org/5375. Accessed: Sep. 26, 2020.
@article{5375-20,
url = {http://sigport.org/5375},
author = {Sanjeev Kadagathur Vadiraj; Achuth Rao M V; Prasanta Kumar Ghosh },
publisher = {IEEE SigPort},
title = {Automatic identification of speakers from head gestures in a narration},
year = {2020} }
TY - EJOUR
T1 - Automatic identification of speakers from head gestures in a narration
AU - Sanjeev Kadagathur Vadiraj; Achuth Rao M V; Prasanta Kumar Ghosh
PY - 2020
PB - IEEE SigPort
UR - http://sigport.org/5375
ER -
Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh. (2020). Automatic identification of speakers from head gestures in a narration. IEEE SigPort. http://sigport.org/5375
Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh, 2020. Automatic identification of speakers from head gestures in a narration. Available at: http://sigport.org/5375.
Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh. (2020). "Automatic identification of speakers from head gestures in a narration." Web.
1. Sanjeev Kadagathur Vadiraj, Achuth Rao M V, Prasanta Kumar Ghosh. Automatic identification of speakers from head gestures in a narration [Internet]. IEEE SigPort; 2020. Available from : http://sigport.org/5375