
Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection

Abstract: 

The paper introduces a hierarchy-aware loss function for a deep neural network trained on an audio event detection task with a bi-level, tree-structured label space. The goal is not only to improve detection performance at both levels of the label hierarchy, but also to produce better audio embeddings. The hierarchy-aware loss exploits the label tree structure so that the hierarchy information is preserved in the loss. Two loss functions are employed separately. The first is a triplet loss with probabilistic multi-level batch mining. The second is a quadruplet learning method, a special case of generalized triplet learning for a bi-level label taxonomy. Training is performed in a multi-task learning framework that jointly optimizes a cross-entropy loss and the hierarchy-aware loss. The proposed method outperforms the baseline cross-entropy models at both levels of the hierarchy. Clustering experiments indicate that the multi-task model also learns better audio representations. Moreover, the model transfers well when evaluated on an out-of-domain dataset.
https://ieeexplore.ieee.org/document/8682341
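
To make the idea in the abstract concrete, the following is a minimal sketch of one plausible quadruplet loss for a bi-level label tree, combined with cross-entropy terms in a multi-task objective. It is not taken from the paper: the squared Euclidean distance, the two margins (m_fine, m_coarse), the loss weight, and all function names are assumptions chosen for illustration. The quadruplet consists of an anchor, a positive (same fine label), a semi-positive (same coarse label, different fine label), and a negative (different coarse label).

```python
import math


def sq_dist(u, v):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((a - b) ** 2 for a, b in zip(u, v))


def quadruplet_loss(anchor, pos, semi_pos, neg, m_fine=0.2, m_coarse=0.2):
    """Hypothetical hierarchy-aware quadruplet loss (illustrative only).

    Encourages d(anchor, pos) < d(anchor, semi_pos) < d(anchor, neg),
    with one hinge term per level of the bi-level label tree.
    """
    d_ap = sq_dist(anchor, pos)
    d_as = sq_dist(anchor, semi_pos)
    d_an = sq_dist(anchor, neg)
    fine_term = max(0.0, d_ap - d_as + m_fine)        # fine-level ordering
    coarse_term = max(0.0, d_as - d_an + m_coarse)    # coarse-level ordering
    return fine_term + coarse_term


def cross_entropy(logits, target):
    """Softmax cross entropy for a single example (log-sum-exp form)."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[target]


def multitask_loss(fine_logits, fine_target, coarse_logits, coarse_target,
                   quad, weight=1.0):
    """Assumed joint objective: both classification losses plus the
    weighted hierarchy-aware term."""
    return (cross_entropy(fine_logits, fine_target)
            + cross_entropy(coarse_logits, coarse_target)
            + weight * quad)
```

When the embeddings already respect the hierarchy by more than the margins, the quadruplet term is zero and only the classification terms contribute. When all three distances collapse to the same value, both hinge terms are active and the loss equals the sum of the two margins.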


Paper Details

Authors:
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou
Submitted On:
10 May 2019 - 1:02am
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Jaekwon Yoo
Paper Code:
2594
Document Year:
2019

Document Files

icassp_arindam_hierarchy.pptx



[1] Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou, "Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4060. Accessed: Jul. 04, 2020.
@article{4060-19,
url = {http://sigport.org/4060},
author = {Arindam Jati and Naveen Kumar and Ruxin Chen and Panayiotis Georgiou},
publisher = {IEEE SigPort},
title = {Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection},
year = {2019} }
TY - EJOUR
T1 - Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection
AU - Arindam Jati
AU - Naveen Kumar
AU - Ruxin Chen
AU - Panayiotis Georgiou
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4060
ER -
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou. (2019). Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection. IEEE SigPort. http://sigport.org/4060
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou, 2019. Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection. Available at: http://sigport.org/4060.
Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou. (2019). "Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection." Web.
1. Arindam Jati, Naveen Kumar, Ruxin Chen, Panayiotis Georgiou. Hierarchy-aware Loss Function on a Tree Structured Label Space for Audio Event Detection [Internet]. IEEE SigPort; 2019. Available from: http://sigport.org/4060