Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Spatial audio feature discovery with convolutional neural networks

Abstract: 

The advent of mixed reality consumer products brings about a pressing need to develop and improve spatial sound rendering techniques for a broad user base. Despite a large body of prior work, the precise nature and importance of various sound localization cues and how they should be personalized for an individual user to improve localization performance is still an open research problem. Here we propose training a convolutional neural network (CNN) to classify the elevation angle of spatially rendered sounds and employing Layerwise Relevance Propagation (LRP) on the trained CNN model. LRP provides saliency maps that can be used to identify spectral features used by the network for classification. These maps, in addition to the convolution filters learned by the CNN, are discussed in the context of listening tests reported in the literature. The proposed approach could potentially provide an avenue for future studies on modeling and personalization of head-related transfer functions (HRTFs).

up
0 users have voted:

Paper Details

Authors:
Etienne Thuillier, Hannes Gamper, Ivan J. Tashev
Submitted On:
30 May 2018 - 7:50am
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Etienne Thuillier
Paper Code:
ICASSP18001
Document Year:
2018
Cite

Document Files

Spatial_audio_feature_discovery_ICASSP_2018.pdf

(141)

Subscribe

[1] Etienne Thuillier, Hannes Gamper, Ivan J. Tashev, "Spatial audio feature discovery with convolutional neural networks", IEEE SigPort, 2018. [Online]. Available: http://sigport.org/2975. Accessed: Jun. 16, 2019.
@article{2975-18,
url = {http://sigport.org/2975},
author = {Etienne Thuillier; Hannes Gamper; Ivan J. Tashev },
publisher = {IEEE SigPort},
title = {Spatial audio feature discovery with convolutional neural networks},
year = {2018} }
TY - EJOUR
T1 - Spatial audio feature discovery with convolutional neural networks
AU - Etienne Thuillier; Hannes Gamper; Ivan J. Tashev
PY - 2018
PB - IEEE SigPort
UR - http://sigport.org/2975
ER -
Etienne Thuillier, Hannes Gamper, Ivan J. Tashev. (2018). Spatial audio feature discovery with convolutional neural networks. IEEE SigPort. http://sigport.org/2975
Etienne Thuillier, Hannes Gamper, Ivan J. Tashev, 2018. Spatial audio feature discovery with convolutional neural networks. Available at: http://sigport.org/2975.
Etienne Thuillier, Hannes Gamper, Ivan J. Tashev. (2018). "Spatial audio feature discovery with convolutional neural networks." Web.
1. Etienne Thuillier, Hannes Gamper, Ivan J. Tashev. Spatial audio feature discovery with convolutional neural networks [Internet]. IEEE SigPort; 2018. Available from : http://sigport.org/2975