Learning Multiple Sound Source 2D Localization

Abstract: 

In this paper, we propose novel deep-learning-based algorithms for multiple sound source localization. Specifically, we aim to find the 2D Cartesian coordinates of multiple sound sources in an enclosed environment using multiple microphone arrays. To this end, we use an encoding-decoding architecture and propose two improvements to it. We also propose two novel localization representations that increase accuracy. Lastly, we develop new metrics that rely on resolution-based multiple source association, allowing us to evaluate and compare different localization approaches. We tested our method on both synthetic and real-world data. The results show that our method improves upon the previous baseline approach for this problem.

Paper Details

Authors: Guillaume Le Moing, Phongtharin Vinayavekhin, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana, Don Joven Agravante
Submitted On: 29 September 2019 - 4:58am
Short Link:
Type: Presentation Slides
Event:
Presenter's Name: Phongtharin Vinayavekhin
Paper Code: MMSP-105
Document Year: 2019

Document Files

2019-mmsp-presentation_v6_sigport.pdf


[1] Guillaume Le Moing, Phongtharin Vinayavekhin, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana, Don Joven Agravante, "Learning Multiple Sound Source 2D Localization", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4835. Accessed: Dec. 13, 2019.