Sorry, you need to enable JavaScript to visit this website.

Audio for Multimedia

A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification


Sound event detection is the task of detecting the type, starting time, and ending time of sound events in audio streams. Recently, recurrent neural networks (RNNs) have become the mainstream solution for sound event detection. Because RNNs make a prediction at every frame, it is necessary to provide exact starting and ending times of the sound events in the training data, making data annotation an extremely time-consuming process.

Paper Details

Authors:
Florian Metze
Submitted On:
27 February 2017 - 5:12pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

2017.03 Poster for ICASSP.pdf

(74 downloads)

Keywords

Subscribe

[1] Florian Metze, "A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1451. Accessed: Oct. 19, 2017.
@article{1451-17,
url = {http://sigport.org/1451},
author = {Florian Metze },
publisher = {IEEE SigPort},
title = {A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification},
year = {2017} }
TY - EJOUR
T1 - A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification
AU - Florian Metze
PY - 2017
PB - IEEE SigPort
UR - http://sigport.org/1451
ER -
Florian Metze. (2017). A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification. IEEE SigPort. http://sigport.org/1451
Florian Metze, 2017. A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification. Available at: http://sigport.org/1451.
Florian Metze. (2017). "A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification." Web.
1. Florian Metze. A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification [Internet]. IEEE SigPort; 2017. Available from : http://sigport.org/1451

Natural Sound Rendering for Headphones: Integration of signal processing techniques


With the strong growth of assistive and personal listening devices, natural sound rendering over headphones is becoming a necessity for prolonged listening in multimedia and virtual reality applications. The aim of natural sound rendering is to naturally recreate the sound scenes with the spatial and timbral quality as natural as possible, so as to achieve a truly immersive listening experience. However, rendering natural sound over headphones encounters many challenges. This tutorial article presents signal processing techniques to tackle these challenges to assist human listening.

Paper Details

Authors:
Kaushik Sunder, Ee-Leng Tan
Submitted On:
23 February 2016 - 1:44pm
Short Link:
Type:

Document Files

SPM2015manuscript-Natural Sound Rendering for Headphones.pdf

(382 downloads)

Keywords

Subscribe

[1] Kaushik Sunder, Ee-Leng Tan, "Natural Sound Rendering for Headphones: Integration of signal processing techniques", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/166. Accessed: Oct. 19, 2017.
@article{166-15,
url = {http://sigport.org/166},
author = {Kaushik Sunder; Ee-Leng Tan },
publisher = {IEEE SigPort},
title = {Natural Sound Rendering for Headphones: Integration of signal processing techniques},
year = {2015} }
TY - EJOUR
T1 - Natural Sound Rendering for Headphones: Integration of signal processing techniques
AU - Kaushik Sunder; Ee-Leng Tan
PY - 2015
PB - IEEE SigPort
UR - http://sigport.org/166
ER -
Kaushik Sunder, Ee-Leng Tan. (2015). Natural Sound Rendering for Headphones: Integration of signal processing techniques. IEEE SigPort. http://sigport.org/166
Kaushik Sunder, Ee-Leng Tan, 2015. Natural Sound Rendering for Headphones: Integration of signal processing techniques. Available at: http://sigport.org/166.
Kaushik Sunder, Ee-Leng Tan. (2015). "Natural Sound Rendering for Headphones: Integration of signal processing techniques." Web.
1. Kaushik Sunder, Ee-Leng Tan. Natural Sound Rendering for Headphones: Integration of signal processing techniques [Internet]. IEEE SigPort; 2015. Available from : http://sigport.org/166