Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Learning Environmental Sounds with End-to-end Convolutional Neural Network

Abstract: 

Environmental sound classification (ESC) is usually conducted based on handcrafted features such as the log-mel feature. Meanwhile, end-to-end classification systems perform feature extraction jointly with classification and have achieved success particularly in image classification. In the same manner, if environmental sounds could be directly learned from the raw waveforms, we would be able to extract a new feature effective for classification that could not have been designed by humans, and this new feature could improve the classification performance. In this paper, we propose a novel end-to-end ESC system using a convolutional neural network (CNN). The classification accuracy of our system on ESC-50 is 5.1% higher than that achieved when using logmel-CNN with the static log-mel feature. Moreover, we achieve a 6.5% improvement in classification accuracy over the state-of-the-art logmel-CNN with
the static and delta log-mel feature, simply by combining our system and logmel-CNN.

poster1.pdf

PDF icon Poster (147 downloads)
up
0 users have voted:

Paper Details

Authors:
Yuji Tokozume, Tatsuya Harada
Submitted On:
3 March 2017 - 12:53am
Short Link:
Type:
Poster
Event:
Presenter's Name:
Yuji Tokozume
Paper Code:
MLSP-P8.1
Document Year:
2017
Cite

Document Files

Poster

(147 downloads)

Subscribe

[1] Yuji Tokozume, Tatsuya Harada, "Learning Environmental Sounds with End-to-end Convolutional Neural Network", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1599. Accessed: Sep. 19, 2017.
@article{1599-17,
url = {http://sigport.org/1599},
author = {Yuji Tokozume; Tatsuya Harada },
publisher = {IEEE SigPort},
title = {Learning Environmental Sounds with End-to-end Convolutional Neural Network},
year = {2017} }
TY - EJOUR
T1 - Learning Environmental Sounds with End-to-end Convolutional Neural Network
AU - Yuji Tokozume; Tatsuya Harada
PY - 2017
PB - IEEE SigPort
UR - http://sigport.org/1599
ER -
Yuji Tokozume, Tatsuya Harada. (2017). Learning Environmental Sounds with End-to-end Convolutional Neural Network. IEEE SigPort. http://sigport.org/1599
Yuji Tokozume, Tatsuya Harada, 2017. Learning Environmental Sounds with End-to-end Convolutional Neural Network. Available at: http://sigport.org/1599.
Yuji Tokozume, Tatsuya Harada. (2017). "Learning Environmental Sounds with End-to-end Convolutional Neural Network." Web.
1. Yuji Tokozume, Tatsuya Harada. Learning Environmental Sounds with End-to-end Convolutional Neural Network [Internet]. IEEE SigPort; 2017. Available from : http://sigport.org/1599