Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification

Abstract: 

In the conventional frame feature based music genre
classification methods, the audio data is represented by
independent frames and the sequential nature of audio is totally
ignored. If the sequential knowledge is well modeled and
combined, the classification performance can be significantly
improved. The long short-term memory(LSTM) recurrent
neural network (RNN) which uses a set of special memory
cells to model for long-range feature sequence, has been
successfully used for many sequence labeling and sequence
prediction tasks. In this paper, we propose the LSTM RNN
based segment features for music genre classification. The
LSTM RNN is used to learn the representation of LSTM frame
feature. The segment features are the statistics of frame features
in each segment. Furthermore, the LSTM segment feature
is combined with the segment representation of initial frame
feature to obtain the fusional segment feature. The evaluation
on ISMIR database show that the LSTM segment feature
performs better than the frame feature. Overall, the fusional
segment feature achieves 89.71% classification accuracy,
about 4.19% improvement over the baseline model using deep
neural network (DNN). This significant improvement show the
effectiveness of the proposed segment feature.

up
0 users have voted:

Paper Details

Authors:
Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu
Submitted On:
14 October 2016 - 9:18am
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Jia Dai
Document Year:
2016
Cite

Document Files

ISCSLP2016_JiaDai_pptA4.pdf

(330 downloads)

Subscribe

[1] Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu, "Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1195. Accessed: Nov. 19, 2018.
@article{1195-16,
url = {http://sigport.org/1195},
author = {Jia Dai; Shan Liang; Wei Xue; Chongjia Ni; Wenju Liu },
publisher = {IEEE SigPort},
title = {Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification},
year = {2016} }
TY - EJOUR
T1 - Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification
AU - Jia Dai; Shan Liang; Wei Xue; Chongjia Ni; Wenju Liu
PY - 2016
PB - IEEE SigPort
UR - http://sigport.org/1195
ER -
Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu. (2016). Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification. IEEE SigPort. http://sigport.org/1195
Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu, 2016. Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification. Available at: http://sigport.org/1195.
Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu. (2016). "Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification." Web.
1. Jia Dai, Shan Liang, Wei Xue, Chongjia Ni, Wenju Liu. Long Short-term Memory Recurrent Neural Network based Segment Features for Music Genre Classification [Internet]. IEEE SigPort; 2016. Available from : http://sigport.org/1195