COMPRESSED-DOMAIN VIDEO CLASSIFICATION WITH DEEP NEURAL NETWORKS: “THERE’S WAY TOO MUCH INFORMATION TO DECODE THE MATRIX”

Abstract: 

We investigate video classification via a 3D deep convolutional neural network (CNN) that directly ingests compressed bitstream information. The idea rests on the observation that video macroblock (MB) motion vectors, which are very compact and directly available from the compressed bitstream, inherently capture local spatiotemporal changes in each video scene. Our results on two standard video datasets show that our approach outperforms pixel-based approaches and remains within 10 percentage points of the best classification results reported by highly complex optical-flow and deep-CNN methods. At the same time, a CPU-based realization of our approach is more than 680 times faster at motion extraction than GPU-based optical flow methods, and it offers a 2- to 3.4-fold reduction in the number of deep CNN weights compared to recent architectures. This indicates that deep learning based on compressed video bitstream information may allow advanced video classification to be deployed on very large datasets using commodity CPU hardware. Source code and further demonstration results are available at http://www.github.com/mvcnn.
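To make the idea of feeding macroblock motion vectors to a 3D CNN more concrete, the following is a minimal illustrative sketch and not the authors' implementation. It assumes a 16x16 macroblock size, a hypothetical 240x320 frame, and randomly generated motion vectors standing in for values that a real decoder would read from the bitstream. The helper names (`mv_grid_shape`, `random_mv_clip`, `conv3d_valid`) are invented for this example. It stacks per-frame (dx, dy) fields into a two-channel spatiotemporal volume and applies a single naive 3D filter across time and space.

```python
import random

MB_SIZE = 16  # assumed macroblock size in pixels (illustrative, not from the abstract)


def mv_grid_shape(height, width, mb_size=MB_SIZE):
    """Macroblock rows and columns for a frame, rounding up for partial blocks."""
    return (-(-height // mb_size), -(-width // mb_size))


def random_mv_clip(num_frames, rows, cols, max_disp=8, seed=0):
    """Stand-in for decoder output: nested list indexed [channel][frame][row][col].

    Channel 0 holds horizontal (dx) and channel 1 vertical (dy) displacements.
    """
    rng = random.Random(seed)
    return [[[[rng.randint(-max_disp, max_disp) for _ in range(cols)]
              for _ in range(rows)]
             for _ in range(num_frames)]
            for _ in range(2)]


def conv3d_valid(volume, kernel):
    """Naive single-filter 3D cross-correlation with no padding.

    volume: [C][T][H][W], kernel: [C][kT][kH][kW]
    returns: [T-kT+1][H-kH+1][W-kW+1]
    """
    C, T, H, W = len(volume), len(volume[0]), len(volume[0][0]), len(volume[0][0][0])
    kT, kH, kW = len(kernel[0]), len(kernel[0][0]), len(kernel[0][0][0])
    out = [[[0.0] * (W - kW + 1) for _ in range(H - kH + 1)]
           for _ in range(T - kT + 1)]
    for t in range(T - kT + 1):
        for y in range(H - kH + 1):
            for x in range(W - kW + 1):
                acc = 0.0
                for c in range(C):
                    for dt in range(kT):
                        for dy in range(kH):
                            for dx in range(kW):
                                acc += volume[c][t + dt][y + dy][x + dx] * kernel[c][dt][dy][dx]
                out[t][y][x] = acc
    return out


# Hypothetical clip: 8 frames of 240x320 video.
rows, cols = mv_grid_shape(240, 320)
clip = random_mv_clip(8, rows, cols)

# One 3x3x3 averaging filter over both motion channels (2 * 27 = 54 taps).
kernel = [[[[1.0 / 54] * 3 for _ in range(3)] for _ in range(3)] for _ in range(2)]
features = conv3d_valid(clip, kernel)

# Input size comparison for one frame under the assumptions above.
mv_values_per_frame = 2 * rows * cols
rgb_values_per_frame = 3 * 240 * 320
```

Under these assumptions, each frame contributes 600 motion-vector values versus 230,400 RGB values, which illustrates why the abstract describes motion vectors as very compact. A real system would use a trained multi-layer network and actual decoder output in place of the random data and single averaging filter shown here.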

Paper Details

Authors:
Aaron Chadha, Alhabib Abbas, Yiannis Andreopoulos
Submitted On:
14 September 2017 - 8:03pm
Short Link:
Type:
Presentation Slides
Event:
Presenter's Name:
Aaron Chadha
Paper Code:
2238
Document Year:
2017
Cite

Document Files

Compressed_domain_video_classification.pdf

(253 downloads)

[1] Aaron Chadha, Alhabib Abbas, Yiannis Andreopoulos, "COMPRESSED-DOMAIN VIDEO CLASSIFICATION WITH DEEP NEURAL NETWORKS: “THERE’S WAY TOO MUCH INFORMATION TO DECODE THE MATRIX”", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/2057. Accessed: Jun. 18, 2018.