Multimodal signal processing

VISUAL AND TEXTUAL SENTIMENT ANALYSIS USING DEEP FUSION CONVOLUTIONAL NEURAL NETWORKS

Read more about VISUAL AND TEXTUAL SENTIMENT ANALYSIS USING DEEP FUSION CONVOLUTIONAL NEURAL NETWORKS
Log in to post comments

Sentiment analysis is attracting more and more attentions and has become a very hot research topic due to its potential applications in personalized recommendation, opinion mining, etc. Most of the existing methods are based on either textual or visual data and can not achieve satisfactory results, as it is very hard to extract sufficient information from only one single modality data.

Chen_ICIP17_DeepFusion_Slides.pdf

Chen_ICIP17_DeepFusion_Slides.pdf (475)

Categories:: Multimodal signal processing

20 Views

CHAM: ACTION RECOGNITION USING CONVOLUTIONAL HIERARCHICAL ATTENTION MODEL

Read more about CHAM: ACTION RECOGNITION USING CONVOLUTIONAL HIERARCHICAL ATTENTION MODEL
Log in to post comments

(1)We improved the soft attention model by introducing convolutional
operations inside the LSTM cell and attention map generation process to capture the spatial layout.
(2)We built a hierarchical two layer LSTM model for action recognition. (3)We tested our model on
three widely applied datasets, the UCF sports dataset, the Olympic dataset and the HMDB51 dataset
with improved results on other published work.

poster.pdf

poster.pdf (940)

Categories:: Multimodal signal processing

19 Views

EMOTION RECOGNITION THROUGH INTEGRATING EEG AND PERIPHERAL SIGNALS

Read more about EMOTION RECOGNITION THROUGH INTEGRATING EEG AND PERIPHERAL SIGNALS
Log in to post comments

The inherent dependencies among multiple physiological signals are crucial for multimodal emotion recognition, but have not been thoroughly exploited yet. This paper propose to use restricted Boltzmann machine (RBM) to model such dependencies.Specifically, the visible nodes of RBM represent EEG and peripheral physiological signals, and thus the connections between visible nodes and hidden nodes capture the intrinsic relations among multiple physiological signals. The RBM also generate new representation from multiple physiological signals.

ICASSP_YANGYANG.pdf

ICASSP_YANGYANG.pdf (563)

Categories:: Multimodal signal processing

25 Views

Personalized video emotion tagging through a topic model

Read more about Personalized video emotion tagging through a topic model
Log in to post comments

The inherent dependencies among video content, personal characteristics, and perceptual emotion are crucial for personalized video emotion tagging, but have not been thoroughly exploited. To address this, we propose a novel topic model to capture such inherent dependencies. We assume that there are several potential human factors, or “topics,” that affect the personal characteristics and the personalized emotion responses to videos.

ICASSP_SHAN.pdf

ICASSP_SHAN.pdf (501)

Categories:: Multimodal signal processing

7 Views

Use of Affect Based Interaction Classification for Continuous Emotion Tracking

Read more about Use of Affect Based Interaction Classification for Continuous Emotion Tracking
Log in to post comments

Natural and affective handshakes of two participants define the course of dyadic interaction. Affective states of the participants are expected to be correlated with the nature of the dyadic interaction. In this paper, we extract two classes of the dyadic interaction based on temporal clustering of affective states. We use the k-means temporal clustering to define the interaction classes, and utilize support vector machine based classifier to estimate the interaction class types from multimodal, speech and motion, features.

khaki-erzin-icassp17.pdf

khaki-erzin-icassp17.pdf (319)

Categories:: Multimodal signal processing
Multimedia human-machine interface and interaction

10 Views

A First Attempt at Polyphonic Sound Event Detection Using Connectionist Temporal Classification

Sound event detection is the task of detecting the type, starting time, and ending time of sound events in audio streams. Recently, recurrent neural networks (RNNs) have become the mainstream solution for sound event detection. Because RNNs make a prediction at every frame, it is necessary to provide exact starting and ending times of the sound events in the training data, making data annotation an extremely time-consuming process.

2017.03 Poster for ICASSP.pdf

2017.03 Poster for ICASSP.pdf (58)

Categories:: Audio for Multimedia
Multimodal signal processing

8 Views

ROBUST ONLINE MULTI-OBJECT TRACKING BASED ON KCF TRACKERS AND REASSIGNMENT

Read more about ROBUST ONLINE MULTI-OBJECT TRACKING BASED ON KCF TRACKERS AND REASSIGNMENT
Log in to post comments

There is a big challenge in online multi-object tracking-by-detection, which caused by frequent occlusions, false alarms or miss detections and other factors. In this paper, we pro-posed an improved fast online multi-object tracking method through taking into account the results of multiple single-object trackers and detections synthetically. To solve the ﬁxed scale problem of conventional kernelized correlation ﬁlter in single-object tracker we used, trackers are associated with de-tections based on position and size and then an adaptive mech-anism of trackers is established.

post1170.pdf

ROBUST ONLINE MULTI-OBJECT TRACKING BASED ON KCF TRACKERS AND REASSIGNMENT (673)

Categories:: Multimodal signal processing

31 Views

TRACKING HIERARCHICAL STRUCTURE OF WEB VIDEO GROUPS BASED ON SALIENT KEYWORD MATCHING INCLUDING SEMANTIC BROADNESS ESTIMATION

This paper presents a novel method to track the hierarchical structure of Web video groups on the basis of salient keyword matching including semantic broadness estimation. To the best of our knowledge, this paper is the first work to perform extraction and tracking of the hierarchical structure simultaneously. Specifically, the proposed method first extracts the hierarchical structure of Web video groups and salient keywords of them on the basis of an improved scheme of our previously reported method.

harakawa_globalsip2016.pdf

harakawa_globalsip2016.pdf (330)

Categories:: Multimodal signal processing
Multimedia computing systems and applications

6 Views

Discriminant Correlation Analysis for Feature Level Fusion with Application to Multimodal Biometrics

In this paper, we present Discriminant Correlation Analysis (DCA), a feature level fusion technique that incorporates the class associations in correlation analysis of the feature sets. DCA performs an effective feature fusion by maximizing the pair-wise correlations across the two feature sets, and at the same time, eliminating the between-class correlations and restricting the correlations to be within classes.

DCA_ICASSP16_Poster.pdf

DCA_ICASSP16_Poster.pdf (1708)

Categories:: Information Forensics and Security
Biometrics
Applications
Multimedia Forensics
Machine Learning for Signal Processing
Applications in Data Fusion (MLR-FUSI)
Pattern recognition and classification (MLR-PATT)
Other applications of machine learning (MLR-APPL)
Image, Video, and Multidimensional Signal Processing
Image/Video Processing
Multimedia Signal Processing
Multimodal signal processing
Sensor Array and Multichannel Signal Processing

118 Views

Siamese Neural Network based Gait Recognition for Human Identification

Read more about Siamese Neural Network based Gait Recognition for Human Identification
Log in to post comments

As the remarkable characteristics of remote accessed, robust and security, gait recognition has gained significant attention in the biometrics based human identification task. However, the existed methods mainly employ the handcrafted gait features, which cannot well handle the indistinctive inter-class differences and large intra-class variations of human gait in real-world situation. In this paper, we have developed a Siamese neural network based gait recognition framework to automatically extract robust and discriminative gait features for human identification.

Poster For ICASSP 2016 - Siamese Neural Network based Gait Recognition for Human Identification.pdf

Poster For ICASSP 2016 - Siamese Neural Network based Gait Recognition for Human Identification.pdf (80)

Categories:: Multimodal signal processing

20 Views

Multimodal signal processing

Pages