ICIP 2017

The International Conference on Image Processing (ICIP), sponsored by the IEEE Signal Processing Society, is the premier forum for the presentation of technological advances and research results in the fields of theoretical, experimental, and applied image and video processing. ICIP has been held annually since 1994, brings together leading engineers and scientists in image and video processing from around the world. Visit website.

Tiny Head Pose Classification by Bodily Cues

Read more about Tiny Head Pose Classification by Bodily Cues
Log in to post comments

The head pose is an important cue for computer vision. Traditionally considered in human computer interaction applications,
it becomes very hard to model in surveillance scenarios, due to the tiny head size. Additionally, no public dataset contains continuous head pose annotations in open scenery, making the challenge even harder to face. Here we present a
framework based on Faster RCNN, which introduces a branch in the network architecture related to the head pose estimation.

Tiny head pose classifictaion by bodily cues.pdf

ICIP1701 (568)

Categories:: Image/Video Processing

235 Views

MULTI-VIEW NETWORK-BASED SOCIAL-TAGGED LANDMARK IMAGE CLUSTERING

Read more about MULTI-VIEW NETWORK-BASED SOCIAL-TAGGED LANDMARK IMAGE CLUSTERING
Log in to post comments

The multiple types of social media data have abundant information, but learning multi-modal social data is challenging due to data heterogeneity and noise in user-generated data. To address this problem, we propose a multi-view network-based clustering approach that is robust to noise and fully reflects the underlying structure of the comprehensive network. To demonstrate the proposed approach, we experimented with clustering challenging tagged images of landmarks.

ICIP2017_poster.pdf

ICIP2017_poster.pdf (497)

Categories:: Image/Video Storage, Retrieval

5 Views

INTEGRATING THOR TOOLS INTO THE EMERGING AV1 CODEC

Read more about INTEGRATING THOR TOOLS INTO THE EMERGING AV1 CODEC
Log in to post comments

Over recent years there have been several efforts which aim to standardise a royalty-free video codec, such as Thor developed by Cisco, and AV1 developed by the Alliance for Open Media. In this paper we discuss how some compression tools in Thor were integrated into the emerging AV1 codec aiming to increase compression efficiency as well as to decrease computational complexity.

icip_presentation_clpf.pdf

icip_presentation_clpf.pdf (549)

Categories:: Image/Video Coding

101 Views

A New High Precision Eye Center LocalizationTechnique

Read more about A New High Precision Eye Center LocalizationTechnique
Log in to post comments

Eyes represent the most distinctive features of the human face, while their position and movements are a significant source of information about the cognitive and affective state of humans. Precise eye center localization constitutes a challenging problem in many human-computer interaction applications. In this work, an automatic, non-intrusive method is introduced for the precise eye center localization,based on a modified version of the Fast Radial Symmetry Transform.

A new high precision eye center localization technique.pdf

A new high precision eye center localization technique.pdf (259)

Categories:: Image/Video Processing

17 Views

AN EFFICIENT DEEP NEURAL NETWORKS TRAINING FRAMEWORK FOR ROBUST FACE RECOGNITION

Read more about AN EFFICIENT DEEP NEURAL NETWORKS TRAINING FRAMEWORK FOR ROBUST FACE RECOGNITION

In recent years, the triplet loss-based deep neural networks (DNN) are widely used in the task of face recognition and achieve the state-of-the-art performance. However, the complexity of training the triplet loss-based DNN is significantly high due to the difficulty in generating high-quality training samples. In this paper, we propose a novel DNN training framework to accelerate the training process of the triplet loss-based DNN and meanwhile to improve the performance of face recognition. More specifically, the proposed framework contains two stages: 1) The DNN initialization.

poster_v5.pdf

Face recognition (506)

Categories:: Biometrics

16 Views

GPGPU Implementation of VP9 in-loop deblocking filter and improvement for AV1 codec

Read more about GPGPU Implementation of VP9 in-loop deblocking filter and improvement for AV1 codec
Log in to post comments

ICIP2017-ParallelDeblocking.pdf

ICIP2017-ParallelDeblocking.pdf (631)

Categories:: Image/Video Coding

110 Views

Saliency Detection for Seismic Applications Using Multi-dimensional Spectral Projections and Directional Comparisons

In this paper, we propose a novel approach for saliency detection for seismic applications using 3D-FFT local spectra and multi-dimensional plane projections. We develop a projection scheme by dividing a 3D-FFT local spectrum of a data volume into three distinct components, each depicting changes along a different dimension of the data. The saliency detection results obtained using each projected component are then combined to yield a saliency map.

ICIP2017oral_SaleincySeismic_20170912_AmirTariqZhiling.pdf

Oral Presentation (471)

Categories:: Image/Video Processing

4 Views

Saliency Detection for Seismic Applications Using Multi-dimensional Spectral Projections and Directional Comparisons

ICIP2017oral_SaleincySeismic_20170912_AmirTariqZhiling.pdf

Oral Presentation (482)

Categories:: Image/Video Processing

10 Views

UNSUPERVISED FEATURE SELECTION BY MANIFOLD REGULARIZED SELF-REPRESENTATION

Read more about UNSUPERVISED FEATURE SELECTION BY MANIFOLD REGULARIZED SELF-REPRESENTATION
Log in to post comments

Unsupervised feature selection has been proven to be an efficient technique in mitigating the curse of dimensionality. It helps to understand and analyze the prevalent high-dimensional unlabeled data. Recently, based on the self-similarity property of objects, self-representation which assumes that a feature can be represented by the linear combination of its relevant features has been successfully used in unsupervised feature selection. In this paper, we propose a novel algorithm termed Manifold Regularized Selfrepresentation(MRSR) based on the self-representation ability of features.

MRSR_Poster_90x142cm.pdf

MRSR_ICIP_1266 (582)

Categories:: Audio and Acoustic Signal Processing

32 Views

Multi Layer Multi Objective Extreme Learning Machine

Read more about Multi Layer Multi Objective Extreme Learning Machine
Log in to post comments

Fully connected multi layer neural networks such as Deep Boltzmann Machines (DBM) performs better than fully connected single layer neural networks in image classification tasks and has a smaller number of hidden layer neurons than Extreme Learning Machine (ELM) based fully connected multi layer neural networks such as Multi Layer ELM (ML-ELM) and Hierarchical ELM (H-ELM) However, ML-ELM and H-ELM has a smaller training time than DBM.

MLMO_ELM_presentation.pdf

MLMO_ELM_presentation.pdf (971)

Categories:: Neural network learning (MLR-NNLR)

46 Views

Pages