Sorry, you need to enable JavaScript to visit this website.

Partial occlusions in face images pose a great problem for most face recognition algorithms due to the fact that most of these algorithms mainly focus on solving a second order loss function, e.g., mean square error (MSE), which will magnify the effect from occlusion parts. In this paper, we proposed a kernel non-second order loss function for sparse representation (KNS-SR) to recognize or restore partially occluded facial images, which both take the advantages of the correntropy and the non-second order statistics measurement.

Categories:
13 Views

Multi-scale object recognition and accurate object localization are two major problems for semantic segmentation in high resolution aerial images. To handle these problems, we design a Context Fuse Module to aggregate multi-scale features and propose an Attention Mix Module to combine different level features for higher localization accuracy. We further employ a Residual Convolutional Module to refine features in all levels. Based on these modules, we construct a new end-to-end network for semantic labeling in aerial images.

Categories:
16 Views

From video streaming to security and surveillance applications , video data play an important role in our daily living today. However, managing a large amount of video data and retrieving the most useful information for the user remain a challenging task. In this paper, we propose a novel video classification system that would benefit the scene understanding task. We define our classification problem as classifying background and foreground motions using the same feature representation for outdoor scenes.

Categories:
128 Views

Stereo high dynamic range (HDR) image/video can be generated by using a pair of stereo cameras with different exposure parameters. This paper proposes a new stereo HDR imaging method using generative adversarial networks (GAN) with a low dynamic range (LDR) stereo imaging system. It is assumed here that the left-view (LV) image is under-exposed and the right-view (RV) image is over-exposed.

Categories:
33 Views

We detect and classify Table Tennis strokes in videos recorded in natural condition. The goal is to develop an intelligent computer environment where teachers and students can analyse their games for improving players performance.

Categories:
24 Views

The region proposal task is generating a set of candidate regions that contain an object. In this task, it is most important to propose as many candidates of ground-truth in a fixed number of proposals. However, in an image, there are too small number of hard negative examples compared to the vast number of easy negatives, so the region proposal networks struggle to train hard negatives. Because of these problem, network tends to propose hard negatives as the candidates and fails to propose the ground-truth, which leads poor performance.

Categories:
55 Views

A new way of performing pixel by pixel comparison between two images is proposed, taking advantage of interesting invariance properties with respect to illumination conditions and camera settings.

Categories:
50 Views

Light field (LF) cameras often have significant limitations in spatial and angular resolutions due to their design. Many techniques that attempt to reconstruct LF images at a higher resolution only consider either spatial or angular resolution, but not both. We propose a generative network using high-dimensional convolution to improve both aspects. Our experimental results on both synthetic and real-world data demonstrate that the proposed model outperforms existing state-of-the-art methods in terms of both peak signal-to-noise ratio (PSNR) and visual quality.

Categories:
47 Views

Pages