Image/Video Processing

MULTIPLE PATH SEARCH FOR ACTION TUBE DETECTION IN VIDEOS

This paper presents an efficient convolutional neural network (CNN)-based multiple path search (MPS) algorithm to detect multiple spatial-temporal action tubes in videos. Using the pass information and the accumulated scores generated by forward message passing, the new algorithm finds multiple paths simultaneously during backward path tracing without repeating the search process. Moreover, to rectify potentially inaccurate bounding boxes, we also propose a video localization refinement scheme to further boost the detection accuracy.
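To illustrate the idea of reusing one forward pass for several backward traces, here is a minimal Viterbi-style sketch. It is not the authors' MPS algorithm: the score layout (`unary`, `pairwise`), the function names, and the rule of simply taking the top-k end nodes are all assumptions made for illustration, and the sketch does nothing to keep the traced paths from sharing boxes.

```python
from typing import Callable, List, Tuple


def forward_pass(unary: List[List[float]],
                 pairwise: Callable[[int, int, int], float]):
    """One forward sweep over the frames.

    unary[t][j]       : detection score of box j in frame t (assumed input).
    pairwise(t, i, j) : link score between box i in frame t-1 and box j in frame t.
    Returns accumulated scores and back-pointers for every box in every frame.
    """
    acc = [list(unary[0])]
    back = [[-1] * len(unary[0])]
    for t in range(1, len(unary)):
        acc_t, back_t = [], []
        for j, score in enumerate(unary[t]):
            # Best predecessor in the previous frame for box j.
            best_i = max(range(len(unary[t - 1])),
                         key=lambda i: acc[t - 1][i] + pairwise(t, i, j))
            acc_t.append(acc[t - 1][best_i] + pairwise(t, best_i, j) + score)
            back_t.append(best_i)
        acc.append(acc_t)
        back.append(back_t)
    return acc, back


def trace_paths(acc, back, k: int) -> List[Tuple[float, List[int]]]:
    """Trace k paths backward from the k best end nodes.

    The stored back-pointers are reused, so the forward pass is not repeated.
    """
    last = len(acc) - 1
    ends = sorted(range(len(acc[last])),
                  key=lambda j: acc[last][j], reverse=True)[:k]
    paths = []
    for end in ends:
        path = [end]
        for t in range(last, 0, -1):
            path.append(back[t][path[-1]])
        paths.append((acc[last][end], path[::-1]))
    return paths


# Toy example: 3 frames, 2 candidate boxes per frame.
unary = [[1.0, 0.4], [0.2, 0.9], [0.8, 0.1]]
link = lambda t, i, j: 0.5 if i == j else 0.0  # hypothetical overlap bonus
acc, back = forward_pass(unary, link)
paths = trace_paths(acc, back, k=2)
```

Each entry of `paths` is a pair of an accumulated score and the list of box indices chosen per frame. The forward sweep costs the same whether one path or k paths are traced, which is the saving the abstract describes.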

ICIP2017.pdf

MULTIPLE PATH SEARCH FOR ACTION TUBE DETECTION IN VIDEOS (986)

Categories:: Image/Video Processing

14 Views

WORDFENCE: TEXT DETECTION IN NATURAL IMAGES WITH BORDER AWARENESS

In recent years, text recognition has achieved remarkable success in recognizing scanned document text. However, word recognition in natural images is still an open problem, which generally requires time-consuming post-processing steps. We present a novel architecture for individual word detection in scene images based on semantic segmentation. Our contributions are twofold: the concept of WordFence, which detects border areas surrounding each individual word, and a novel pixelwise weighted softmax loss function, which penalizes background and emphasizes small text regions.
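The abstract does not give the loss formula, so the following is only a generic sketch of a pixelwise weighted softmax cross-entropy. The function names, the flat per-pixel list layout, and the normalization by the sum of weights are assumptions; how WordFence actually derives its weights is not stated here, so the weights are passed in by the caller.

```python
import math
from typing import List


def softmax(logits: List[float]) -> List[float]:
    """Numerically stable softmax over the class scores of one pixel."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def weighted_softmax_loss(logits: List[List[float]],
                          labels: List[int],
                          weights: List[float]) -> float:
    """Weighted mean of per-pixel cross-entropy terms.

    logits[p]  : class scores for pixel p.
    labels[p]  : ground-truth class index for pixel p.
    weights[p] : per-pixel weight (assumed: small for background pixels,
                 larger for pixels inside small text regions).
    """
    total = 0.0
    for z, y, w in zip(logits, labels, weights):
        total += -w * math.log(softmax(z)[y])
    return total / sum(weights)


# Two pixels, two classes: pixel 0 is classified correctly, pixel 1 is not.
logits = [[2.0, 0.0], [0.0, 2.0]]
labels = [0, 0]
uniform = weighted_softmax_loss(logits, labels, [1.0, 1.0])
emphasized = weighted_softmax_loss(logits, labels, [1.0, 3.0])
```

Raising the weight of the misclassified pixel makes `emphasized` larger than `uniform`, so errors on highly weighted pixels dominate the gradient during training.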

ICIP_wordfence_presentation.pdf

ICIP_wordfence_presentation.pdf (629)

Categories:: Image/Video Processing

14 Views

ROBUST FACE ALIGNMENT WITH CASCADED COARSE-TO-FINE AUTO-ENCODER NETWORK

icip2017新 .pptx

icip2017新 .pptx (364)

Categories:: Image/Video Processing

14 Views

A GRAPH-BASED APPROACH FOR FEATURE EXTRACTION AND SEGMENTATION OF MULTIMODAL IMAGES

Iyer_ICIP2017_Poster_Draft_10-09-17.pdf

Iyer_ICIP2017_Poster_Draft_10-09-17.pdf (1504)

Categories:: Image/Video Processing

17 Views

Visual Salience and Stack Extension Based Ghost Removal for High-dynamic-range Imaging

High-dynamic-range imaging (HDRI) techniques are proposed to extend the dynamic range of captured images beyond sensor limitations. The key issue of multi-exposure fusion in HDRI is removing ghost artifacts caused by moving objects and handheld cameras. This paper proposes a ghost-free HDRI algorithm based on visual salience and stack extension. To improve the accuracy of ghost area detection, visual salience based bilateral motion detection is

WZJ_poster_ICIP_final.pdf

WZJ_poster_ICIP_final.pdf (578)

Categories:: Image/Video Processing

84 Views

DenseNet for Dense Flow

Efficient Large-Scale Video Understanding in The Wild

ICIP17_phd_forum_poster.pdf

ICIP17_phd_forum_poster.pdf (738)

Categories:: Image/Video Processing
Events & Activities

7 Views

DenseNet for Dense Flow

Classical approaches for estimating optical flow have achieved rapid progress in the last decade. However, most of them are too slow to be applied in real-time video analysis. Due to the great success of deep learning, recent work has focused on using CNNs to solve such dense prediction problems. In this paper, we investigate a new deep architecture, Densely Connected Convolutional Networks (DenseNet), to learn optical flow. This specific architecture is ideal for the problem at hand as it provides shortcut connections throughout the network, which leads to implicit deep supervision.
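The shortcut connections mentioned above mean that every layer in a dense block receives the concatenation of all earlier feature maps. The toy sketch below shows only that connectivity pattern on plain lists of channel values. The names `dense_block` and `make_toy_layer` are hypothetical, the "layer" is a stand-in for a real convolution, and nothing here reflects the specific optical flow network in the paper.

```python
from typing import Callable, List

Layer = Callable[[List[float]], List[float]]


def make_toy_layer(growth_rate: int) -> Layer:
    """Stand-in for a conv layer: maps all input channels to `growth_rate` new ones."""
    def layer(features: List[float]) -> List[float]:
        mean = sum(features) / len(features)
        # ReLU-like clamp on a simple function of the input channels.
        return [max(0.0, mean + 0.1 * k) for k in range(growth_rate)]
    return layer


def dense_block(x: List[float], layers: List[Layer]) -> List[float]:
    """Apply layers with dense connectivity.

    Each layer sees every channel produced so far, and its output is
    concatenated to the running feature list instead of replacing it.
    """
    features = list(x)
    for layer in layers:
        features = features + layer(features)
    return features


# 3 input channels, 4 layers, growth rate 2 -> 3 + 4 * 2 = 11 output channels.
x = [0.5, -1.0, 2.0]
out = dense_block(x, [make_toy_layer(2) for _ in range(4)])
```

Because earlier channels are kept unchanged in the output, a loss applied at the end of the block has a short path back to every layer, which is the implicit deep supervision the abstract refers to.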

ICIP17_paper2550_slides_yizhu.pdf

ICIP17_paper2550_slides_yizhu.pdf (1055)

Categories:: Image/Video Processing
Neural network learning (MLR-NNLR)

16 Views

IMAGE SEGMENTATION USING CONTOUR, SURFACE, AND DEPTH CUES (Slides)

We aim to solve the problem of automatic image segmentation. Although 1D contour and 2D surface cues have been widely utilized in existing work, the 3D depth information of an image, a necessary cue according to human visual perception, has been overlooked in automatic image segmentation. In this paper, we study how to fully utilize 1D contour, 2D surface, and 3D depth cues for image segmentation. First, three elementary segmentation modules are developed, one for each of these cues.