This paper presents an efficient convolutional neural net- work (CNN)-based multiple path search (MPS) algorithm to detect multiple spatial-temporal action tubes in videos. With the pass information and the accumulated scores generated by forward message passing, the new algorithm reuses these information to simultaneously find multiple paths in back- ward path tracing without repeating the search process. More- over, to rectify the potentially inaccurate bounding boxes, we also propose a video localization refinement scheme to further boost the detection accuracy.
- Categories:
- Read more about WORDFENCE: TEXT DETECTION IN NATURAL IMAGES WITH BORDER AWARENESS
- Log in to post comments
In recent years, text recognition has achieved remarkable success in recognizing scanned document text. However, word recognition in natural images is still an open problem, which generally requires time consuming post-processing steps. We present a novel architecture for individual word detection in scene images based on semantic segmentation. Our contributions are twofold: the concept of WordFence, which detects border areas surrounding each individual word and a novel pixelwise weighted softmax loss function which penalizes background and emphasizes small text regions.
- Categories:
- Read more about ROBUST FACE ALIGNMENT WITH CASCADED COARSE-TO-FINE AUTO-ENCODER NETWORK
- Log in to post comments
- Categories:
- Read more about A GRAPH-BASED APPROACH FOR FEATURE EXTRACTION AND SEGMENTATION OF MULTIMODAL IMAGES
- Log in to post comments
- Categories:
- Read more about Visual Salience and Stack Extension Based Ghost Removal for High-dynamic-range Imaging
- Log in to post comments
High-dynamic-range imaging (HDRI) techniques are proposed to extend the dynamic range of captured images against
sensor limitation. The key issue of multi-exposure fusion in HDRI is removing ghost artifacts caused by the motion of moving objects and handheld cameras. This paper proposes a ghost-free HDRI algorithm based on visual salience and
stack extension. To improve the accuracy of ghost areas detection, visual salience based bilateral motion detection is
- Categories:
- Read more about DenseNet for Dense Flow
- Log in to post comments
Efficient Large-Scale Video Understanding in The Wild
- Categories:
- Read more about DenseNet for Dense Flow
- Log in to post comments
Classical approaches for estimating optical flow have achieved rapid progress in the last decade. However, most of them are too slow to be applied in real-time video analysis. Due to the great success of deep learning, recent work has focused on using CNNs to solve such dense prediction problems. In this paper, we investigate a new deep architecture, Densely Connected Convolutional Networks (DenseNet), to learn optical flow. This specific architecture is ideal for the problem at hand as it provides shortcut connections throughout the network, which leads to implicit deep supervision.
- Categories:
- Read more about IMAGE SEGMENTATION USING CONTOUR, SURFACE, AND DEPTH CUES (Slides)
- Log in to post comments
We target at solving the problem of automatic image segmentation. Although 1D contour and 2D surface cues have been widely utilized in existing work, 3D depth information of an image, a necessary cue according to human visual perception, is however overlooked in automatic image segmentation. In this paper, we study how to fully utilize 1D contour, 2D surface, and 3D depth cues for image segmentation. First, three elementary segmentation modules are developed for these cues respectively.
- Categories:
- Read more about IMAGE SEGMENTATION USING CONTOUR, SURFACE, AND DEPTH CUES (Poster)
- Log in to post comments
We target at solving the problem of automatic image segmentation. Although 1D contour and 2D surface cues have been widely utilized in existing work, 3D depth information of an image, a necessary cue according to human visual perception, is however overlooked in automatic image segmentation. In this paper, we study how to fully utilize 1D contour, 2D surface, and 3D depth cues for image segmentation. First, three elementary segmentation modules are developed for these cues respectively.
- Categories:
- Read more about Intelligent Detail Enhancement for Differently Exposed Images
- Log in to post comments
- Categories: