The International Conference on Image Processing (ICIP), sponsored by the IEEE Signal Processing Society, is the premier forum for the presentation of technological advances and research results in the fields of theoretical, experimental, and applied image and video processing. ICIP has been held annually since 1994, brings together leading engineers and scientists in image and video processing from around the world. Visit website.
- Read more about 5D Video Stabilization through Sensor Vision Fusion
- 1 comment
- Log in to post comments
- Categories:
- Read more about Fashion Recommendation on Street Images
- Log in to post comments
Learning the compatibility relationship is of vital importance to a fashion recommendation system, while existing works achieve this merely on product images but not on street images in the complex daily life scenario. In this paper, we propose a novel fashion recommendation system: Given a query item of interest in the street scenario, the system can return the compatible items. More specifically, a two-stage curriculum learning scheme is developed to transfer the semantics from the product to street outfit images.
- Categories:
- Read more about Temporal Interframe Pattern Analysis for Static and Dynamic Hand Gesture Recognition
- Log in to post comments
Hand gesture, a common non-verbal language, is being studied for Human Computer Interaction. Hand gestures can be categorized as static hand gestures and dynamic hand gestures. In recent years, effective approaches have been applied to hand gesture recognition.
- Categories:
- Read more about Fast Inpainting-based Compression: Combinging Shepard Interpolation with Joint Inpainting and Prediction
- Log in to post comments
Inpainting-based compression has been suggested as a qualitative competitor to the JPEG family of transform-based codecs, specifically for high compression ratios. However, it also requires sophisticated interpolation, data optimisation and encoding tasks that are both slow and hard to implement. We propose a fast and simple alternative that combines Shepard interpolation with a novel joint inpainting and prediction approach. It represents the image by a fraction of its pixel values on a sparse regular subgrid that are selected by an efficient optimisation strategy.
- Categories:
- Read more about ACCURATE SEGMENTATION OF SYNAPTIC CLEFT WITH CONTOUR GROWING CONCATENATED WITH A CONVNET
- Log in to post comments
- Categories:
- Read more about Dense Optical Flow for the Reconstruction of Weakly Textured and Structured Surfaces: Application to Endoscopy
- Log in to post comments
- Categories:
- Read more about INFORMATIVE FRAME CLASSIFICATION OF ENDOSCOPIC VIDEOS USING CONVOLUTIONAL NEURAL NETWORKS AND HIDDEN MARKOV MODELS
- Log in to post comments
The goal of endoscopic analysis is to find abnormal lesions and determine further therapy from the obtained information. However, the procedure produces a variety of non-informative frames and lesions can be missed due to poor video quality. Especially when analyzing entire endoscopic videos made by non-expert endoscopists, informative frame classification is crucial to e.g. video quality grading. This work concentrates on the design of an automated indication of informativeness of video frames.
- Categories:
- Read more about Two-stage Unsupervised Learning Method for Affine and Deformable Registration
- Log in to post comments
Conventional medical image registration relies on time-consuming iterative optimization. We propose a two-stage unsupervised learning method for 3D medical image registration. In the first stage, we learn a global image-wise affine map by a deep network. In the second stage, we learn a local voxel-wise deformation vector field by an encoder-decoder architecture. The final registered image is acquired by applying the local deformation field to the moved image of the first stage.
- Categories:
- Read more about ARCHITECTURE-AWARE NETWORK PRUNING FOR VISION QUALITY APPLICATIONS
- Log in to post comments
Convolutional neural network (CNN) delivers impressive achievements in computer vision and machine learning field. However, CNN incurs high computational complexity, especially for vision quality applications because of large image resolution. In this paper, we propose an iterative architecture-aware pruning algorithm with adaptive magnitude threshold while cooperating with quality-metric measurement simultaneously. We show the performance improvement applied on vision quality applications and provide comprehensive analysis with flexible pruning configuration.
- Categories:
- Read more about MEMORY-BASED PARAMETERIZED SKILLS LEARNING FOR MAPLESS VISUAL NAVIGATION
- Log in to post comments
The recently-proposed reinforcement learning for mapless visual navigation can generate an optimal policy for searching different targets. However, most state-of-the-art deep reinforcement learning (DRL) models depend on hard rewards to learn the optimal policy, which can lead to the lack of previous diverse experiences. Moreover, these pre-trained DRL models cannot generalize well to un-trained tasks. To overcome these problems above, in this paper, we propose a Memorybased Parameterized Skills Learning (MPSL) model for mapless visual navigation.
- Categories: