Sorry, you need to enable JavaScript to visit this website.

The International Conference on Image Processing (ICIP), sponsored by the IEEE Signal Processing Society, is the premier forum for the presentation of technological advances and research results in the fields of theoretical, experimental, and applied image and video processing. ICIP has been held annually since 1994, brings together leading engineers and scientists in image and video processing from around the world. Visit website.

A simple and scalable denoising algorithm is proposed that can be applied to a wide range of source and noise models. At the core of the proposed CUDE algorithm is symbol-by-symbol universal denoising used by the celebrated DUDE algorithm, whereby the optimal estimate of the source from an unknown distribution is computed by inverting the empirical distribution of the noisy observation sequence by a deep neural network, which naturally and implicitly aggregates multiple contexts of similar characteristics and estimates the conditional distribution more accurately.

Categories:
17 Views

Recent low-rank based tensor completion (LRTC) algorithms have been successfully applied into color image inpainting. However, most of existing LRTC algorithms treat each dimension of tensors equally, which ignores the differences of the intrinsic structure correlations among dimensions. In this paper, we make a detailed analysis about the rank properties of each dimension and design a simple yet effective reweighted low-rank tensor completion model that truthfully capture the intrinsic structure correlations with reduced computational burden.

Categories:
16 Views

The image processing community has witnessed remarkable advances in enhancing and restoring images. Nevertheless, restoring the visual quality of underwater images remains a great challenge. End-to-end frameworks might fail to enhance the visual quality of underwater images since in several scenarios it is not feasible to provide the ground truth of the scene radiance. In this work, we propose a CNN-based approach that does not require ground truth data since it uses a set of image quality metrics to guide the restoration learning process.

Categories:
9 Views

Finding visual features and suitable models for lipreading tasks that are more complex than a well-constrained vocabulary has proven challenging. This paper explores state-of-the-art Deep Neural Network architectures for lipreading based on a Sequence to Sequence Recurrent Neural Network. We report results for both hand-crafted and 2D/3D Convolutional Neural Network visual front-ends, online monotonic attention, and a joint Connectionist Temporal Classification-Sequence-to-Sequence loss.

Categories:
12 Views

Great progresses have been achieved on object detection in the wild. However, it still remains a challenging problem due to tiny objects. In this paper, we present a Three-category Classification Neural Network to find tiny faces under complex environments by leveraging contextual information around faces. Tiny faces (within 20x20 pixels) are so fuzzy that the facial patterns are not clear or even ambiguous for detection.

Categories:
17 Views

Pages