Sorry, you need to enable JavaScript to visit this website.

The International Conference on Image Processing (ICIP), sponsored by the IEEE Signal Processing Society, is the premier forum for the presentation of technological advances and research results in the fields of theoretical, experimental, and applied image and video processing. ICIP has been held annually since 1994, brings together leading engineers and scientists in image and video processing from around the world. Visit website.

Reconstruction of signals from compressively sensed measurements is an ill-posed problem. In this paper, we leverage the recurrent generative model, RIDE, as an image prior for compressive image reconstruction. Recurrent networks can model long-range dependencies in images and hence are suitable to handle global multiplexing in reconstruction from compressive imaging. We perform MAP inference with RIDE using back-propagation to the inputs and projected gradient method. We propose an entropy thresholding based approach for preserving texture in images well.

Categories:
9 Views

In this paper we propose a method to segment individual leaves of crop plants from UAV imagery for the purposes of deriving phenotypic properties of the plant. The crop plant used in our study is sorghum Sorghum bicolor (L.) Moench. Phenotyping is a set of methodologies for analyzing and obtaining characteristic traits of a plant. In a phenotypic study, leaves are often used to estimate traits such as individual leaf area and Leaf Area Index (LAI). Our approach is to segment the leaves in polar coordinates using the plant center as the origin.

Categories:
65 Views

Recent studies showed that subtle changes in human’s face color due to the heartbeat can be captured by digital video recorders. Most work focused on still/rest cases or those with relatively small motions. In this work, we propose a heart-rate monitoring method for fitness exercise videos. We focus on building a highly precise motion compensation scheme with the help of the optical flow, and use motion information as a cue to adaptively remove ambiguous frequency components for improving the heart rates estimates.

Categories:
495 Views

Left ventricle (LV) segmentation is crucial for quantitative analysis of the cardiac contractile function. In this paper, we propose a joint multi-scale convolutional neural network to fully automatically segment the LV. Our method adopts two kinds of multi-scale features of cardiac magnetic resonance (CMR) images, including multi-scale features directly extracted from CMR images with different scales and multi-scale features constructed by intermediate layers of standard CNN architecture.

Categories:
10 Views

Untextured scenes with complex occlusions still present challenges to modern stereo algorithms. We consider the pathological case of Mondrian Stereo—scenes consisting solely of solid-colored planar regions, inspired by paintings by Piet Mondrian. We analyze assumptions that allow disambiguating such scenes and present a novel stereo algorithm employing symbolic reasoning about matched edge segments. We demonstrate compelling stereo matching results on synthetic scenes and discuss how our insights could be utilized in robust real-world stereo algorithms for untextured environments.

Categories:
7 Views

Most of the RGB-D fusion methods extract features from RGB
data and depth data separately and then simply concatenate
them or encode these two kinds of features. Such frameworks
cannot explore the correlation between the RGB pixels and
their corresponding depth pixels. Motivated by the physical
concept that range data correspond to the phase change and
color information corresponds to the intensity, we first project
raw RGB-D data into a complex space and then jointly extract
features from the fused RGB-D images. Consequently, the

Categories:
10 Views

The reconstruction of real world objects becomes even more important in the view of creating highly realistic scenes for Virtual Reality applications. In this paper, we present a fully automated algorithmic pipeline for high-quality 3D reconstruction of real world objects. The proposed method refines an initial 3D model by exploiting the results of additional pairwise stereo depth estimation. An automatic camera selection approach provides different point clouds, which are fused into a common coherent and highly detailed 3D model.

Categories:
40 Views

Although many visual attention models have been proposed, very few saliency models investigated the impact of audio information. To develop audio-visual attention models, researchers need to have a ground truth of eye movements recorded while exploring complex natural scenes in different audio conditions. They also need tools to compare eye movements and gaze patterns between these different audio conditions.

Categories:
30 Views

While a number of different algorithms have recently been proposed for convolutional dictionary learning, this remains an expensive problem. The single biggest impediment to learning from large training sets is the memory requirements, which grow at least linearly with the size of the training set since all existing methods are batch algorithms. The work reported here addresses this limitation by extending online dictionary learning ideas to the convolutional context.

Categories:
9 Views

Pages