Sorry, you need to enable JavaScript to visit this website.

ICIP 2021 - The International Conference on Image Processing (ICIP), sponsored by the IEEE Signal Processing Society, is the premier forum for the presentation of technological advances and research results in the fields of theoretical, experimental, and applied image and video processing. ICIP has been held annually since 1994, brings together leading engineers and scientists in image and video processing from around the world. Visit website.

In many of the existing alpha matting implementations, an intermediate representation called a trimap needs to be created manually. To automate the process, we propose a generic neural network for trimap generation based on saliency map detection. Our model multi-modally learns a saliency map and a trimap simultaneously. Because of this structure, the network focuses on reducing the error of the trimap especially within the areas with high salience.

Categories:
3 Views

No-reference (NR) image sharpness assessment is an important issue for image quality assessment and algorithm performance evaluation. Many objective NR sharpness assessment metrics have been proposed which are often intended to be strongly associated with the human visual system (HVS). However, recent studies show that common sharpness assessment indicators may misjudge the degree of blurring for images with shallow depth of field that are often used to highlight the main subject in the view.

Categories:
1 Views

Quarter sampling is a novel sensor design that allows for an acquisition of higher resolution images without increasing the number of pixels. When being used for video data, one out of four pixels is measured in each frame. Effectively, this leads to a non-regular spatio-temporal sub-sampling. Compared to purely spatial or temporal sub-sampling, this allows for an increased reconstruction quality, as aliasing artifacts can be reduced. For the fast reconstruction of such sensor data with a fixed mask, recursive variant of frequency selective reconstruction (FSR) was proposed.

Categories:
1 Views

Warp-based methods for image animation estimate a warp
field what do a rearrangement on the pixels of the input image to roughly align with the target image. Current methods
predict accurate warp field by using manually annotated data.
In this paper, we propose a simple method (MAT-net) to predict more precise warp field in self-supervised way. MAT-net
decomposes complex spatial object movement between two

Categories:
1 Views

We present a computational accommodation-invariant near-eye display, which relies on imaging with coherent light and utilizes static optics together with convolutional neural network-based preprocessing. The network and the display optics are co-optimized to obtain a depth-invariant display point spread function, and thus relieve the conflict between accommodation and ocular vergence cues that typically exists in conventional near-eye displays.

Categories:
9 Views

With the emergence of social media, voluminous video clips are uploaded every day, and retrieving the most relevant visual content with a language query becomes critical. Most approaches aim to learn a joint embedding space for plain textual and visual contents without adequately exploiting their intra-modality structures and inter-modality correlations.

Categories:
2 Views

Pages