Sorry, you need to enable JavaScript to visit this website.

The document is the supplementary materials of the paper of "Towards Validating Face Editing Ability in Generative Models" to provide more quantitative and qualitative results complementing the main paper.

Categories:
9 Views

Recent developments in self-supervised learning (SSL) have made it possible to learn data representations without the need for annotations.
, which enhances the performance of various downstream tasks.

Categories:
8 Views

In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and shows robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating features maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details.

Categories:
15 Views

Underwater images often suffer from degradation due to refraction, back-scattering, and absorption, resulting in color cast, blur, and limited visibility. Such degradation hampers higher-level computer vision applications in autonomous underwater vehicles. Existing methods for enhancing degraded images often fail to preserve fine edges and true colors. Hence, an effective pre-processing network is vital for underwater image enhancement. Addressing this need, we propose a frequency modulated deformable transformer network.

Categories:
12 Views

This document provides supplementary material for the paper titled “Latent Enhancing AutoEncoder for Occluded Image Classification” submitted to the regular track of the ICIP 2024. This document consists of details of the architecture of the LEARN, illustration of improvements in inter-class differentiability in latent space for OccludedPASCAL3D+ dataset (hereafter referred to as Pascal), and detailed classification results.

Categories:
14 Views

Drones have been widely employed in various fields, but the number of drones being used illegally and for hazardous purposes has recently increased. To prevent illegal drones, in this work, we propose a novel framework for reconstructing three-dimensional (3D) drone trajectories using a single camera. By leveraging calibrated cameras, we exploit the relationship between 2D and 3D spaces. We automatically track the drones in 2D images using a drone tracker and estimate their 2D rotations.

Categories:
9 Views

Identifying people’s identity from a group photo through face
recognition models has applications in various fields. There
are two major challenges, first due to the presence of several
faces with various degrees of clarity and scale, and second due
to angular orientation of faces in usual group photos. Detect-
ing and cropping the faces have been reasonably solved using
various segmentation-like models. Recognizing identity after
cropping a frontal face has also been successful to some ex-

Categories:
3 Views

Prune Channel and Distill: Discriminative Knowledge Distillation for Semantic Segmentation - Supplementary Material -

Categories:
9 Views

Pages