Image/Video Processing

PVD4RCV: A Photo-realistic Multi-Distortion Video Dataset for Benchmarking and Developing Robust Computer Vision Models

This work addresses a significant gap in existing
image and video databases commonly used in computer vision
applications by introducing a unique and comprehensive
database named Photo-realistic Multi-Distortion Video Dataset
for Benchmarking and Developing Robust Computer Vision
Models (PVD4RCV). A key innovation of PVD4RCV lies in
its incorporation of some relevant physical factors (e.g. depth
information, interaction of light with scene contents) inherent to
video signal acquisition in constrained and complex real-world

VCIP_PVD4RCV_Beghdadi_2025_compressed.pdf

Dataset for benchmarking and developing object detection and tracking models (81)

Categories:: Image/Video Processing

42 Views

Robust Estimation of Bump Height for Wafer-Level Packaging Using Opcital Triangulation

Paper Abstraction:

ICIP2025_Supplementary_Materials_Robust_estimation_of_bump_height_for_wafer_level_packaging_using_optical_triangulation.pdf

ICIP2025_Supplementary_Materials_Robust_estimation_of_bump_height_for_wafer_level_packaging_using_optical_triangulation.pdf (231)

Categories:: Image/Video Processing

45 Views

Supplementary Materials for F-PTQ

Read more about Supplementary Materials for F-PTQ
Log in to post comments

This is the Supplementary Material for the manuscript titled as "F-LBQ: Fine-grained Low Bit Quantization for Efficient and Accurate Object Detection" in submission to ICIP 2025.
The pdf contains implementation details and auxiliary experimental discussions.

ICIP_2025_supp.pdf

ICIP_2025_supp.pdf (263)

Categories:: Image/Video Processing

34 Views

Demo Video 3

Read more about Demo Video 3
Log in to post comments

Virtual transparency

description.txt

description.txt

Categories:: Image/Video Processing

22 Views

Demo Video 2

Virtual Transparency

description.txt

Categories:: Image/Video Processing

25 Views

Demo Video

Read more about Demo Video
Log in to post comments

Virtual Transparency.

description.txt

Categories:: Image/Video Processing

15 Views

FACE QUALITY TRANSFORMER: A FACE QUALITY ASSESSMENT AND ENHANCEMENT FRAMEWORK

Read more about FACE QUALITY TRANSFORMER: A FACE QUALITY ASSESSMENT AND ENHANCEMENT FRAMEWORK
Log in to post comments

The quality of image generation has reached impressive levels.
Advanced text-to-image models have become
amazingly good at creating objects, depicting actions with high precision.
However, despite significant progress in image generation,
the quality of generated faces remains a
critical factor for users. Even the most advanced text-to-image
diffusion models struggle to generate high quality faces consistently. This
highlights the importance of estimation of face quality in generated images as one of

FaceQ_transformer.zip

FaceQ_transformer.zip (211)

Categories:: Image/Video Processing

15 Views

MEASURING DISTORTION STRENGTH WITH DEWARPING DIFFUSION MODELS IN ANOMALY DETECTION

Read more about MEASURING DISTORTION STRENGTH WITH DEWARPING DIFFUSION MODELS IN ANOMALY DETECTION
Log in to post comments

Surface anomaly detection aims to localize abnormal regions in images. A representative approach is the reconstruction-based method, which detect defects via reconstruction errors using generative models trained on normal images. However, these methods cannot directly estimate local distortion levels, which are commonly used for products made of metal or resin. To address this issue, we propose DiffuDewarp, a novel method that directly estimates local distortions. Our approach defines a pseudo-deformation defect generation process as a new diffusion process based on localized warping.

SUPPLEMENTARY_MATERIALS_FOR_MEASURING_DISTORTION_STRENGTHS_WITH_DEWARPING_DIFFUSION_MODEL_IN_ANOMALY_DETECTION.pdf

SUPPLEMENTARY_MATERIALS_FOR_MEASURING_DISTORTION_STRENGTHS_WITH_DEWARPING_DIFFUSION_MODEL_IN_ANOMALY_DETECTION.pdf (258)

Categories:: Image/Video Processing
Other ITT Topics

63 Views

Supplementary material

Read more about Supplementary material
Log in to post comments

Automatic facial stroke and palsy assessment systems based on computer vision have strong benefits in their application. We propose a framework that exploits facial graphs with temporal connections and analyses them through a graph attention-based model. The temporal facial graph captures structural and motion cues from the facial image sequence, while the graph attention mechanism effectively analyses the interrelation between facial regions in close proximity.

supplementary.pdf

supplementary.pdf (205)

Categories:: Image/Video Processing

17 Views

Supplementary_Material

Read more about Supplementary_Material
Log in to post comments

In this paper, we propose a novel approach, HDNet-LE, designed to enhance low-light (LL) images in terms of contrast, noise removal, and other degradation kinds by leveraging the power of Wavelet transform (WT) and Fourier Transform (FT). HDNet-LE combines the strengths of Generative Adversarial Networks (GAN) with the multi-scale analysis capabilities of the Wavelet transform and the spatial frequency domain through the FT. Experimental results demonstrate the effectiveness of the proposed method in improving the visibility and quality of LL images.

Supplementary_Material.pdf

Supplementary_Material.pdf (294)

Categories:: Image/Video Processing

12 Views

Image/Video Processing

Pages