Image/Video Processing

Multi Class Part Parsing based on Multi-class Boundaries ICIP 25 Supplementary Material

Multi-class part parsing is a dense prediction task that segments objects into semantic components with multi-level abstractions. Despite its significance, this task remains challenging due to ambiguities at both part and class levels. In this paper, we propose a network that incorporates multi-class boundaries to precisely identify and emphasize the spatial boundaries of part classes, thereby improving segmentation quality. Additionally, we employ a weighted multi-label cross-entropy loss function to ensure balanced and effective learning from all parts.

Multi_Class_Part_Parsing_based_on_Multi_class_Boundaries_ICIP_25_Supplementary_Material.pdf

Multi_Class_Part_Parsing_based_on_Multi_class_Boundaries_ICIP_25_Supplementary_Material.pdf (33)

Categories:: Image/Video Processing

58 Views

Supplementary Material for Deep Features based on Contrastive Fusion of Transformer and CNN for Semantic Segmentation

supplementary_material_for_the_research_paper.pdf

supplementary_material_for_the_research_paper.pdf (49)

Categories:: Image/Video Processing

4 Views

Supplementary Material: CHUG - Crowdsourced User-Generated HDR Video Quality

Read more about Supplementary Material: CHUG - Crowdsourced User-Generated HDR Video Quality
1 comment
Log in to post comments

High Dynamic Range (HDR) videos enhance visual experiences with superior brightness, contrast, and color depth. The surge of User-Generated Content (UGC) on platforms like YouTube and TikTok introduces unique challenges for HDR video quality assessment (VQA) due to diverse capture conditions, editing artifacts, and compression distortions. Existing HDR-VQA datasets primarily focus on professionally generated content (PGC), leaving a gap in understanding real-world UGC-HDR degradations.

ICIP2025-Supple.pdf

ICIP2025-Supple.pdf (50)

Categories:: Image/Video Processing

9 Views

Supplementary Material: CHUG - Crowdsourced User-Generated HDR Video Quality

Read more about Supplementary Material: CHUG - Crowdsourced User-Generated HDR Video Quality
Log in to post comments

ICIP2025-Supple.pdf

ICIP2025-Supple.pdf (54)

Categories:: Image/Video Processing

9 Views

ExDF: Supplementary Material

Read more about ExDF: Supplementary Material
Log in to post comments

Supplementary Material

ExDF.pdf

ExDF.pdf (84)

Categories:: Image/Video Processing

24 Views

Supplementary Material

Although many deepfake detection methods have been proposed to fight against severe misuse of generative AI, none provide detailed human-interpretable explanations beyond simple real/fake responses. This limitation makes it challenging for humans to assess the accuracy of detection results, especially when the models encounter unseen deepfakes. To address this issue, we propose a novel deepfake detector based on a large Vision-Language Model (VLM), capable of explaining manipulated facial regions.

ICIP25 Paper (1).pdf

ICIP25 Paper (1).pdf (138)

Categories:: Image/Video Processing
Image, Video, and Multidimensional Signal Processing

17 Views

Supplementary Material

In nighttime conditions, high noise levels and bright Illumination sources degrade image quality, making low-light image enhancement challenging. Thermal images provide complementary information, offering richer textures and structural details. We propose RT-X Net, a cross-attention network that fuses RGB and thermal images for nighttime image enhancement. We leverage self-attention networks for feature extraction and a cross-attention mechanism for fusion to effectively integrate information from both modalities.

Supplementary.pdf

Updated, and Final Supplementary for ICIP 2025 (65)

Categories:: Image/Video Processing

34 Views

Cross-Domain Video Object Detection via Augmented-Shot FineTuning-0

Read more about Cross-Domain Video Object Detection via Augmented-Shot FineTuning-0
Log in to post comments

This document contains supplementary material for the ICIP PAPER.

ICIP2025_SuppM.pdf

ICIP2025_SuppM.pdf (95)

Categories:: Image/Video Processing

56 Views

Supplementary Material for LeMoRe

Read more about Supplementary Material for LeMoRe
Log in to post comments

Lightweight semantic segmentation is essential for many downstream vision tasks. Unfortunately, existing methods often struggle to balance efficiency and performance due to the complexity of feature modeling. Many of these existing approaches are constrained by rigid architectures and implicit representation learning, often characterized by parameter-heavy designs and a reliance on computationally intensive Vision Transformer-based frameworks.