Image, Video, and Multidimensional Signal Processing

Appendix for "Perceptual Classifiers for Detecting Generative Images"

Read more about Appendix for "Perceptual Classifiers for Detecting Generative Images"
Log in to post comments

This 2-page document provides supplementary material for the paper titled "Perceptual Classifiers for Detecting Generative Images". It provides details on the datasets used and their composition. We also include the real and fake detection accuracies for each class to help readers better understand the strengths and drawbacks of the proposed approach. Finally, we provide t-SNE visualizations to understand the effectiveness of the chosen feature extractors.

appendix.pdf

appendix.pdf (71)

Categories:: Image, Video, and Multidimensional Signal Processing

35 Views

Supplement Material for SCIGS: 3D GAUSSIANS SPLATTING FROM A SNAPSHOT COMPRESSIVE IMAGE

In this supplementary material for "SCIGS: 3D GAUSSIANS SPLATTING FROM A SNAPSHOT COMPRESSIVE IMAGE", the results of comparative experiments on all the datasets from dynamic scenes and static scenes are shown. The experiment compares our SCIGS against current state-of-the-art SCI decoding methods and state-of-the-art SCI image-based reconstruction method. An additional experiment is conducted to assess the impact of various mask overlapping rates during the SCI image modulated.

SCIGS Supplementary Material.pdf

SCIGS Supplementary Material.pdf (60)

Categories:: Image, Video, and Multidimensional Signal Processing

25 Views

Supplementary Materials for "WEAKLY SUPERVISED DEFECT LOCALIZATION WITH RESIDUAL FEATURES"

ICIP2025_1776_Supple.pdf

ICIP2025_1776_Supple.pdf (75)

Categories:: Image, Video, and Multidimensional Signal Processing

83 Views

Supplementary Material for DARTs: Deformable Animation Ready Templates for Clothing Humans

Accurate 3D modeling of humans and high-fidelity garments is crucial in computer vision and graphics, impacting gaming, virtual, and augmented reality applications.

Darts_Supplementary.pdf

Darts_Supplementary.pdf (79)

Categories:: Image, Video, and Multidimensional Signal Processing

30 Views

HSBS: COMPREHENSIVE BOOSTING OF FACIAL EXPRESSION RECOGNITION VIA HIERARCHICAL SEMANTIC AND BATCH-WISE SIMILARITY

Facial Expression Recognition (FER) has achieved significant success in recent years due to the rise of deep learning. Meanwhile, latent semantic information is crucial for recognizing facial expressions with subtle differences. Inspired by inconsistencies in learning intensity across different layers of deep learning networks — where shallow-layer features lack generalization and task relevance compared to deep-layer features — we propose a novel Hierarchical Semantic Transfer (HST) method.

HSBS.zip

HSBS.zip (79)

Categories:: Image, Video, and Multidimensional Signal Processing

44 Views

HSBS: COMPREHENSIVE BOOSTING OF FACIAL EXPRESSION RECOGNITION VIA HIERARCHICAL SEMANTIC AND BATCH-WISE SIMILARITY

HSBS.zip

HSBS.zip (113)

Categories:: Image, Video, and Multidimensional Signal Processing

2 Views

Test submission

Read more about Test submission
Log in to post comments

test

test.pdf

test.pdf (716)

Categories:: Image, Video, and Multidimensional Signal Processing

26 Views

Retrieval-Augmented Natural Language Reasoning for Explainable Visual Question Answering

Visual Question Answering with Natural Language Explanation (VQA-NLE) task is challenging due to its high demand for reasoning-based inference. Recent VQA-NLE studies focus on enhancing model networks to amplify the model’s reasoning capability but this approach is resource consuming and unstable. In this work, we introduce a new VQA-NLE model, ReRe (Retrieval-augmented natural language Reasoning), using leverage retrieval information from the memory to aid in generating accurate answers and persuasive explanations without relying on complex networks and extra datasets.

ReRE_ICIP_Workshop.pptx

ReRE_ICIP_Workshop.pptx (82)

Categories:: Image, Video, and Multidimensional Signal Processing

4 Views

FEATURE-CONSTRAINED AND ATTENTION-CONDITIONED DISTILLATION LEARNING FOR VISUAL ANOMALY DETECTION

Visual anomaly detection in computer vision is an essential one-class classification and segmentation problem. The student-teacher (S-T) approach has proven effective in addressing this challenge. However, previous studies based on S-T underutilize the feature representations learned by the teacher network, which restricts anomaly detection performance.

ICASSP2024_FCACDL.pptx

ICASSP2024_FCACDL.pptx (202)

Categories:: Image, Video, and Multidimensional Signal Processing

46 Views

MULTI-MODALITY ACTION RECOGNITION BASED ON DUAL FEATURE SHIFT IN VEHICLE CABIN MONITORING

Driver Action Recognition (DAR) is crucial in vehicle cabin monitoring systems. In real-world applications, it is common for vehicle cabins to be equipped with cameras featuring different modalities. However, multi-modality fusion strategies for the DAR task within car cabins have rarely been studied. In this paper, we propose a novel yet efficient multi-modality driver action recognition method based on dual feature shift, named DFS. DFS first integrates complementary features across modalities by performing modality feature interaction.

ICASSP_Poster.pdf

ICASSP_Poster.pdf (376)

Categories:: Image, Video, and Multidimensional Signal Processing

44 Views

Image, Video, and Multidimensional Signal Processing

Pages