Other

UTAL-GNN: Unsupervised Temporal Action Localization using Graph Neural Networks

Read more about UTAL-GNN: Unsupervised Temporal Action Localization using Graph Neural Networks
Log in to post comments

Fine-grained action localization in untrimmed sports videos presents a significant challenge due to rapid and subtle motion transitions over short durations. Existing supervised and weakly supervised solutions often rely on extensive annotated datasets and high-capacity models, making them computationally intensive and less adaptable to real-world scenarios. In this work, we introduce a lightweight and unsupervised skeleton-based action localization pipeline that leverages spatio-temporal graph neural representations.

ICIP(SW)_SUPPLEMENTARY.pdf

Additional Ablation Study and Performance Evaluation Results (9)

Categories:: Other

5 Views

GIVE: A Multi-Agent Framework for Generating Immersive Multi-Modal Virtual Environments for 3D Games - Supplementary Material

In this work, we present a novel multi-agent framework for generating immersive 3D virtual environments from high-level semantic inputs, powered by large language and vision-language models (LLMs/VLMs). Unlike prior work that focuses primarily on visual output, data-intensive training pipelines, and code generation, our system coordinates a team of specialized agents, each assigned a role such as manager, planner, or expert in visual, audio, or spatial domains, to decompose and execute environment construction tasks within a game engine.

Generative_Immersive_Virtual_Environment_Supplementary_Material.pdf

Supplementary Material (10)

Categories:: Other

55 Views

(Appendix) Rethinking the Backbone in Class Imbalanced Federated Source Free Domain Adaptation: The Utility of Vision Foundation Models

Appendix of our paper: "Rethinking the Backbone in Class Imbalanced Federated Source Free Domain Adaptation: The Utility of Vision Foundation Models" accepted at IEEE ICIP 2025 workshop: Edge Intelligence: Smart, Efficient, and Scalable Solutions for IoT, Wearables, and Embedded Devices (SEEDS)

appendix.pdf

v1_appendix_submitted (94)

appendix.pdf

v2_appendix_revised (3)

Categories:: Other

62 Views

ICIP2025 supplementary material camera ready

Read more about ICIP2025 supplementary material camera ready
Log in to post comments

We address the challenges of local feature matching under large scale and rotation changes by focusing on keypoint positions.
First, we propose a novel module called similarity normalization (SN).
This module normalizes keypoint positions to remove translation, rotation and scale differences between image pairs.
By performing positional encoding on these normalized positions, a network incorporating with SN can effectively avoid encoding largely different positions into descriptors from the two images.

supp_main_cameraready.pdf

supp_main_cameraready.pdf (124)

Categories:: Other

17 Views

ICIP 2025 Supplementary

Read more about ICIP 2025 Supplementary
Log in to post comments

This supplementary material accompanies our paper titled "Texturing Endoscopic 3D Stomach via Neural Radiance Field under Uneven Lighting."

ICIP_2025_supplementary.pdf

ICIP_2025_supplementary.pdf (126)

Categories:: Other

33 Views

(Appendix) MultiMAE Meets Earth Observation: Pre-training Multi-modal Multi-task Masked Autoencoders for Earth Observation Tasks

MULTIMAE MEETS EARTH OBSERVATION: PRE-TRAINING MULTI-MODAL MULTI-TASK MASKED AUTOENCODERS FOR EARTH OBSERVATION TASKS (APPENDIX)

appendix_multimae_meets_eo.pdf

Appendix (127)

Categories:: Other

29 Views

Supplmental - o1-mini prompt

Read more about Supplmental - o1-mini prompt
Log in to post comments

O1-mini prompt to expose the seven components of agricultural disease management evaluation framework

o1-mini prompt.pdf

o1-mini prompt.pdf (124)

Categories:: Other

17 Views

Shuffle PatchMix Augmentation with Confidence-Margin Weighted Pseudo-Labels for Enhanced Source-Free Domain Adaptation

Supplementary Material.

Supplementary_Material_ICIP2025.pdf

Supplementary Material for ICIP 2025 submission (116)

Categories:: Other

95 Views

Supplementary Material ICCE-TW 2025

Read more about Supplementary Material ICCE-TW 2025
Log in to post comments

This supplementary material presents detailed transformer model architectures, training parameters, and comprehensive evaluation metrics to complement our comparison of RNN and transformer models for Indonesian news classification. Our analysis provides deeper insights into why transformer models outperform RNN approaches despite their larger parameter counts.