Pattern recognition and classification (MLR-PATT)

ADAPTIVE CONFIDENCE MULTI-VIEW HASHING FOR MULTIMEDIA RETRIEVAL

Multi-view hashing converts heterogeneous data from multiple views into binary hash codes and is one of the critical technologies in multimedia retrieval. However, current methods mainly explore the complementarity among multiple views while lacking confidence learning and fusion. Moreover, in practical application scenarios, single-view data contains redundant noise. To conduct confidence learning and eliminate unnecessary noise, we propose a novel Adaptive Confidence Multi-View Hashing (ACMVH) method.

ICASSP--ACMVH.pdf (168)

Categories: Pattern recognition and classification (MLR-PATT)

11 Views

PROBMCL: SIMPLE PROBABILISTIC CONTRASTIVE LEARNING FOR MULTI-LABEL VISUAL CLASSIFICATION

Multi-label image classification presents a challenging task in many domains, including computer vision and medical imaging. Recent advancements have introduced graph-based and transformer-based methods to improve performance and capture label dependencies. However, these methods often include complex modules that entail heavy computation and lack interpretability. In this paper, we propose Probabilistic Multi-label Contrastive Learning (ProbMCL), a novel framework to address these challenges in multi-label image classification tasks.

ProbMCL_ICASSP24_Poster.pdf (188)

Categories: Image/Video Processing
Pattern recognition and classification (MLR-PATT)

24 Views

A CONTRARIO PARADIGM FOR YOLO-BASED INFRARED SMALL TARGET DETECTION

Detecting small to tiny targets in infrared images is a challenging task in computer vision, especially when it comes to differentiating these targets from noisy or textured backgrounds. Traditional object detection methods such as YOLO

Poster_Ciocarlan.pdf (177)

Categories: Pattern recognition and classification (MLR-PATT)

12 Views

2D Human Pose Estimation Calibration and Keypoint Visibility Classification

The confidence scores of 2D pose estimation are widely utilized in various fields, including multi-view 3D human pose estimation, skeleton-based human tracking, human action recognition, human re-identification, etc. Despite widespread use, confidence scores from 2D pose estimation methods are unreliable in indicating the accuracy of estimation results, particularly in occlusion situations, i.e., keypoints with high confidence scores may have low accuracy and vice versa. To address this issue, we propose a new 2D human pose estimation calibration method in this paper.

icassp2024_poster.pdf (207)

Categories: Pattern recognition and classification (MLR-PATT)

36 Views

SEMANTIC DISTILLATION AND STRUCTURAL ALIGNMENT NETWORK FOR FAKE NEWS DETECTION

In recent years, the rapid proliferation of multi-modal fake news has posed potential harm across various sectors of society, making the detection of multi-modal fake news crucial. Most existing methods cannot effectively reduce redundant information while preserving both semantic and structural information. To address these problems, this paper proposes a semantic distillation and structural alignment (SDSA) network. We design a semantic distillation module for modality-specific features to preserve task-relevant semantic information and eliminate redundant information.

Semantic distillation and structural aligement network.pdf (185)

Categories: Pattern recognition and classification (MLR-PATT)

39 Views

ESA: Expert-and-Samples-Aware Incremental Learning under Longtail Distribution

Most works in class incremental learning (CIL) assume disjoint sets of classes as tasks. Although a few works deal with overlapping sets of classes, they assume either a balanced data distribution or a mildly imbalanced one. Instead, in this paper, we explore one of the understudied real-world CIL settings where (1) different tasks can share some classes but with new data samples, and (2) the training data of each task follows a long-tail distribution. We call this setting CIL-LT.

ICASSP 2024 poster.pdf (192)

Categories: Pattern recognition and classification (MLR-PATT)

49 Views

SSL-Net: A Synergistic Spectral and Learning-based Network for Efficient Bird Sound Classification

Efficient and accurate bird sound classification is of importance for ecology, habitat protection and scientific research, as it plays a central role in monitoring the distribution and abundance of species. However, prevailing methods typically demand extensively labeled audio datasets and have highly customized frameworks, imposing substantial computational and annotation loads. In this study, we present an efficient and general framework called SSL-Net, which combines spectral and learned features to identify different bird sounds.

ICASSP2024-SSLNET-Yiyuan 1.pdf (147)

Categories: Pattern recognition and classification (MLR-PATT)

7 Views

SSL-Net: A Synergistic Spectral and Learning-based Network for Efficient Bird Sound Classification

ICASSP_2024_submit_version.pdf (143)

Categories: Pattern recognition and classification (MLR-PATT)

10 Views

Neural Network Training Strategy to Enhance Anomaly Detection Performance: A Perspective on Reconstruction Loss Amplification

Unsupervised anomaly detection (UAD) is a widely adopted approach in industry because anomalies occur rarely and the data are imbalanced. A desirable characteristic of a UAD model is a contained generalization ability: the model excels at reconstructing seen normal patterns but struggles with unseen anomalies. Recent studies have sought to contain the generalization capability of their UAD models in reconstruction from different perspectives, such as the design of the neural network (NN) structure and the training strategy.

ICASSP24_MLSP-P32.9_LAMP_YeongHyeonPark.pdf (226)

Categories: Image/Video Processing
Pattern recognition and classification (MLR-PATT)

283 Views

SPASE: SPAtial Saliency Explanation for time series models

Recent advances in Machine Learning (ML), Deep Learning (DL), and Artificial Intelligence (AI) have produced models that are increasingly complex and large in terms of architecture and parameter size. These complex ML/DL models have beaten the state of the art in most fields of computer science, such as computer vision, NLP, tabular data prediction, and time series forecasting. With the increase in model performance, model explainability and interpretability have become essential to explain and justify model outcomes, especially for business use cases.

ICASSP_SPASE.pdf

Paper pre-print (178)

Categories: Pattern recognition and classification (MLR-PATT)

38 Views
