Other

Recurrent 3-D Multi-level Visual Transformer for Joint Classification of Heterogeneous 2-D and 3-D Radiographic Data

Recent advancements in artificial intelligence algorithms for medical imaging show significant potential in automating the detection of lung infections from chest radiograph scans. However, current approaches often focus solely on either 2-D or 3-D scans, failing to leverage the combined advantages of both modalities. Moreover, conventional slice-based methods place a manual burden on radiologists for slice selection.

Paper_Multimodal_CT_X_Ray_in_ICIP_2024 Final Version.pdf

Paper_Multimodal_CT_X_Ray_in_ICIP_2024 Final Version.pdf (111)

Categories:: Other

24 Views

SEMI-SUPERVISED GRAPHICAL DEEP DICTIONARY LEARNING FOR HYPERSPECTRAL IMAGE CLASSIFICATION FROM LIMITED SAMPLES

In this work, we propose a semi-supervised deep feature generation network that accounts for local similarities. It is based on the deep dictionary learning (DDL) framework. The formulation accounts for two unique aspects of hyperspectral classification. First, the fact that the total number of pixels / samples to be labeled is constant; this allows for a semi-supervised formulation allowing only a few pixels / samples to be labeled as training data. Second, the samples / pixels are spatially correlated; this leads to a graph regularization formulation.

ICIP_poster_2.pdf

ICIP_poster_2.pdf (106)

Categories:: Other

12 Views

FOLLOWING THE EMBEDDING: IDENTIFYING TRANSITION PHENOMENA IN WAV2VEC 2.0 REPRESENTATIONS OF SPEECH AUDIO

Although transformer-based models have improved the state-of-the-art in speech recognition, it is still not well understood what information from the speech signal these models encode in their latent representations. This study investigates the potential of using labelled data (TIMIT) to probe wav2vec 2.0 embeddings for insights into the encoding and visualisation of speech signal information at phone boundaries. Our experiment involves training probing models to detect phone-specific articulatory features in the hidden layers based on IPA classifications.

ICASSP2024_poster_follwing_the_embedding.pdf

ICASSP2024_poster_follwing_the_embedding.pdf (283)

Categories:: Speech Analysis (SPE-ANLS)
General Topics in Speech Recognition (SPE-GASR)
Other

81 Views

ENHANCING NOISY LABEL LEARNING VIA UNSUPERVISED CONTRASTIVE LOSS WITH LABEL CORRECTION BASED ON PRIOR KNOWLEDGE

To alleviate the negative impacts of noisy labels, most of the noisy label learning (NLL) methods dynamically divide the training data into two types, “clean samples” and “noisy samples”, in the training process. However, the conventional selection of clean samples heavily depends on the features learned in the early stages of training, making it difficult to guarantee the cleanliness of the selected samples in scenarios where the noise ratio is high.

ICASSP2024_poster_kashiwagi_final.pdf

ICASSP2024_poster (273)

Categories:: Other

78 Views

A_Binary_BP_Decoding_using_Posterior_Adjustment_for_Quantum_LDPC_Codes

Read more about A_Binary_BP_Decoding_using_Posterior_Adjustment_for_Quantum_LDPC_Codes
Log in to post comments

Although BP decoders are efficient and provide significant
performance for classical low-density parity-check (LDPC)
codes, they will suffer a degradation in performance for quantum
LDPC (QLDPC) codes due to the limitations in the quantum
field. In this paper, we propose a posterior adjustment of
either a single qubit or multiple qubits within binary belief
propagation (BP). The adjustment process changes the posterior
likelihood ratio for one or multiple qubits according to the

2024icassp.pptx

2024icassp.pptx (209)

Categories:: Other

79 Views

Inference of genetic effects via Approximate Message Passing

Read more about Inference of genetic effects via Approximate Message Passing
Log in to post comments

Efficient utilization of large-scale biobank data is crucial for inferring the genetic basis of disease and predicting health outcomes from the DNA. Yet we lack efficient, accurate methods that scale to data where electronic health records are linked to whole genome sequence information. To address this issue, our paper develops a new algorithmic paradigm based on Approximate Message Passing (AMP), which is specifically tailored for genomic prediction and association testing.

ICASSP24_presentation.pdf

ICASSP24_presentation.pdf (147)

Categories:: Other

14 Views

EFFICIENT VIDEO AND AUDIO PROCESSING WITH LOIHI 2

Read more about EFFICIENT VIDEO AND AUDIO PROCESSING WITH LOIHI 2
Log in to post comments

Loihi 2 is a fully event-based neuromorphic processor that supports a wide range of synaptic connectivity configurations and temporal neuron dynamics. Loihi 2's temporal and event-based paradigm is naturally well-suited to processing data from an event-based sensor, such as a Dynamic Vision Sensor (DVS) or a Silicon Cochlea. However, this begs the question: How general are signal processing efficiency gains on Loihi 2 versus conventional computer architectures?

ICASSP Presentation.pptx

ICASSP Presentation.pptx (196)

Categories:: Other

65 Views

Slides for Renyi Divergences Learning for explainable classification of SAR Image Pairs

We consider the problem of classifying a pair of Synthetic Aperture Radar (SAR) images by proposing an explainable and frugal algorithm that integrates a set of divergences. The approach relies on a statistical framework that takes standard probability distributions into account for modelling SAR data. Then, by learning a combination of parameterized Renyi divergences and their parameters from the data, we are able to classify the pair of images with fewer parameters than regular machine learning approaches while also allowing an interpretation of the results related to the priors used.

main.pdf

Slides presentation for paper RENYI DIVERGENCES LEARNING FOR EXPLAINABLE CLASSIFICATION OF SAR IMAGE PAIRS (195)

Categories:: Information-theoretic learning (MLR-INFO)
Other

24 Views

GENERATING PERSONA-AWARE EMPATHETIC RESPONSES WITH RETRIEVAL-AUGMENTED PROMPT LEARNING

Read more about GENERATING PERSONA-AWARE EMPATHETIC RESPONSES WITH RETRIEVAL-AUGMENTED PROMPT LEARNING
2 comments
Log in to post comments

Empathetic response generation requires perceiving and un- derstanding the user’s emotion to deliver a suitable response. However, existing models generally remain oblivious of an interlocutor’s persona, which has been shown to play a vital role in expressing appropriate empathy to different users. To address this problem, we propose a novel Transformer-based architecture that incorporates retrieval-augmented prompt learning to generate persona-aware empathetic responses.

ICASSPslide_llwang.pptx

ICASSPslide_llwang.pptx (182)

Categories:: Other

42 Views

Enhanced Axle-Based Vehicle Classification Using Angle-Based Micro-Doppler Signature

Read more about Enhanced Axle-Based Vehicle Classification Using Angle-Based Micro-Doppler Signature
Log in to post comments

This study introduces an angle-based micro-Doppler analysis using Frequency Modulated Continuous Wave (FMCW) radar tailored for axle-based vehicle classification. The novel approach exploits the signal angle of arrival to separate incoming signals and noise from distinct targets. This is done by analysing the phase difference of a dual antenna radar system based on the time-frequency representation of the radar beat signal. Vehicles driving side by side can now be discriminated. Multipath signals and clutter are more easily identified and filtered out.

Poster_ICASSP_A0_PDF.pdf

Poster_ICASSP_A0_PDF.pdf (181)

Categories:: Other

13 Views

Pages