Other

CO-OCCURRENCE GRAPH-ENHANCED HIERARCHICAL PREDICTION OF ICD CODES

Read more about CO-OCCURRENCE GRAPH-ENHANCED HIERARCHICAL PREDICTION OF ICD CODES
Log in to post comments

Recent healthcare applications of natural language processing involve multi-label classification of health records using the International Classification of Diseases (ICD). While prior research highlights intricate text models and explores external knowledge like hierarchical ICD ontology, fewer studies integrate code relationships from whole datasets to enhance ICD coding accuracy. This study presents a modular approach, sequentially combining graph-based integration of ICD code co-occurrence with a hard-coded hierarchical enriched text representation drawn from the ICD ontology.

Poster_for_ICASSP_2024__CO_OCCURRENCE_GRAPH_ENHANCED_HIERARCHICAL_PREDICTION_OF_ICD_CODES_final.pdf

Poster_for_ICASSP_2024__CO_OCCURRENCE_GRAPH_ENHANCED_HIERARCHICAL_PREDICTION_OF_ICD_CODES_final.pdf (209)

Categories:: Other

17 Views

Enabling Device Control Planning Capabilities of Small Language Model

Read more about Enabling Device Control Planning Capabilities of Small Language Model
Log in to post comments

Smart home device control is a difficult task if the instruction is abstract and the planner needs to adjust dynamic home configurations. With the increasing capability of Large Language Model (LLM), they have become the customary model for zero-shot planning tasks similar to smart home device control. Although cloud supported large language models can seamlessly do device control tasks, on-device small language models show limited capabilities. In this work, we show how we can leverage large language models to enable small language models for device control task.

icassp_sudipta.pptx

icassp_sudipta.pptx (192)

Categories:: Other

18 Views

dklement_dvbx_slides

Read more about dklement_dvbx_slides
Log in to post comments

Bayesian HMM clustering of x-vector sequences (VBx) has become a widely adopted diarization baseline model in publications and challenges. It uses an HMM to model speaker turns, a generatively trained probabilistic linear discriminant analysis (PLDA) for speaker distribution modeling, and Bayesian inference to estimate the assignment of x-vectors to speakers. This paper presents a new framework for updating the VBx parameters using discriminative training, which directly optimizes a predefined loss.

DVBx-slides_fin.pdf

DVBx-slides_fin.pdf (282)

Categories:: Other

23 Views

Quantum Federated Learning with Quantum Networks PPT

Read more about Quantum Federated Learning with Quantum Networks PPT
Log in to post comments

A major concern of deep learning models is the large amount of data that is required to build and train them, much of which is reliant on sensitive and personally identifiable information that is vulnerable to access by third parties. Ideas of using the quantum internet to address this issue have been previously proposed, which would enable fast and completely secure online communications. Previous work has yielded a hybrid quantum-classical transfer learning scheme for classical data and communication with a hub-spoke topology.

Quantum Federated Learning with Quantum Networks ICASSP 2024.pptx

Quantum Federated Learning with Quantum Networks ICASSP 2024.pptx (184)

Categories:: Other

86 Views

Object Trajectory Estimation with Multi-Band Wi-Fi Neural Dynamic Fusion

Read more about Object Trajectory Estimation with Multi-Band Wi-Fi Neural Dynamic Fusion
Log in to post comments

In contrast to existing multi-band Wi-Fi fusion in a frame-to-frame basis for simple classification, this paper considers asynchronous sequence-to-sequence fusion between sub-7GHz channel state information (CSI) and 60GHz beam SNR for more challenging downstream tasks such as continuous regression.

icassp_skato_release.pptx

icassp_skato_release.pptx (120)

Categories:: Other

46 Views

Poster for ICASSP 2024 paper "Turn-taking and Backchannel Prediction with Acoustic and Large Language Model Fusion"

We propose an approach for continuous prediction of turn-taking and backchanneling locations in spoken dialogue by fusing a neural acoustic model with a large language model (LLM). Experiments on the Switchboard human-human conversation dataset demonstrate that our approach consistently outperforms the baseline models with single modality. We also develop a novel multi-task instruction fine-tuning strategy to further benefit from LLM-encoded knowledge for understanding the tasks and conversational contexts, leading to additional improvements.

Turn-taking LLM ICASSP2024 Poster_v2 (1).pdf

Turn-taking LLM ICASSP2024 Poster_v2 (1).pdf (154)

Categories:: Other

58 Views

Poster for ICASSP 2024 paper "Hot-Fixing Wake Work Recognition for End-to-End ASR via Neural Model Reprogramming"

This paper proposes two novel variants of neural reprogramming to enhance wake word recognition in streaming end-to-end ASR models without updating model weights. The first, "trigger-frame reprogramming", prepends the input speech feature sequence with the learned trigger-frames of the target wake word to adjust ASR model’s hidden states for improved wake word recognition. The second, "predictor-state initialization", trains only the initial state vectors (cell and hidden states) of the LSTMs in the prediction network.

WW_HF_w_NP_ICASSP2024 Poster.pdf

WW_HF_w_NP_ICASSP2024 Poster.pdf (133)

Categories:: Other

18 Views

Improving Medical Dialogue Generation with Abstract Meaning Representations

Read more about Improving Medical Dialogue Generation with Abstract Meaning Representations
Log in to post comments

Medical Dialogue Generation plays a critical role in telemedicine by facilitating the dissemination of medical expertise to patients. Existing studies focus on incorporating textual representations, which have limited their ability to represent text semantics, such as ignoring important medical entities.

Improving__Medical_Dialogue_Generation_with_Abstract_Meaning_Representations__ICASSP_2024_.pdf

paper (234)

oral_icassp.pptx

slides (222)

Categories:: Other
Knowledge and Data Engineering
Spoken Language Processing

14 Views

Privacy Preserving Federated Learning from Multi-input Functional Proxy Re-encryption

Read more about Privacy Preserving Federated Learning from Multi-input Functional Proxy Re-encryption
Log in to post comments

Federated learning (FL) allows different participants to collaborate on model training without transmitting raw data, thereby protecting user data privacy. However, FL faces a series of security and privacy issues (e.g. the leakage of raw data from publicly shared parameters). Several privacy protection technologies, such as homomorphic encryption, differential privacy and functional encryption, are introduced for privacy enhancement in FL. Among them, the FL frameworks based on functional encryption better balance security and performance, thus receiving increasing attention.

ICASSP24_Poster___MI_FPRE (1).pdf

ICASSP__24___Poster___MI_FPRE (1).pdf (192)

Categories:: Other

29 Views

Poster for IMAGE ATTRIBUTION BY GENERATING IMAGES

Read more about Poster for IMAGE ATTRIBUTION BY GENERATING IMAGES
Log in to post comments

We introduce GPNN-CAM, a novel method for CNN explanation, that bridges two distinct areas of computer vision:
Image Attribution, which aims to explain a predictor by highlighting image regions it finds important, and Single
Image Generation (SIG), that focuses on learning how to generate variations of a single sample. GPNN-CAM leverages samples generated by Generative

ICASSP-poster.pdf

ICASSP-poster.pdf (137)

Categories:: Other

19 Views

Pages