ICASSP 2022

ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2022 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.

Poster: ICASSP2022-2228: Cramer-Rao Bound for the Time-Varying Poisson

Read more about Poster: ICASSP2022-2228: Cramer-Rao Bound for the Time-Varying Poisson
Log in to post comments

Point processes are finding increasing applications in neuroscience, genomics, and social media. But basic modelling properties are little studied. Here we consider a periodic time-varying Poisson model and develop the asymptotic Cramer-Rao bound. We also develop, for the first time, a maximum likelihood algorithm for parameter estimation.

crb-poster-v2.pdf

crb-poster-v2.pdf (347)

Categories:: Signal and System Modeling, Representation and Estimation

59 Views

Attributable Watermarking of Speech Generative Models

Read more about Attributable Watermarking of Speech Generative Models
Log in to post comments

Generative models are now capable of synthesizing images, speeches, and videos that are hardly distinguishable from authentic contents. Such capabilities cause concerns such as malicious impersonation and IP theft. This paper investigates a solution for model attribution, i.e., the classification of synthetic contents by their source models via watermarks embedded in the contents.

icassep ppt.pptx

icassep ppt.pptx (326)

Categories:: Watermarking and Steganography

109 Views

SEGNET-BASED DEEP REPRESENTATION LEARNING FOR DYSPHAGIA CLASSIFICATION

Read more about SEGNET-BASED DEEP REPRESENTATION LEARNING FOR DYSPHAGIA CLASSIFICATION
Log in to post comments

poster_5466.pdf

Dysphagia Classification - Poster (267)

Categories:: Biomedical signal processing

83 Views

MULTIMODAL EMOTION RECOGNITION WITH SURGICAL AND FABRIC MASKS

Read more about MULTIMODAL EMOTION RECOGNITION WITH SURGICAL AND FABRIC MASKS
Log in to post comments

In this study, we investigate how different types of masks affect automatic emotion classification in different channels of audio, visual, and multimodal. We train emotion classification models for each modality with the original data without mask and the re-generated data with mask respectively, and investigate how muffled speech and occluded facial expressions change the prediction of emotions.

ICASSP_poster.pdf

ICASSP_poster.pdf (441)

Categories:: Bio Imaging and Signal Processing

76 Views

ICASSP - Sequential MCMC methods for audio signal enhancement

Read more about ICASSP - Sequential MCMC methods for audio signal enhancement
1 comment
Log in to post comments

With the aim of addressing audio signal restoration as a sequential inference problem, we build upon Gabor regression to propose a state-space model for audio time series. Exploiting the structure of our model, we devise a sequential Markov chain Monte Carlo algorithm to explore the sequence of filtering distributions of the synthesis coefficients. The algorithm is then tested on a series of denoising examples.

ICASSP_poster.pdf

ICASSP poster 8 (418)

Categories:: Source Separation and Signal Enhancement

53 Views

A MULTITASK LEARNING FRAMEWORK FOR SPEAKER CHANGE DETECTION WITH CONTENT INFORMATION FROM UNSUPERVISED SPEECH DECOMPOSITION

2022_ICASSP_Poster_HangSu.pdf

2022_ICASSP_Poster_HangSu.pdf (301)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)

46 Views

SELF-KNOWLEDGE DISTILLATION BASED SELF-SUPERVISED LEARNING FOR COVID-19 DETECTION FROM CHEST X-RAY IMAGES

1206-3.pdf

1206-3.pdf (325)

Categories:: Medical image analysis

149 Views

TRIBYOL: TRIPLET BYOL FOR SELF-SUPERVISED REPRESENTATION LEARNING

Read more about TRIBYOL: TRIPLET BYOL FOR SELF-SUPERVISED REPRESENTATION LEARNING
Log in to post comments

1917-1.pdf

1917-1.pdf (345)

Categories:: Neural network learning (MLR-NNLR)

163 Views

FRAUG: A FRAME RATE BASED DATA AUGMENTATION METHOD FOR DEPRESSION DETECTION FROM SPEECH SIGNALS

icassp_2381_fraug_slides.pptx

icassp_2381_fraug_slides.pptx (619)

Categories:: Speech Analysis (SPE-ANLS)

61 Views

Investigating the Potential of Auxiliary-Classifier GANs for Image Classification in Low Data Regimes

Generative Adversarial Networks (GANs) have shown promise in augmenting datasets and boosting convolutional neural networks' (CNN) performance on image classification tasks. But they introduce more hyperparameters to tune as well as the need for additional time and computational power to train supplementary to the CNN. In this work, we examine the potential for Auxiliary-Classifier GANs (AC-GANs) as a 'one-stop-shop' architecture for image classification, particularly in low data regimes.

Dravid_ICASSP_oral_2022_Slides.pdf

Dravid_ICASSP_oral_2022_Slides.pdf (436)

Categories:: Other
Other

50 Views

Pages