ICASSP 2023

IEEE ICASSP 2023, the IEEE International Conference on Acoustics, Speech and Signal Processing, is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2023 conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics, and will provide an opportunity to network with like-minded professionals from around the world.

Model-Free Learning of Optimal Beamformers for Passive IRS-Assisted Sumrate Maximization

Although Intelligent Reflective Surfaces (IRSs) are a cost-effective technology promising high spectral efficiency in future wireless networks, obtaining optimal IRS beamformers is a challenging problem with several practical limitations. Assuming fully-passive, sensing-free IRS operation, we introduce a new data-driven Zeroth-order Stochastic Gradient Ascent (ZoSGA) algorithm for sumrate optimization in an IRS-aided downlink setting.
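The abstract above names a zeroth-order stochastic gradient ascent scheme without giving its details. As a rough illustration of the general idea only, and not the authors' ZoSGA algorithm, the following sketch maximizes a function using function evaluations alone, via a standard two-point random-direction gradient estimate. The names `zo_gradient`, `zo_sga`, and `toy_objective`, the step size, and the smoothing parameter `mu` are all assumptions made for this example; the toy concave objective merely stands in for a sumrate that could only be measured, not differentiated.

```python
import random


def zo_gradient(f, x, mu, rng):
    """Two-point zeroth-order gradient estimate of f at x.

    Only evaluations of f are used: x is perturbed along a random
    Gaussian direction u, and u is scaled by the central finite
    difference of f along that direction.
    """
    u = [rng.gauss(0.0, 1.0) for _ in x]
    x_plus = [xi + mu * ui for xi, ui in zip(x, u)]
    x_minus = [xi - mu * ui for xi, ui in zip(x, u)]
    slope = (f(x_plus) - f(x_minus)) / (2.0 * mu)
    return [slope * ui for ui in u]


def zo_sga(f, x0, step=0.05, mu=1e-3, iters=2000, seed=0):
    """Gradient ascent driven by zeroth-order gradient estimates."""
    rng = random.Random(seed)
    x = list(x0)
    for _ in range(iters):
        g = zo_gradient(f, x, mu, rng)
        # Ascent step: move along the estimated gradient.
        x = [xi + step * gi for xi, gi in zip(x, g)]
    return x


def toy_objective(x):
    # Hypothetical stand-in objective: concave, maximized at (1, -2).
    return -((x[0] - 1.0) ** 2 + (x[1] + 2.0) ** 2)


x_hat = zo_sga(toy_objective, [0.0, 0.0])
print(x_hat)
```

In this toy setting the iterate approaches the maximizer (1, -2); a real IRS problem would replace `toy_objective` with measured sumrate and would need to respect the constraints on the beamformer parameters, which this sketch ignores.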

ZoSGA_Poster.pdf

Poster (239)

Categories: Learning theory and algorithms (MLR-LEAR)
Communication Systems and Applications

48 Views

A study on the invariance in security whatever the dimension of images for the steganalysis by deep-learning

In this paper, we study the performance invariance of convolutional neural networks when confronted with variable image sizes in the context of a more "wild steganalysis". First, we propose two algorithms and definitions for a fine experimental protocol with datasets having "similar difficulty" and "similar security". The "smart crop 2" algorithm allows the introduction of the Nearly Nested Image Datasets (NNID), which ensure "a similar difficulty" between various datasets, and a dichotomous search algorithm allows a "similar security".

ICASSP2023_PLANOLLES_CHAUMONT_COMBY_Security_Invariance[1].pdf

The ICASSP 2023 article (199)

Categories: Watermarking and Steganography

13 Views

Robust FIR Filters for Wireless Low-frequency Sound Zones

Low-frequency personal sound zones can be created by controlling the sound pressure in separate spatially confined regions. The performance of a sound zone system using wireless communication may be degraded due to potential packet losses. In this paper, we propose robust FIR filters for low-frequency sound zone systems by incorporating information about the expected packet losses into the design.

Robust FIR Filters for Wireless Low-frequency Sound Zones.pdf (191)

Zhou ICASSP_Slides.pdf

ICASSP 2023 Slides by Mo Zhou (180)

Zhou Poster.pdf

ICASSP 2023 Poster by Mo Zhou (203)

Categories: Room Acoustics and Acoustic System Modeling

23 Views

EXPLOITING PRNU AND LINEAR PATTERNS IN FORENSIC CAMERA ATTRIBUTION UNDER COMPLEX LENS DISTORTION CORRECTION

More complex and ever more common lens distortion correction post-processing is seriously hampering state-of-the-art camera attribution techniques. In this paper, we show that

ICASSP_OFF.pdf (134)

Categories: Multimedia Forensics

16 Views

PHONATION MODE DETECTION IN SINGING: A SINGER ADAPTED MODEL

Phonation modes play a vital role in voice quality evaluation and vocal health diagnosis. Existing studies on phonation modes cover feature analysis and classification of vowels, which does not apply to real-life scenarios. In this paper, we define the phonation mode detection (PMD) problem, which entails the prediction of phonation mode labels as well as their onset and offset timestamps.

ICASSP2023_poster.pdf (227)

Categories: Music Signal Processing

49 Views

Spatial Graph Signal Interpolation with an Application for Merging BCI Datasets with Various Dimensionalities

BCI Motor Imagery datasets are usually small and have different electrode setups. When training a Deep Neural Network, one may want to capitalize on all these datasets to increase the amount of data available and hence obtain good generalization results.

6456.pdf (182)

Categories: Biomedical signal processing

27 Views

Spatial Active Noise Control Method Based on Sound Field Interpolation From Reference Microphone Signals

icassp2023_arikawa.pdf (148)

Categories: Spatial and Multichannel Audio

23 Views

Amplitude Matching for Multizone Sound Field Control

icassp2023_abe.pdf (153)

Categories: Spatial and Multichannel Audio

20 Views

ASSD: Synthetic Speech Detection in the AAC Compressed Domain

Synthetic human speech signals have become very easy to generate given modern text-to-speech methods. When these signals are shared on social media, they are often compressed using the Advanced Audio Coding (AAC) standard. Our goal is to study whether a small set of coding metadata contained in the AAC compressed bit stream is sufficient to detect synthetic speech. This would avoid decompressing the speech signals before analysis. We call our proposed method AAC Synthetic Speech Detection (ASSD).

icassp_assd_slides_v03.pdf (165)

Categories: Multimedia Forensics
Speech Analysis (SPE-ANLS)

32 Views

F0 ESTIMATION FROM TELEPHONE SPEECH USING DEEP FEATURE LOSS

Accurate pitch estimation in speech signals plays a vital role in several applications. Robust pitch estimation in telephone speech is still a challenge due to the narrow bandwidth of the signal. The electroglottograph (EGG) signal is a reliable means of pitch estimation; however, it is not practically possible to

4051_poster_and_paper_preprint.zip

Poster and paper preprint (160)

Categories: Other

17 Views
