IEEE ICASSP 2023 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2023 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.
- Read more about Model-Free Learning of Optimal Beamformers for Passive IRS-Assisted Sumrate Maximization
- Log in to post comments
Although Intelligent Reflective Surfaces (IRSs) are a cost-effective technology promising high spectral efficiency in future wireless networks, obtaining optimal IRS beamformers is a challenging problem with several practical limitations. Assuming fully-passive, sensing- free IRS operation, we introduce a new data-driven Zeroth-order Stochastic Gradient Ascent (ZoSGA) algorithm for sumrate optimization in an IRS-aided downlink setting.
ZoSGA_Poster.pdf
- Categories:
- Read more about A study on the invariance in security whatever the dimension of images for the steganalysis by deep-learning
- Log in to post comments
In this paper, we study the performance invariance of convolutional neural networks when confronted with variable image sizes in the context of a more ”wild steganalysis”. First, we propose two algorithms and definitions for a fine experimental protocol with datasets owning ”similar difficulty” and ”similar security”. The ”smart crop 2” algorithm allows the introduction of the Nearly Nested Image Datasets (NNID) that ensure ”a similar difficulty” between various datasets, and a dichotomous research algorithm allows a ”similar security”.
- Categories:
Low frequency personal sound zones can be created by controlling the sound pressure in separate spatially confined regions. The performance of a sound zone system using wireless communication may be degraded due to potential packet losses. In this paper, we propose robust FIR filters for low-frequency sound zone system by incorporating information about the expected packet losses into the design.
- Categories:
- Read more about EXPLOITING PRNU AND LINEAR PATTERNS IN FORENSIC CAMERA ATTRIBUTION UNDER COMPLEX LENS DISTORTION CORRECTION
- Log in to post comments
More complex and ever more common lens distortion correction post-processing is seriously hampering state-of-the-art camera attribution techniques. In this paper, we show that
- Categories:
Phonation modes play a vital role in voice quality evaluation and vocal health diagnosis. Existing studies on phonation modes cover feature analysis and classification of vowels, which does not apply to real-life scenarios. In this paper, we define the phonation mode detection (PMD) problem, which entails the prediction of phonation mode labels as well as their onset and offset timestamps.
- Categories:
- Read more about Spatial Graph Signal Interpolation with an Application for Merging BCI Datasets with Various Dimensionalities
- Log in to post comments
BCI Motor Imagery datasets usually are small and have different electrodes setups. When training a Deep Neural Network, one may want to capitalize on all these datasets to increase the amount of data available and hence obtain good generalization results.
- Categories:
- Read more about Spatial Active Noise Control Method Based on Sound Field Interpolation From Reference Microphone Signals
- Log in to post comments
- Categories:
- Categories:
- Read more about ASSD: Synthetic Speech Detection in the AAC Compressed Domain
- Log in to post comments
Synthetic human speech signals have become very easy to generate given modern text-to-speech methods. When these signals are shared on social media they are often compressed using the Advanced Audio Coding (AAC) standard. Our goal is to study if a small set of coding metadata contained in the AAC compressed bit stream is sufficient to detect synthetic speech. This would avoid decompressing of the speech signals before analysis. We call our proposed method AAC Synthetic Speech Detection (ASSD).
- Categories:
- Read more about F0 ESTIMATION FROM TELEPHONE SPEECH USING DEEP FEATURE LOSS
- 1 comment
- Log in to post comments
Accurate pitch estimation in speech signal plays a vital role in several applications. Robust pitch estimation in telephone speech is still a challenge due to the narrow bandwidth of the signal. Electroglottograph (EGG) signal is a reliable means for pitch estimation, however, it’s not practically possible to
- Categories: