
This paper investigates negative sampling for contrastive learning in the context of audio-text retrieval. The strategy for negative sampling refers to selecting negatives (either audio clips or textual descriptions) from a pool of candidates for a positive audio-text pair. We explore sampling strategies via model-estimated within-modality and cross-modality relevance scores for audio and text samples. With a constant training setting on the retrieval system from [1], we study eight sampling strategies, including hard and semi-hard negative sampling.
- Categories:

- Read more about Cross-Lingual Transfer Learning for Alzheimer’s Detection from Spontaneous Speech
- Log in to post comments
Alzheimer’s disease (AD) is a progressive neurodegenerative disease most often associated with memory deficits and cognitive decline. With the aging population, there has been much interest in automated methods for cognitive impairment detection. One approach that has attracted attention in recent years is AD detection through spontaneous speech. While the results are promising, it is not certain whether the learned speech features can be generalized across languages. To fill this gap, the ADReSS-M challenge was organized.
- Categories:

- Read more about IQGAN: Robust Quantum Generative Adversarial Network for Image Synthesis On NISQ Devices
- Log in to post comments
- Categories:

- Categories:

- Read more about Transient Dictionary Learning for Compressed Time-of-Flight Imaging
- Log in to post comments
Time-of-Flight imaging aims to retrieve the 3D geometry of a scene from the delay that a modulated light waveform experiences when interacting with the former.
Multi-path interference, arising from translucent objects or concave geometries, poses a challenge when the problem is to be solved from few measurements.
2023039374.pdf

- Categories:

- Read more about Transient Dictionary Learning for Compressed Time-of-Flight Imaging
- Log in to post comments
Time-of-Flight imaging aims to retrieve the 3D geometry of a scene from the delay that a modulated light waveform experiences when interacting with the former.
Multi-path interference, arising from translucent objects or concave geometries, poses a challenge when the problem is to be solved from few measurements.
- Categories:

- Read more about Transient Dictionary Learning for Compressed Time-of-Flight Imaging
- Log in to post comments
Time-of-Flight imaging aims to retrieve the 3D geometry of a scene from the delay that a modulated light waveform experiences when interacting with the former.
Multi-path interference, arising from translucent objects or concave geometries, poses a challenge when the problem is to be solved from few measurements.
- Categories:

- Read more about Single-shot Fractional Fourier Phase Retrieval
- Log in to post comments
Traditional phase retrieval is generally concerned with recovering a signal from its Fourier magnitude measurements whose inherent ambiguities make this problem especially difficult. In this work, we present an efficient phase retrieval technique from the single fractional Fourier transform (FrFT) magnitude measurement. Specifically, the FrFT measurement can be well-combined with signal priors via a generalized alternating projection framework, which can effectively alleviate the ambiguities of phase retrieval and the stagnation problem of numerical iterative processes.
- Categories:

- Read more about Permutation Entropy for Graph Signals
- Log in to post comments
Entropy metrics (for example, permutation entropy) are nonlinear measures of irregularity in time series (one-dimensional data). Some of these entropy metrics can be generalised to data on periodic structures such as a grid or lattice pattern (two-dimensional data) using its symmetry, thus enabling their application to images. However, these metrics have not been developed for signals sampled on irregular domains, defined by a graph.
- Categories:

- Read more about Constrained Autoencoders: Incorporating equality constraints in learned scientific data compression
- Log in to post comments
As today's scientific simulations with high-performance computing produce an extremely large amount of data, reliable data compression techniques are becoming important. In scientific applications, downstream quantities derived from the original or primary data (PD) are crucial for post-analysis, and therefore these Quantities of Interest (QoI) are required to be preserved during compression. While autoencoders (AEs) have recently been used for the compression of scientific datasets, there is a clear lack in the literature w.r.t.
- Categories: