ICASSP 2022 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2022 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit the website.
- Read more about GRAPH ATTENTIVE FEATURE AGGREGATION FOR TEXT-INDEPENDENT SPEAKER VERIFICATION
- Log in to post comments
The objective of this paper is to combine multiple frame-level features into a single utterance-level representation considering pair wise relationships. For this purpose, we propose a novel graph attentive feature aggregation module by interpreting each frame-level feature as a node of a graph. The inter-relationship between all possible pairs of features, typically exploited indirectly, can be directly modeled using a graph. The module comprises a graph attention layer and a graph pooling layer followed by a readout operation.
- Categories:
- Read more about MULTI-DOMAIN UNPAIRED ULTRASOUND IMAGE ARTIFACT REMOVAL USING A SINGLE CONVOLUTIONAL NEURAL NETWORK
- Log in to post comments
Ultrasound imaging (US) often suffers from distinct image artifacts from various sources. Classic approaches for solving these problems are usually model-based iterative approaches that have been developed specifically for each type of artifact, which are often computationally intensive. Recently, deep learning approaches have been proposed as computationally efficient and high performance alternatives. Unfortunately, in the current deep learning approaches, a dedicated neural network should be trained with matched training data for each artifact type.
- Categories:
This paper studies the problem of discrete super-resolution. Existing stability guarantees rely on the fact that certain sep- aration conditions are satisfied by the true support. However, such structural conditions have not been exploited in the cor- responding algorithmic designs. This paper proposes a novel Bayesian approach based on the model aggregation idea that can generate an exact sparse estimate, and maintain the re- quired structures of the support.
- Categories:
- Read more about Deep Learning for Prominence Detection in Children's Read Speech
- Log in to post comments
Poster.pdf
- Categories:
- Read more about SYNTHESIZING DYSARTHRIC SPEECH USING MULTI-SPEAKER TTS FOR DYSARTHRIC SPEECH RECOGNITION
- Log in to post comments
- Categories:
- Read more about Internet Streaming Audio Based Speech Perception Threshold Measurement in Cochlear Implant Users
- Log in to post comments
Traditional face-to-face subjective listening test has become a challenge due to the COVID-19 pandemic. We developed a remote assessment system with Tencent Meeting, a video conferencing application, to address this issue. This paper
- Categories: