Audio and Acoustic Signal Processing

AN ATTENTION ENHANCED MULTI-TASK MODEL FOR OBJECTIVE SPEECH ASSESSMENT IN REAL-WORLD ENVIRONMENTS

slide_1603.pdf

slide_1603.pdf (351)

Categories:: Audio and Acoustic Signal Processing
Machine Learning for Signal Processing

18 Views

Source separation with weakly labelled data An approach to computational auditory scene analysis_v1.1.pdf

Source separation with weakly labelled data An approach to computational auditory scene analysis_v1.1.pdf (722)

Categories:: Audio and Acoustic Signal Processing

12 Views

URTIS: A SMALL 3D IMAGING SONAR SENSOR FOR ROBOTIC APPLICATIONS

Read more about URTIS: A SMALL 3D IMAGING SONAR SENSOR FOR ROBOTIC APPLICATIONS
Log in to post comments

CoSys_PosterURTIS_2020.pdf

CoSys_PosterURTIS_2020.pdf (432)

Categories:: Audio and Acoustic Signal Processing

27 Views

AUDIO CODEC ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS

Read more about AUDIO CODEC ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS
Log in to post comments

Audio codecs are typically transform-domain based and efficiently code stationary audio signals, but they struggle with speech and signals containing dense transient events such as applause. Specifically, with these two classes of signals as examples, we demonstrate a technique for restoring audio from coding noise based on generative adversarial networks (GAN). A primary advantage of the proposed GAN-based coded audio enhancer is that the method operates end-to-end directly on decoded audio samples, eliminating the need to design any manually-crafted frontend.

AUDIO CODEC ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS.pdf

AUDIO CODEC ENHANCEMENT WITH GENERATIVE ADVERSARIAL NETWORKS.pdf (410)

Categories:: Audio and Acoustic Signal Processing

75 Views

Duration robust weakly supervised sound event detection

Read more about Duration robust weakly supervised sound event detection
Log in to post comments

ICASSP2020_Duration_robust_Slides (1).pdf

ICASSP2020_Duration_robust_Slides (1).pdf (294)

Categories:: Audio and Acoustic Signal Processing

6 Views

Learning with Out of Distribution Data for Audio Classification

Read more about Learning with Out of Distribution Data for Audio Classification
Log in to post comments

In supervised machine learning, the assumption that training data is labelled correctly is not always satisfied. In this paper, we investigate an instance of labelling error for classification tasks in which the dataset is corrupted with out-of-distribution (OOD) instances: data that does not belong to any of the target classes, but is labelled as such. We show that detecting and relabelling certain OOD instances, rather than discarding them, can have a positive effect on learning.

Presentation.pdf

Presentation.pdf (581)

Categories:: Audio and Acoustic Signal Processing

18 Views

A-CRNN: A DOMAIN ADAPTATION MODEL FOR SOUND EVENT DETECTION

Read more about A-CRNN: A DOMAIN ADAPTATION MODEL FOR SOUND EVENT DETECTION
Log in to post comments

This paper presents a domain adaptation model for sound event detection. A common challenge for sound event detection is how to deal with the mismatch among different datasets. Typically, the performance of a model will decrease if it is tested on a dataset which is different from the one that the model is trained on. To address this problem, based on convolutional recurrent neural networks (CRNNs), we propose an adapted CRNN (A-CRNN) as an unsupervised adversarial domain adaptation model for sound event detection.

Presentation slides.pdf

Presentation slides (366)

Categories:: Audio and Acoustic Signal Processing

110 Views

A Sequence Matching Network for Polyphonic Sound Event Localization and Detection

Read more about A Sequence Matching Network for Polyphonic Sound Event Localization and Detection
Log in to post comments

Polyphonic sound event detection and direction-of-arrival estimation require different input features from audio signals. While sound event detection mainly relies on time-frequency patterns, direction-of-arrival estimation relies on magnitude or phase differences between microphones. Previous approaches use the same input features for sound event detection and direction-of-arrival estimation, and train the two tasks jointly or in a two-stage transfer-learning manner.

ICASSP2020 - Tho Nguyen - NTU - Sequence Matching - Sigpost.pdf

ICASSP2020 - Tho Nguyen - NTU - Sequence Matching - Sigpost.pdf (433)

Categories:: Content-Based Audio Processing
Audio and Acoustic Signal Processing

76 Views

HIGH PERFORMANCE SUPERVISED TIME-DELAY ESTIMATION USING NEURAL NETWORKS

Read more about HIGH PERFORMANCE SUPERVISED TIME-DELAY ESTIMATION USING NEURAL NETWORKS
Log in to post comments

Time-delay estimation is an essential building block of many signal processing applications. This paper follows up on earlier work for acoustic source localization and time delay estimation using pattern recognition techniques; it presents high performance results obtained with supervised training of neural networks which challenge the state of the art and compares its performance to that of well-known methods such as the Generalized Cross-Correlation or Adaptive Eigenvalue Decomposition.

paperludwighouegnigan_shortversion.pdf

paperludwighouegnigan_shortversion.pdf (597)

Categories:: Multi-channel Signal Processing
Neural network learning (MLR-NNLR)
Audio and Acoustic Signal Processing

94 Views

WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION

Read more about WASPAA 2019 POSTER: MULTIPLE HYPOTHESIS TRACKING FOR OVERLAPPING SPEAKER SEGMENTATION
Log in to post comments

Speaker segmentation is an essential part of any diarization system.Applications of diarization include tasks such as speaker indexing, improving automatic speech recognition (ASR) performance and making single speaker-based algorithms available for use in multi-speaker environments.This paper proposes a multiple hypothesis tracking (MHT) method that exploits the harmonic structure associated with the pitch in voiced speech in order to segment the onsets and end-points of speech from multiple, overlapping speakers.

Poster___WASPAA_2019.pdf

Poster___WASPAA_2019.pdf (477)

Categories:: Audio and Acoustic Signal Processing

73 Views