- Read more about Divergence Based Weighting for Information Channels in Deep Convolutional Neural Networks for Bird Audio Detection
- Log in to post comments
In this paper, we address the problem of bird audio detec-
tion and propose a new convolutional neural network archi-
tecture together with a divergence based information channel
weighing strategy in order to achieve improved state-of-the-
art performance and faster convergence. The effectiveness of
the methodology is shown on the Bird Audio Detection Chal-
lenge 2018 (Detection and Classification of Acoustic Scenes
and Events Challenge, Task 3) development data set.
- Categories:
- Read more about SPATIALLY ADAPTIVE LOSSES FOR VIDEO SUPER-RESOLUTION WITH GANS
- Log in to post comments
ICASSP_PPT.pdf
- Categories:
- Read more about Stochatic Adaptive Neural Architecture Search
- Log in to post comments
- Categories:
- Read more about Improve Diverse Text Generation by Self Labeling Conditional Variational Auto Encoder
- Log in to post comments
Diversity plays a vital role in many text generating applications. In recent years, Conditional Variational Auto Encoders (CVAE) have shown promising performances for this task. However, they often encounter the so called KL-Vanishing problem. Previous works mitigated such problem by heuristic methods such as strengthening the encoder or weakening the decoder while optimizing the CVAE objective function. Nevertheless, the optimizing direction of these methods are implicit and it is hard to find an appropriate degree to which these methods should be applied.
slcvae.pptx
- Categories:
- Read more about An End-to-End Network to Synthesize Intonation using a Generalized Command Response Model - Poster
- Log in to post comments
The generalized command response (GCR) model represents intonation as a
superposition of muscle responses to spike command signals. We have previously
shown that the spikes can be predicted by a two-stage system, consisting of a recurrent neural network and a post-processing procedure, but the responses themselves were fixed dictionary atoms. We propose an end-to-end
neural architecture that replaces the dictionary atoms with trainable
second-order recurrent elements analogous to recursive filters. We demonstrate
- Categories:
- Read more about 1-D Convolutional Neural Networks for Signal Processing Applications
- Log in to post comments
1D Convolutional Neural Networks (CNNs) have recently become the state-of-the-art technique for crucial signal processing applications such as patient-specific ECG classification, structural health monitoring, anomaly detection in power electronics circuitry and motor-fault detection. This is an expected outcome as there are numerous advantages of using an adaptive and compact 1D CNN instead of a conventional (2D) deep counterparts.
- Categories:
- Read more about DEEP LEARNING THE EEG MANIFOLD FOR PHONOLOGICAL CATEGORIZATION FROM ACTIVE THOUGHTS
- Log in to post comments
Speech-related Brain Computer Interfaces (BCI) aim primarily at finding an alternative vocal communication pathway for
people with speaking disabilities. As a step towards full decoding of imagined speech from active thoughts, we present a
BCI system for subject-independent classification of phonological categories exploiting a novel deep learning based
- Categories:
- Read more about Missing Data In Traffic Estimation: A Variational Autoencoder Imputation Method
- Log in to post comments
Road traffic forecasting systems are in scenarios where sensor or system failure occur. In those scenarios, it is known that missing values negatively affect estimation accuracy although it is being often underestimate in current deep neural network approaches. Our assumption is that traffic data can be generated from a latent space. Thus, we propose an online unsupervised data imputation method based on learning the data distribution using a variational autoencoder (VAE).
- Categories:
- Read more about Blind Room Volume Estimation from Single-Channel Noisy Speech
- Log in to post comments
Recent work on acoustic parameter estimation indicates that geometric room volume can be useful for modeling the character of an acoustic environment. However, estimating volume from audio signals remains a challenging problem. Here we propose using a convolutional neural network model to estimate the room volume blindly from reverberant single-channel speech signals in the presence of noise. The model is shown to produce estimates within approximately a factor of two to the true value, for rooms ranging in size from small offices to large concert halls.
- Categories:
- Categories: