Neural network learning (MLR-NNLR)

GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION

In this paper, we propose a new loss function called generalized end-to-end (GE2E) loss, which makes the training of speaker verification models more efficient than our previous tuple-based end-to-end (TE2E) loss function. Unlike TE2E, the GE2E loss function updates the network in a way that emphasizes examples that are difficult to verify at each step of the training process. Additionally, the GE2E loss does not require an initial stage of example selection.
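The abstract above does not spell out the loss itself. As a rough illustration only, the following NumPy sketch uses the softmax form of GE2E as it is commonly described: each utterance embedding is scored against every speaker centroid with a scaled cosine similarity, and the centroid of the utterance's own speaker is computed without that utterance. The function name `ge2e_softmax_loss`, the fixed values of `w` and `b` (learnable in practice), and the averaging over utterances are assumptions made for this example, not details taken from the abstract.

```python
import numpy as np

def ge2e_softmax_loss(emb, w=10.0, b=-5.0):
    """Softmax-style GE2E loss sketch.

    emb: array of shape (n_speakers, n_utterances, dim), n_utterances >= 2.
    w, b: similarity scale and offset (assumed fixed here for simplicity).
    """
    n_spk, n_utt, _ = emb.shape

    def unit(v):
        # L2-normalise along the last axis.
        return v / np.linalg.norm(v, axis=-1, keepdims=True)

    emb = unit(emb)
    # Per-speaker centroids, shape (n_speakers, dim).
    centroids = emb.mean(axis=1)
    # Leave-one-out centroids: the own-speaker centroid without the utterance itself.
    loo = (emb.sum(axis=1, keepdims=True) - emb) / (n_utt - 1)

    # Cosine similarity of every utterance to every centroid: (n_spk, n_utt, n_spk).
    sim = np.einsum("jid,kd->jik", emb, unit(centroids))
    # Replace the own-speaker entries with the leave-one-out similarity.
    own = np.einsum("jid,jid->ji", emb, unit(loo))
    idx = np.arange(n_spk)
    sim[idx, :, idx] = own

    scores = w * sim + b
    # Numerically stable log-sum-exp over the centroid axis.
    peak = scores.max(axis=-1, keepdims=True)
    log_norm = peak[..., 0] + np.log(np.exp(scores - peak).sum(axis=-1))
    target = scores[idx, :, idx]  # score against the true speaker, (n_spk, n_utt)
    # Averaged over utterances here; a sum is an equally valid convention.
    return float((log_norm - target).mean())
```

Because every utterance in the batch is compared with every centroid in a single similarity matrix, no separate tuple-selection stage is needed, and utterances that sit close to a wrong centroid contribute most to the loss.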

ICASSP 2018 GE2E.pptx (0)

Categories:: Speaker Recognition and Characterization (SPE-SPKR)
Neural network learning (MLR-NNLR)

162 Views

A Random Matrix and Concentration Inequalities framework for Neural Networks Analysis

Our article provides a theoretical analysis of the asymptotic performance of a regression or classification task performed by a simple random neural network. This result is obtained by leveraging a new framework at the crossroads between random matrix theory and concentration of measure theory. This approach is of utmost interest for neural network analysis at large in that it naturally sidesteps the difficulty induced by the non-linear activation functions, so long as these are Lipschitz functions.
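For orientation, one common reading of a "simple random neural network" is a single hidden layer with fixed random weights, a Lipschitz activation, and an output layer fitted by ridge regression. The sketch below is a minimal version under that assumption. The name `random_feature_ridge`, the ReLU activation, and the defaults for `n_hidden` and `gamma` are choices made for this example and are not taken from the abstract.

```python
import numpy as np

def random_feature_ridge(X_train, y_train, X_test, n_hidden=256, gamma=1e-2, seed=0):
    """Fit a ridge-regression output layer on fixed random ReLU features."""
    rng = np.random.default_rng(seed)
    d = X_train.shape[1]
    # Random first layer, drawn once and never trained.
    W = rng.standard_normal((d, n_hidden)) / np.sqrt(d)

    def features(X):
        # ReLU is 1-Lipschitz, matching the Lipschitz assumption in the abstract.
        return np.maximum(X @ W, 0.0)

    H_train = features(X_train)
    n = H_train.shape[0]
    # Ridge solution: beta = (H^T H / n + gamma I)^{-1} H^T y / n.
    gram = H_train.T @ H_train / n + gamma * np.eye(n_hidden)
    beta = np.linalg.solve(gram, H_train.T @ y_train / n)
    return features(X_test) @ beta
```

In this setting the test error depends on the data only through the random feature Gram matrix, which is the kind of object random matrix theory is suited to describing.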

conc_measure_NN_ICASSP18(3).pdf (426)

Categories:: Neural network learning (MLR-NNLR)

41 Views

Joint Verification-Identification in End-to-End Multi-Scale CNN Framework for Topic Identification

We present an end-to-end multi-scale Convolutional Neural Network (CNN) framework for topic identification (topic ID). In this work, we examine a multi-scale CNN for classification using raw text input. Topical word embeddings are learnt at multiple scales using parallel convolutional layers. A technique to integrate verification and identification objectives is examined to improve topic ID performance. With this approach, we achieved significant improvement in the identification task. We evaluated our framework on two contrasting

Final.pdf (506)

Categories:: Spoken Language Understanding (SLP-UNDE)
Neural network learning (MLR-NNLR)

44 Views

Speaker Invariant Feature Extraction for Zero-Resource Languages with Adversarial Learning

We introduce a novel type of representation learning to obtain a speaker invariant feature for zero-resource languages. Speaker adaptation is an important technique to build a robust acoustic model. For a zero-resource language, however, conventional model-dependent speaker adaptation methods such as constrained maximum likelihood linear regression are insufficient because the acoustic model of the target language is not accessible. Therefore, we introduce a model-independent feature extraction based on a neural network.

speaker-invariant-feature-extraction-for-zero-resource-languages-with-adversarial-learning.pdf (731)

Categories:: Neural network learning (MLR-NNLR)
Statistical Signal Processing

22 Views

A MULTI-CAMERA DEEP NEURAL NETWORK FOR DETECTING ELEVATED ALERTNESS IN DRIVERS

We present a system for the detection of elevated levels of driver alertness in driver-facing video captured from multiple viewpoints. This problem is important in automotive safety as a helpful feedback signal to determine driver engagement and as a means of automatically flagging anomalous driving events. We generated a dataset of videos from 25 participants, each overseeing an hour of driving sequences in a simulator that mixed normal and near-miss driving events.

poster.pdf (476)

Categories:: Neural network learning (MLR-NNLR)

12 Views

CATSEYES: Categorizing Seismic structures with tessellated scattering wavelet networks

As field seismic data sizes are dramatically increasing toward exabytes, automating the labeling of "structural monads" (corresponding to geological patterns and yielding subsurface interpretation) in a huge amount of available information would drastically reduce interpretation time. Since customary designed features may not account for gradual deformations observable in seismic data, we propose to adapt the wavelet-based scattering network methodology with a tessellation of geophysical images.

Bhalgat-Yash-2018-p-icassp-catseyes-classification-seismic-structure-scattering-transform.pdf

Supervised seismic structure classification clustering with wavelet scattering networks (678)

Categories:: Neural network learning (MLR-NNLR)
Image Formation

230 Views

A Large-Scale Study Of Language Models for Chord Prediction

icassp2018.pdf (126)

Categories:: Neural network learning (MLR-NNLR)

6 Views

Cofnet: Predict with Confidence

POSTER.pdf (1496)

Categories:: Neural network learning (MLR-NNLR)

7 Views

A Supervised STDP-Based Training Algorithm for Living Neural Networks

Neural networks have shown great potential in many applications like speech recognition, drug discovery, image classification, and object detection. Neural network models are inspired by biological neural networks, but they are optimized to perform machine learning tasks on digital computers.

Poster_icassp.pdf (2051)

Categories:: Neural network learning (MLR-NNLR)

6 Views

SOLVING LINEAR INVERSE PROBLEMS USING GAN PRIORS: AN ALGORITHM WITH PROVABLE GUARANTEES

In recent works, both sparsity-based and learning-based methods have proven successful in solving several challenging linear inverse problems. However, sparsity priors for natural signals and images suffer from poor discriminative capability, while learning-based methods seldom provide concrete theoretical guarantees. In this work, we advocate the idea of replacing hand-crafted priors, such as sparsity, with a Generative Adversarial Network (GAN) to solve linear inverse problems such as compressive sensing.
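The abstract does not describe the algorithm, so the following is only a toy illustration of what constraining a solution to the range of a generator can look like. It runs projected gradient descent on a compressive sensing problem y = A x, with a linear map `G` standing in for a trained GAN generator so that the projection onto its range has a closed form. The function name `pgd_with_linear_generator`, the step size, and the linear stand-in are all assumptions for this sketch. A real GAN generator is non-linear and would need an inner optimisation for the projection step.

```python
import numpy as np

def pgd_with_linear_generator(A, y, G, step=0.5, n_iter=200):
    """Recover x from y = A x under the prior that x lies in range(G).

    A: (m, n) measurement matrix, y: (m,) measurements,
    G: (n, k) linear stand-in for a generator (hypothetical, not a real GAN).
    """
    # Orthonormal basis of range(G); projecting x onto the range is U U^T x.
    U, _ = np.linalg.qr(G)
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        # Gradient step on the data-fit term 0.5 * ||y - A x||^2.
        x = x + step * A.T @ (y - A @ x)
        # Projection step: snap back onto the range of the generator.
        x = U @ (U.T @ x)
    return x
```

With far fewer measurements than signal dimensions (for example m = 25, n = 50, k = 5), the generator prior is what makes the problem well posed, playing the role a sparsity prior would play in classical compressive sensing.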

poster_icassp.pdf (614)

Categories:: Neural network learning (MLR-NNLR)
Sampling and Reconstruction

7 Views
