Source separation (MLR-SSEP)

Efficient Parameter Estimation for Semi-Continuous Data: An Application to Independent Component Analysis

MLSP_2019.pdf

MLSP_2019.pdf (482)

Categories:: Source separation (MLR-SSEP)

12 Views

HOW MANY FMRI SCANS ARE NECESSARY AND SUFFICIENT FOR RESTING BRAIN CONNECTIVITY ANALYSIS?

Functional connectivity analysis by detecting neuronal coactivation in the brain can be efficiently done using Resting State Functional Magnetic Resonance Imaging (rs-fMRI) analysis. Most of the existing research in this area employ correlation-based group averaging strategies of spatial smoothing and temporal normalization of fMRI scans, whose reliability of results heavily depends on the voxel resolution of fMRI scan as well as scanning duration. Scanning period from 5 to 11 minutes has been chosen by most of the studies while estimating the connectivity of brain networks.

Dash_oral_GlobalSIP_fMRI.pdf

Dash_oral_GlobalSIP_fMRI.pdf (552)

Categories:: Medical image analysis
Source separation (MLR-SSEP)

29 Views

Deep attractor networks for speaker re-identification and blind source separation

Read more about Deep attractor networks for speaker re-identification and blind source separation
Log in to post comments

Deep Clustering (DC) and Deep Attractor Networks (DANs) are a data-driven way to monaural blind source separation.
Both approaches provide astonishing single channel performance but have not yet been generalized to block-online processing.
When separating speech in a continuous stream with a block-online algorithm, it needs to be determined in each block which of the output streams belongs to whom.
In this contribution we solve this block permutation problem by introducing an additional speaker identification embedding to the DAN model structure.

2018-04-17_drude.pdf

2018-04-17_drude.pdf (644)

Categories:: Source separation (MLR-SSEP)

28 Views

TasNet: time-domain audio separation network for real-time, single-channel speech separation

Robust speech processing in multi-talker environments requires effective speech separation. Recent deep learning systems have made significant progress toward solving this problem, yet it remains challenging particularly in real-time, short latency applications. Most methods attempt to construct a mask for each source in time-frequency representation of the mixture signal which is not necessarily an optimal representation for speech separation.

ICASSP2018-poster.pdf

ICASSP2018-poster.pdf (745)

Categories:: Source Separation and Signal Enhancement
Speech Enhancement (SPE-ENHA)
Source separation (MLR-SSEP)
Neural network learning (MLR-NNLR)

86 Views

Semi-Supervised Adversarial Audio Source Separation applied to Singing Voice Extraction

The state of the art in music source separation employs neural networks trained in a supervised fashion on multi-track databases to estimate the sources from a given mixture. With only few datasets available, often extensive data augmentation is used to combat overfitting. Mixing random tracks, however, can even reduce separation performance as instruments in real music are strongly correlated. The key concept in our approach is that source estimates of an optimal separator should be indistinguishable from real source signals.

presentation_V3.pdf

Presentation slides version 3 (600)

presentation_FINAL.pdf

Presentation slides final version (847)

Categories:: Source separation (MLR-SSEP)

23 Views

SPARSE BOUNDED COMPONENT ANALYSIS FOR CONVOLUTIVE MIXTURES

Read more about SPARSE BOUNDED COMPONENT ANALYSIS FOR CONVOLUTIVE MIXTURES
Log in to post comments

In this article, we propose a Bounded Component Analysis (BCA) approach for the separation of the convolutive mixtures of sparse sources. The corresponding algorithm is derived from a geometric objective function defined over a completely deterministic setting. Therefore, it is applicable to sources which can be independent or dependent in both space and time dimensions. We show that all global optima of the proposed objective are perfect separators. We also provide numerical examples to illustrate the performance of the algorithm.

posterconvolutivesparsebca.pdf

posterconvolutivesparsebca.pdf (860)

Categories:: Source separation (MLR-SSEP)
Independent component analysis (MLR-ICAN)

33 Views

A Case Study of Machine Learning Hardware: Real-Time Source Separation using Markov Random Fields via Sampling-based Inference

ko-icassp2017-poster.pdf

ko-icassp2017-poster.pdf (830)

Categories:: DSP algorithm implementation in hardware and software
Source separation (MLR-SSEP)

22 Views

Learning complex-valued latent filters with absolute cosine similarity

Read more about Learning complex-valued latent filters with absolute cosine similarity
Log in to post comments

icassp2017_slides.pdf

icassp2017_slides.pdf (265)

Categories:: Source Separation and Signal Enhancement
Source separation (MLR-SSEP)

12 Views

Learning complex-valued latent filters with absolute cosine similarity

Read more about Learning complex-valued latent filters with absolute cosine similarity
Log in to post comments

We propose a new sparse coding technique based on the power mean of phase-invariant cosine distances. Our approach is a generalization of sparse ﬁltering and K-hyperlines clustering. It offers a better sparsity enforcer than the L1/L2 norm ratio that is typically used in sparse ﬁltering. At the same time, the proposed approach scales better than the clustering counter parts for high-dimensional input. Our algorithm fully exploits the prior information obtained by preprocessing the observed data with whitening via an efﬁcient row-wise decoupling scheme.

AnhHTNguyen_icassp2017_poster.pdf

AnhHTNguyen_icassp2017_poster.pdf (729)

Categories:: Source Separation and Signal Enhancement
Source separation (MLR-SSEP)

11 Views

LOW-LATENCY SOUND SOURCE SEPARATION USING DEEP NEURAL NETWORKS

Read more about LOW-LATENCY SOUND SOURCE SEPARATION USING DEEP NEURAL NETWORKS
Log in to post comments

Sound source separation at low-latency requires that each in- coming frame of audio data be processed at very low de- lay, and outputted as soon as possible. For practical pur- poses involving human listeners, a 20 ms algorithmic delay is the uppermost limit which is comfortable to the listener. In this paper, we propose a low-latency (algorithmic delay ≤ 20 ms) deep neural network (DNN) based source sepa- ration method.

GlobalSIP_poster2.pdf

GlobalSIP_poster2.pdf (847)

Categories:: Source Separation and Signal Enhancement
Source separation (MLR-SSEP)

32 Views

Source separation (MLR-SSEP)

Pages