Sorry, you need to enable JavaScript to visit this website.

Magnetic resonance (MR) plays an important role in medical imaging. It can be flexibly tuned towards different applications for deriving a meaningful diagnosis. However, its long acquisition times and flexible parametrization make it on the other hand prone to artifacts which obscure the underlying image content or can be misinterpreted as anatomy. Patient-induced motion artifacts are still one of the major extrinsic factors which degrade image quality.

Categories:
94 Views

For many years, i-vector based audio embedding techniques were the dominant approach for speaker verification and speaker diarization applications. However, mirroring the rise of deep learning in various domains, neural network based audio embeddings, also known as d-vectors, have consistently demonstrated superior speaker verification performance. In this paper, we build on the success of d-vector based speaker verification systems to develop a new d-vector based approach to speaker diarization.

Categories:
42 Views

Attention-based models have recently shown great performance on a range of tasks, such as speech recognition, machine translation, and image captioning due to their ability to summarize relevant information that expands through the entire length of an input sequence. In this paper, we analyze the usage of attention mechanisms to the problem of sequence summarization in our end-to-end text-dependent speaker recognition system. We explore different topologies and their variants of the attention layer, and compare different pooling methods on the attention weights.

Categories:
157 Views

Deep neural networks inherently have large representational power for approximating complex target functions. However,

Categories:
5 Views

The signal processing (SP) landscape has been enriched by recent advances in artificial intelligence (AI) and machine learning (ML), especially since 2010 or so, yielding new tools for signal estimation, classification, prediction, and manipulation. Layered signal representations, nonlinear function approximation, and nonlinear signal prediction are now feasible at very large scale in both dimensionality and data size.

Categories:
2619 Views

Artery/vein classification in fundus images is a prerequisite for the assessment of diseases such as diabetes, hypertension or other cardiovascular pathologies. One clinical measure used to assess the severity of cardiovascular risk is the retinal arterio-venous ratio (AVR), which significantly depends on the accuracy of vessel classification into arteries or veins. This paper proposes a novel method for artery/vein classification combining deep learning and graph propagation strategies.

Categories:
39 Views

We present an efficient multi-image super resolution (MISR) method. Our solution consists of a L1-norm optimized reconstruction scheme for super resolution (SR), and a three-layer convolutional network for artifacts removal, in a concatenated fashion. Such a two-stage method achieves excellent performance, which outperforms the existing state-of-the-art SR methods in both subjective and objective measurements (e.g., 5 to 7 dB improvements on popular image database using PSNR metric).

Categories:
22 Views

Deep neural networks have led to dramatic improvements in performance for many machine learning tasks, yet the mathematical reasons for this success remain largely unclear. In this talk we present recent developments in the mathematical framework of convolutive neural networks (CNN). In particular we discuss the scattering network of Mallat and how it relates to another problem in harmonic analysis, namely the phase retrieval problem. Then we discuss the general convolutive neural network from a theoretician point of view.

Categories:
12 Views

Pages