
Recently, speaker code based adaptation has been successfully extended to recurrent neural networks using bidirectional Long Short-Term Memory (BLSTM-RNN) [1]. Experiments on the small-scale TIMIT task have demonstrated that speaker code based adaptation is also valid for BLSTM-RNN. In this paper, we evaluate this method on a large-scale task and introduce an error normalization method to balance the back-propagation errors derived from different layers for speaker codes. In addition, we use singular value decomposition (SVD) to compress the model.


We present a geometry-inspired characterization of
target response for active sonar that exploits similarity between
intra-class features to distinguish between different targets
against environmental objects such as rocks. The key innovation is to
represent feature manifolds as a set of ellipsoids, each of which
geometrically encompasses a unique physical characteristic of
the target’s response. We have demonstrated over experimental
field data that for a given target class, these feature ellipsoids


With the strong growth of assistive and personal listening devices, natural sound rendering over headphones is becoming a necessity for prolonged listening in multimedia and virtual reality applications. The aim of natural sound rendering is to recreate sound scenes with spatial and timbral quality that is as natural as possible, so as to achieve a truly immersive listening experience. However, rendering natural sound over headphones poses many challenges. This tutorial article presents signal processing techniques that tackle these challenges to assist human listening.

