Sorry, you need to enable JavaScript to visit this website.

The recent standard on Enhanced Voiced Services (EVS) contains two memory-less gain coding mechanisms achieving better performance than the prediction-based techniques used in 3GPP AMR-WB and ITU-T G.729 codecs. The EVS gain encoder uses joint vector quantization without the need of information from previous frames. Inter-frame prediction is replaced by alternative schemes based on sub-frame prediction or estimated average target signal energy. This eliminates the propagation of error inside the adaptive codebook.

Categories:
5 Views

One of the key issues in spatial audio analysis and reproduction is to decompose a signal into primary and ambient components based on their directional and diffuse spatial features, respectively. Existing approaches employed in primary-ambient extraction (PAE), such as principal component analysis (PCA), are mainly based on a basic stereo signal model. The performance of these PAE approaches has not been well studied for the input signals that do not satisfy all the assumptions of the stereo signal model.

Categories:
3 Views

In spatial audio analysis-synthesis, one of the key issues is to decompose a signal into primary and ambient components based on their spatial features. Principal component analysis (PCA) has been widely employed in primary component extraction, and shifted PCA (SPCA) is employed to enhance the primary extraction for input signals involving the inter-channel time difference. However, SPCA generally requires the primary components to come from one direction and cannot produce good results in the case of multiple directions.

Categories:
4 Views

Individualization of head-related transfer functions (HRTFs) can be realized using the person’s anthropometry with a pre-trained model. This model usually establishes a direct linear or non-linear mapping from anthropometry to HRTFs in the training database. Due to the complex relation between anthropometry and HRTFs, the accuracy of this model depends heavily on the correct selection of the anthropometric features.

Categories:
128 Views

In spatial audio analysis-synthesis, one of the key issues is to decompose a signal into primary and ambient components based on their spatial features. Stereo audio signals are often modeled as a linear mixture of primary and ambient components. Existing approaches like principal component analysis (PCA) and least squares (LS) have been widely employed to extract primary and ambient components from stereo signals. However, the performance and comparisons of these approaches in primary-ambient extraction (PAE) have not been well studied.

Categories:
6 Views

In spatial audio analysis-synthesis, one of the key issues is to decompose a signal into primary and ambient components based on their spatial features. Stereo audio signals are often modeled as a linear mixture of primary and ambient components. Existing approaches like principal component analysis (PCA) and least squares (LS) have been widely employed to extract primary and ambient components from stereo signals. However, the performance and comparisons of these approaches in primary-ambient extraction (PAE) have not been well studied.

Categories:
16 Views

The diversity of today’s playback systems requires a flexible, efficient, and immersive reproduction of sound scenes in digital media. Spatial audio reproduction based on primary-ambient extraction (PAE) fulfills this objective, where accurate extraction of primary and ambient components from sound mixtures in channel-based audio is crucial. Severe extraction error was found in existing PAE approaches when dealing with sound mixtures that contain a relatively strong ambient component, a commonly encountered case in the sound scenes of digital media.

Categories:
6 Views

Parallel-form narrowband feedback active noise control (FBANC) system has been shown to perform better than conventional internal model control (IMC) based FBANC system in cancelling multi-tonal noise. A previous paper illustrated a novel approach in estimating the frequencies of the multi-tone noise, and using an internal tonal generator cum frequency grouping unit to increase its frequency separation in each channel of the parallel-form FBANC system based on a full-band error.

Categories:
5 Views

Parallel-form narrowband feedback active noise control (FBANC) system has been shown to perform better than conventional internal model control (IMC) based FBANC system in cancelling multi-tonal noise. A previous paper illustrated a novel approach in estimating the frequencies of the multi-tone noise, and using an internal tonal generator cum frequency grouping unit to increase its frequency separation in each channel of the parallel-form FBANC system based on a full-band error.

Categories:
12 Views

Individualization of head-related transfer functions (HRTFs) can be realized using the person’s anthropometry with a pre-trained model. This model usually establishes a direct linear or non-linear mapping from anthropometry to HRTFs in the training database. Due to the complex relation between anthropometry and HRTFs, the accuracy of this model depends heavily on the correct selection of the anthropometric features.

Categories:
4 Views

Pages