Linear estimation based primary-ambient extraction for stereo audio signals
- Citation Author(s):
- Submitted by:
- Jianjun HE
- Last updated:
- 23 February 2016 - 1:43pm
- Document Type:
- Research Manuscript
Audio signals for moving pictures and video games are often linear combinations of primary and ambient components. In spatial audio analysis-synthesis, these mixed signals are usually decomposed into primary and ambient components to facilitate flexible spatial rendering and enhancement. Existing approaches such as principal component analysis (PCA) and least squares (LS) are widely used to perform this decomposition from stereo signals. However, the performance of these approaches in primary-ambient extraction (PAE) has not been well studied and no comparative analysis among the existing approaches has been carried out so far. In this paper, we generalize the existing approaches into a linear estimation framework. Under this framework, we propose a series of performance measures to identify the components that contribute to the extraction error. Based on the generalized linear estimation framework and our proposed performance measures, a comparative study and experimental testing of the linear estimation based PAE approaches including existing PCA, LS, and three proposed variant LS approaches are presented.
This paper has been published in IEEE/ACM Trans. Audio, Speech, Lang. Processing, vol. 22. no. 2, pp. 505-517, Feb. 2014.
For published version of the paper, please visit: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6698287