Audio Coding

Decorrelation for Audio Object Coding


Object-based representations of audio content are increasingly
used in entertainment systems to deliver immersive and
personalized experiences. Efficient storage and transmission
of such content can be achieved by joint object coding algorithms
that convey a reduced number of downmix signals
together with parametric side information that enables object
reconstruction in the decoder. This paper presents an
approach to improve the performance of joint object coding
by adding one or more decorrelators to the decoding process.
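
As a rough illustration of why a decorrelator can help, consider a purely parametric decoder that reconstructs the objects from the downmix with a least-squares upmix matrix. The sketch below is not taken from the paper: the mono downmix matrix `D`, the synthetic objects, and the helper `covariance_deficit` are all assumptions made for illustration. It shows that objects reconstructed this way carry less covariance than the originals, which is the gap a decorrelator's output might be used to fill.

```python
import numpy as np

def covariance_deficit(R, D):
    """Covariance lost by least-squares reconstruction from a downmix.

    R: (N, N) object covariance, D: (M, N) downmix matrix (both assumed known).
    Hypothetical helper for illustration only.
    """
    R_y = D @ R @ D.T                  # covariance of the downmix signals
    C = R @ D.T @ np.linalg.inv(R_y)   # least-squares upmix matrix
    R_hat = C @ R_y @ C.T              # covariance of the reconstructed objects
    return R - R_hat                   # part the plain upmix cannot reproduce

# Three synthetic, mutually uncorrelated objects with different levels.
rng = np.random.default_rng(0)
gains = np.array([[1.0], [0.5], [0.25]])
objects = gains * rng.standard_normal((3, 10_000))
R = objects @ objects.T / objects.shape[1]

D = np.ones((1, 3))                    # assumed mono downmix: sum of all objects
deficit = covariance_deficit(R, D)
```

The matrix `deficit` is positive semidefinite and nonzero whenever there are fewer downmix signals than objects. A decoder could mix decorrelated versions of the downmix so that their covariance approximates it; how the paper actually does this is not stated in the abstract.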

Paper Details

Authors:
Lars Villemoes, Toni Hirvonen, and Heiko Purnhagen
Submitted On:
19 May 2017 - 7:40am

Document Files

ICASSP2017_FINAL.pdf

(27 downloads)

[1] Lars Villemoes, Toni Hirvonen, and Heiko Purnhagen, "Decorrelation for Audio Object Coding", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1795. Accessed: Jun. 24, 2017.

Pre-Echo Noise Reduction in Frequency-Domain Audio Codecs (Poster)


One of the most common yet detrimental compression artifacts in frequency-domain audio codecs is known as pre-echo, which is perceived as a brief noise preceding transient signals, and is discernable even without direct comparison to the original signal. Because of its substantial negative impact on audio quality, many techniques have been proposed to alleviate it, but not without effect on coding efficiency.

jim-poster.pdf

Paper Details

Authors:
Jimmy Lapierre, Roch Lefebvre
Submitted On:
1 March 2017 - 7:11pm

Document Files

Poster

(38 downloads)

[1] Jimmy Lapierre, Roch Lefebvre, "Pre-Echo Noise Reduction in Frequency-Domain Audio Codecs (Poster)", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1569. Accessed: Jun. 24, 2017.

Adaptive selection of lag-window shape for linear predictive analysis in the 3GPP EVS codec

Paper Details

Submitted On:
23 February 2016 - 1:44pm

Document Files

kmhGlobalSIP2015ALW_1212a.pdf

(231 downloads)

[1] "Adaptive selection of lag-window shape for linear predictive analysis in the 3GPP EVS codec", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/489. Accessed: Jun. 24, 2017.

A PACKET LOSS RECOVERY TECHNIQUE WITH LINE SPECTRAL FREQUENCY MODIFICATION IN 3GPP EVS CODEC


This paper examines a method for controlling the energy of the decoded signal at the recovery frame after a packet loss. We observed that a packet loss around a speech onset causes a sudden increase in the amplitude of the decoded signal at the recovery frame. To mitigate the artifact caused by this overshoot, we propose a detector of the speech overshoot and a method that controls the amplitude of the decoded signal by adjusting the distances between adjacent line spectral frequencies.

Paper Details

Authors:
Kei Kikuiri
Submitted On:
23 February 2016 - 1:44pm

Document Files

20151215_GlobalSIP2015_presentation.pdf

(210 downloads)

[1] Kei Kikuiri, "A PACKET LOSS RECOVERY TECHNIQUE WITH LINE SPECTRAL FREQUENCY MODIFICATION IN 3GPP EVS CODEC", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/385. Accessed: Jun. 24, 2017.

DELAY-LESS FREQUENCY DOMAIN PACKET-LOSS CONCEALMENT FOR TONAL AUDIO SIGNALS

Paper Details

Authors:
Ralph Sperschneider, Janine Sukowski
Submitted On:
23 February 2016 - 1:44pm

Document Files

FD_Tonal_PLC_GlobalSIP2015.pdf

(219 downloads)

[1] Ralph Sperschneider, Janine Sukowski, "DELAY-LESS FREQUENCY DOMAIN PACKET-LOSS CONCEALMENT FOR TONAL AUDIO SIGNALS", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/320. Accessed: Jun. 24, 2017.

Enhanced AMR-WB Bandwidth Extension in 3GPP EVS Codec


This presentation describes the bandwidth extension (BWE) method developed for the AMR-WB interoperable (AMR-WB IO) modes of the 3GPP EVS codec. The low-band signal (0-6.4 kHz) is coded using an enhanced version of ACELP, as in AMR-WB, and post-processed; the high band (above 6.4 kHz), in contrast to AMR-WB, is represented with a new BWE method. The decoded low-band excitation is adaptively extended to high frequencies and filtered in the DCT domain. The extended excitation is scaled by subframe gains and shaped by a weighted LPC synthesis filter.

Paper Details

Authors:
Magdalena Kaniewska, Stéphane Ragot, Zexin Liu, Lei Miao, Xingtao Zhang, Jon Gibbs
Submitted On:
23 February 2016 - 1:44pm

Document Files

EVS_AMR_WB_IO_BWE_GlobalSIP2015.ppt

(290 downloads)

[1] Magdalena Kaniewska, Stéphane Ragot, Zexin Liu, Lei Miao, Xingtao Zhang, Jon Gibbs, "Enhanced AMR-WB Bandwidth Extension in 3GPP EVS Codec", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/317. Accessed: Jun. 24, 2017.

A Novel Frequency Domain BWE with Relaxed Synchronization and Associated BWE Switching


This presentation describes a novel frequency domain bandwidth extension (BWE) scheme with relaxed synchronization, optimized for coding inactive and music/mixed content signals. The algorithm achieves high subjective quality at low and medium bitrates and has a low algorithmic delay. The algorithm is part of the 3GPP Enhanced Voice Services (EVS) codec. In addition to the presented algorithm, the EVS codec also employs a time domain BWE scheme optimized for active speech coding.

Paper Details

Authors:
Lei Miao, Zexin Liu, Xingtao Zhang, Chen Hu, Jon Gibbs, Kihyun Choo, Eunmi Oh
Submitted On:
23 February 2016 - 1:44pm

Document Files

EVS_FD_BWE_GlobalSIP2015.pptx

(232 downloads)

[1] Lei Miao, Zexin Liu, Xingtao Zhang, Chen Hu, Jon Gibbs, Kihyun Choo, Eunmi Oh, "A Novel Frequency Domain BWE with Relaxed Synchronization and Associated BWE Switching", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/316. Accessed: Jun. 24, 2017.

Audio Bandwidth Detection in the EVS Codec


Speech and audio codecs are usually designed such that they encode all the frequency bands of the input signal spectrum. If the higher bands do not contain any perceptually meaningful content, these codecs often do not work optimally as they assign part of the available bit budget to encode these bands. In this paper we describe a bandwidth detection algorithm that determines the effective audio bandwidth of the input signal.
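
To make the idea of an effective audio bandwidth concrete, here is a minimal sketch that compares the energy in a few frequency bands against the total energy of one frame. It is not the EVS algorithm: the function name `detect_bandwidth`, the band edges, and the -50 dB threshold are assumptions chosen only for illustration.

```python
import numpy as np

def detect_bandwidth(signal, sample_rate,
                     band_edges_hz=(4000.0, 8000.0, 16000.0),
                     threshold_db=-50.0):
    """Return the highest band edge (Hz) below which significant energy is found.

    Hypothetical helper: band edges and threshold are illustrative values,
    not taken from the EVS codec.
    """
    windowed = signal * np.hanning(len(signal))      # reduce spectral leakage
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    total = power.sum() + 1e-20                      # guard against silence

    detected = band_edges_hz[0]                      # lowest band is always kept
    lower = band_edges_hz[0]
    for upper in band_edges_hz[1:]:
        in_band = (freqs >= lower) & (freqs < upper)
        rel_db = 10.0 * np.log10(power[in_band].sum() / total + 1e-20)
        if rel_db > threshold_db:                    # band carries real content
            detected = upper
        lower = upper
    return detected

# One 1024-sample frame at 32 kHz: a 1 kHz tone, then the same tone plus 6 kHz.
fs = 32000
t = np.arange(1024) / fs
tone_low = np.sin(2 * np.pi * 1000 * t)
tone_mid = 0.1 * np.sin(2 * np.pi * 6000 * t)
narrow = detect_bandwidth(tone_low, fs)
wide = detect_bandwidth(tone_low + tone_mid, fs)
```

An encoder could use such a decision to avoid spending bits on bands that contain no meaningful content. A practical detector would also smooth the decision over many frames, which this sketch leaves out.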

Paper Details

Authors:
Milan Jelinek, Wolfgang Jaegers
Submitted On:
23 February 2016 - 1:44pm

Document Files

GlobalSIP15_BWD_presentation.ppt

(234 downloads)

[1] Milan Jelinek, Wolfgang Jaegers, "Audio Bandwidth Detection in the EVS Codec", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/315. Accessed: Jun. 24, 2017.

Linear estimation based primary-ambient extraction for stereo audio signals (slides)


Audio signals for moving pictures and video games are often linear combinations of primary and ambient components. In spatial audio analysis-synthesis, these mixed signals are usually decomposed into primary and ambient components to facilitate flexible spatial rendering and enhancement. Existing approaches such as principal component analysis (PCA) and least squares (LS) are widely used to perform this decomposition from stereo signals.
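
The PCA approach mentioned above can be sketched as follows, under the common assumption that the primary component is fully correlated between the two channels and therefore lies along the principal direction of the stereo covariance. The function name `pca_primary_ambient` and the test signal are assumptions for illustration; this is the generic PCA decomposition, not the linear estimation framework of the paper.

```python
import numpy as np

def pca_primary_ambient(stereo):
    """Split a stereo signal into primary and ambient parts with PCA.

    stereo: array of shape (2, N). Returns (primary, ambient), each (2, N).
    Hypothetical helper for illustration only.
    """
    cov = stereo @ stereo.T / stereo.shape[1]   # 2x2 channel covariance
    _, eigvecs = np.linalg.eigh(cov)            # eigenvalues in ascending order
    u = eigvecs[:, -1:]                         # principal direction, shape (2, 1)
    primary = u @ (u.T @ stereo)                # projection onto that direction
    ambient = stereo - primary                  # residual is treated as ambient
    return primary, ambient

# A single source panned with gains 1.0 (left) and 0.5 (right), no ambience.
n = np.arange(2048)
source = np.sin(2 * np.pi * 0.01 * n)
stereo = np.vstack([source, 0.5 * source])
primary, ambient = pca_primary_ambient(stereo)
```

With no ambience in the input, the whole signal is assigned to the primary component and the ambient estimate is numerically zero. With real recordings the two estimates always sum back to the input, but each contains some leakage from the other component.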

Paper Details

Authors:
Ee Leng Tan
Submitted On:
23 February 2016 - 1:43pm

Document Files

ASLP14slides_Linear estimation based primary-ambient extraction for stereo audio signals-short.pdf

(286 downloads)

[1] Ee Leng Tan, "Linear estimation based primary-ambient extraction for stereo audio signals (slides)", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/159. Accessed: Jun. 24, 2017.

Linear estimation based primary-ambient extraction for stereo audio signals


Audio signals for moving pictures and video games are often linear combinations of primary and ambient components. In spatial audio analysis-synthesis, these mixed signals are usually decomposed into primary and ambient components to facilitate flexible spatial rendering and enhancement. Existing approaches such as principal component analysis (PCA) and least squares (LS) are widely used to perform this decomposition from stereo signals.

Paper Details

Authors:
Ee Leng Tan
Submitted On:
23 February 2016 - 1:43pm

Document Files

ASLP14manuscript_Linear estimation based primary-ambient extraction for stereo audio signals.pdf

(290 downloads)

[1] Ee Leng Tan, "Linear estimation based primary-ambient extraction for stereo audio signals", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/158. Accessed: Jun. 24, 2017.