Audio Coding

A PACKET LOSS RECOVERY TECHNIQUE WITH LINE SPECTRAL FREQUENCY MODIFICATION IN 3GPP EVS CODEC


This paper examines a method for controlling the energy of the decoded signal in the recovery frame that follows a packet loss. We observed that a packet loss around a speech onset causes a sudden increase in the amplitude of the decoded signal in the recovery frame. To mitigate the artifact caused by this overshoot, we propose a detector for the speech overshoot together with a method that controls the amplitude of the decoded signal by adjusting the distances between adjacent line spectral frequencies.
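As a rough illustration of why the spacing of line spectral frequencies (LSFs) affects amplitude: closely spaced LSF pairs correspond to sharp peaks in the LPC spectral envelope, while uniformly spaced LSFs correspond to a flat envelope. The sketch below is not the paper's detector or adjustment rule; it only shows one simple way to widen LSF gaps, by interpolating toward a uniform grid. The function name `widen_lsf_spacing`, the blending factor `alpha`, and the example values are assumptions made for illustration.

```python
import numpy as np

def widen_lsf_spacing(lsf_hz, fs_hz, alpha):
    """Pull sorted LSFs toward a uniform grid (hypothetical helper).

    alpha = 0 leaves the LSFs unchanged; alpha = 1 yields uniformly
    spaced LSFs, which correspond to a flat LPC envelope.
    """
    lsf_hz = np.asarray(lsf_hz, dtype=float)
    order = len(lsf_hz)
    # Uniform grid strictly inside (0, fs/2).
    uniform = np.arange(1, order + 1) * (fs_hz / 2.0) / (order + 1)
    # A convex combination of two sorted vectors stays sorted.
    return (1.0 - alpha) * lsf_hz + alpha * uniform

# Illustrative 8th-order LSF vector (Hz) with several narrow pairs.
lsf = np.array([300.0, 350.0, 1200.0, 1300.0, 2500.0, 2600.0, 3400.0, 3500.0])
adjusted = widen_lsf_spacing(lsf, fs_hz=8000.0, alpha=0.5)
print(np.diff(lsf).min(), np.diff(adjusted).min())
```

In this example the smallest gap between adjacent LSFs grows after the adjustment, which flattens the corresponding envelope peak. A real decoder would choose when and how strongly to apply such a change based on its own overshoot detector.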

Paper Details

Authors:
Kei Kikuiri
Submitted On:
23 February 2016 - 1:44pm

Document Files

20151215_GlobalSIP2015_presentation.pdf

[1] Kei Kikuiri, "A PACKET LOSS RECOVERY TECHNIQUE WITH LINE SPECTRAL FREQUENCY MODIFICATION IN 3GPP EVS CODEC", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/385. Accessed: Nov. 17, 2019.

DELAY-LESS FREQUENCY DOMAIN PACKET-LOSS CONCEALMENT FOR TONAL AUDIO SIGNALS

Paper Details

Authors:
Ralph Sperschneider, Janine Sukowski
Submitted On:
23 February 2016 - 1:44pm

Document Files

FD_Tonal_PLC_GlobalSIP2015.pdf

[1] Ralph Sperschneider, Janine Sukowski, "DELAY-LESS FREQUENCY DOMAIN PACKET-LOSS CONCEALMENT FOR TONAL AUDIO SIGNALS", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/320. Accessed: Nov. 17, 2019.

Enhanced AMR-WB Bandwidth Extension in 3GPP EVS Codec


This presentation describes the bandwidth extension (BWE) method developed for the AMR-WB interoperable (AMR-WB IO) modes of the 3GPP EVS codec. The low band (0-6.4 kHz) is coded with an enhanced version of ACELP, as in AMR-WB, and then post-processed; the high band (above 6.4 kHz), in contrast to AMR-WB, is represented with a new BWE method. The decoded low-band excitation is adaptively extended to high frequencies and filtered in the DCT domain. The extended excitation is then scaled by subframe gains and shaped by a weighted LPC synthesis filter.
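The last step of that chain (subframe gain scaling followed by a weighted LPC synthesis filter) can be sketched generically as follows. This is a minimal sketch under stated assumptions, not the EVS implementation: the weighting factor `gamma`, the equal-length subframe layout, and all function names are hypothetical, and the adaptive extension and DCT-domain filtering are not reproduced.

```python
import numpy as np

def weighted_lpc(a, gamma):
    """Coefficients of A(z/gamma): a[k] * gamma**k (bandwidth expansion)."""
    a = np.asarray(a, dtype=float)
    return a * gamma ** np.arange(len(a))

def all_pole_filter(a, x):
    """Synthesis filter 1/A(z): y[n] = x[n] - sum_{k>=1} a[k] * y[n-k]."""
    y = np.zeros(len(x))
    for n in range(len(x)):
        acc = x[n]
        for k in range(1, min(len(a), n + 1)):
            acc -= a[k] * y[n - k]
        y[n] = acc
    return y

def shape_high_band(excitation, subframe_gains, a, gamma=0.9):
    """Scale each equal-length subframe by its gain, then apply 1/A(z/gamma)."""
    excitation = np.asarray(excitation, dtype=float)
    gains = np.asarray(subframe_gains, dtype=float)
    sub_len = len(excitation) // len(gains)
    scaled = excitation[: sub_len * len(gains)] * np.repeat(gains, sub_len)
    return all_pole_filter(weighted_lpc(a, gamma), scaled)

# Toy input: a unit impulse, two subframes of four samples each.
impulse = np.zeros(8)
impulse[0] = 1.0
out = shape_high_band(impulse, subframe_gains=[2.0, 1.0], a=[1.0, -0.9], gamma=0.5)
print(out[:3])
```

With `gamma` below 1 the poles of the synthesis filter move toward the origin, so the spectral shaping is smoother than with the unweighted LPC filter.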

Paper Details

Authors:
Magdalena Kaniewska, Stéphane Ragot, Zexin Liu, Lei Miao, Xingtao Zhang, Jon Gibbs
Submitted On:
23 February 2016 - 1:44pm

Document Files

EVS_AMR_WB_IO_BWE_GlobalSIP2015.ppt

[1] Magdalena Kaniewska, Stéphane Ragot, Zexin Liu, Lei Miao, Xingtao Zhang, Jon Gibbs, "Enhanced AMR-WB Bandwidth Extension in 3GPP EVS Codec", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/317. Accessed: Nov. 17, 2019.

A Novel Frequency Domain BWE with Relaxed Synchronization and Associated BWE Switching


This presentation describes a novel frequency domain bandwidth extension (BWE) scheme with relaxed synchronization, optimized for coding inactive and music/mixed content signals. The algorithm achieves high subjective quality at low and medium bitrates with a low algorithmic delay. It is part of the 3GPP Enhanced Voice Services (EVS) codec. In addition to the presented algorithm, the EVS codec also employs a time domain BWE scheme optimized for active speech coding.

Paper Details

Authors:
Lei Miao, Zexin Liu, Xingtao Zhang, Chen Hu, Jon Gibbs, Kihyun Choo, Eunmi Oh
Submitted On:
23 February 2016 - 1:44pm

Document Files

EVS_FD_BWE_GlobalSIP2015.pptx

[1] Lei Miao, Zexin Liu, Xingtao Zhang, Chen Hu, Jon Gibbs, Kihyun Choo, Eunmi Oh, "A Novel Frequency Domain BWE with Relaxed Synchronization and Associated BWE Switching", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/316. Accessed: Nov. 17, 2019.

Audio Bandwidth Detection in the EVS Codec


Speech and audio codecs are usually designed to encode all the frequency bands of the input signal spectrum. If the higher bands do not contain any perceptually meaningful content, these codecs often work suboptimally because they spend part of the available bit budget on encoding these bands. In this paper we describe a bandwidth detection algorithm that determines the effective audio bandwidth of the input signal.
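The general idea of detecting the effective bandwidth can be sketched by comparing per-band spectral energies against a threshold. The sketch below is not the algorithm from the paper: the band edges, the labels, the `threshold_db` value, and the function name `effective_bandwidth` are assumptions chosen only to make the idea concrete.

```python
import numpy as np

# Assumed band edges in Hz; labels follow common codec terminology.
BANDS = [("NB", 0.0, 4000.0), ("WB", 4000.0, 8000.0), ("SWB", 8000.0, 16000.0)]

def effective_bandwidth(x, fs, threshold_db=-50.0):
    """Return the label of the highest band holding non-negligible energy."""
    x = np.asarray(x, dtype=float)
    spectrum = np.abs(np.fft.rfft(x * np.hanning(len(x)))) ** 2
    freqs = np.fft.rfftfreq(len(x), d=1.0 / fs)
    energies = []
    for _, lo, hi in BANDS:
        mask = (freqs >= lo) & (freqs < hi)
        energies.append(spectrum[mask].mean() if mask.any() else 0.0)
    ref = max(energies)
    label = BANDS[0][0]
    if ref <= 0.0:
        return label  # silent input: fall back to the lowest band
    for (name, _, _), energy in zip(BANDS, energies):
        # Keep the highest band whose mean energy is within the threshold.
        if energy > 0.0 and 10.0 * np.log10(energy / ref) > threshold_db:
            label = name
    return label

fs = 32000
t = np.arange(4096) / fs
low_tone = np.sin(2 * np.pi * 1000.0 * t)
with_wb = low_tone + 0.5 * np.sin(2 * np.pi * 6000.0 * t)
print(effective_bandwidth(low_tone, fs), effective_bandwidth(with_wb, fs))
```

A codec that knows the input is effectively narrowband, even though it is sampled at a higher rate, can then avoid spending bits on the empty upper bands. A practical detector would also smooth the decision over time to avoid switching on isolated frames.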

Paper Details

Authors:
Milan Jelinek, Wolfgang Jaegers
Submitted On:
23 February 2016 - 1:44pm

Document Files

GlobalSIP15_BWD_presentation.ppt

[1] Milan Jelinek, Wolfgang Jaegers, "Audio Bandwidth Detection in the EVS Codec", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/315. Accessed: Nov. 17, 2019.

Linear estimation based primary-ambient extraction for stereo audio signals (slides)


Audio signals for moving pictures and video games are often linear combinations of primary and ambient components. In spatial audio analysis-synthesis, these mixed signals are usually decomposed into primary and ambient components to facilitate flexible spatial rendering and enhancement. Existing approaches such as principal component analysis (PCA) and least squares (LS) are widely used to perform this decomposition from stereo signals.
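For reference, the PCA-based decomposition mentioned above can be sketched in a few lines: the primary component is the projection of the stereo signal onto the principal eigenvector of its 2x2 covariance matrix, and the ambient component is the residual. This is a minimal sketch of the generic PCA approach, not the linear estimation method presented in these slides; the function name and the synthetic test signal are assumptions.

```python
import numpy as np

def pca_primary_ambient(stereo):
    """Split a (2, n_samples) stereo signal into primary and ambient parts.

    Assumes zero-mean channels, as is typical for audio.
    """
    stereo = np.asarray(stereo, dtype=float)
    cov = stereo @ stereo.T / stereo.shape[1]
    # eigh returns eigenvalues in ascending order; take the largest.
    _, eigvecs = np.linalg.eigh(cov)
    u = eigvecs[:, -1]
    primary = np.outer(u, u @ stereo)  # projection onto principal direction
    ambient = stereo - primary         # residual
    return primary, ambient

# Synthetic mix: one amplitude-panned source plus weak uncorrelated noise.
rng = np.random.default_rng(0)
n = 4800
source = np.sin(2 * np.pi * 440.0 * np.arange(n) / 48000.0)
mix = np.vstack([1.0 * source, 0.5 * source]) + 0.05 * rng.standard_normal((2, n))
primary, ambient = pca_primary_ambient(mix)
print(np.sum(primary ** 2), np.sum(ambient ** 2))
```

Because the extracted primary component is rank one, its two channels are fully correlated, which is one of the known limitations of the plain PCA approach.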

Paper Details

Authors:
Ee Leng Tan
Submitted On:
23 February 2016 - 1:43pm

Document Files

ASLP14slides_Linear estimation based primary-ambient extraction for stereo audio signals-short.pdf

[1] Ee Leng Tan, "Linear estimation based primary-ambient extraction for stereo audio signals (slides)", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/159. Accessed: Nov. 17, 2019.

Linear estimation based primary-ambient extraction for stereo audio signals


Audio signals for moving pictures and video games are often linear combinations of primary and ambient components. In spatial audio analysis-synthesis, these mixed signals are usually decomposed into primary and ambient components to facilitate flexible spatial rendering and enhancement. Existing approaches such as principal component analysis (PCA) and least squares (LS) are widely used to perform this decomposition from stereo signals.

Paper Details

Authors:
Ee Leng Tan
Submitted On:
23 February 2016 - 1:43pm

Document Files

ASLP14manuscript_Linear estimation based primary-ambient extraction for stereo audio signals.pdf

[1] Ee Leng Tan, "Linear estimation based primary-ambient extraction for stereo audio signals", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/158. Accessed: Nov. 17, 2019.
