Sorry, you need to enable JavaScript to visit this website.

Spatial and Multichannel Audio

Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides)


With the strong growth of assistive and personal listening devices, natural sound rendering over headphones is becoming a necessity for prolonged listening in multimedia and virtual reality applications. The aim of natural sound rendering is to naturally recreate the sound scenes with the spatial and timbral quality as natural as possible, so as to achieve a truly immersive listening experience. However, rendering natural sound over headphones encounters many challenges. This tutorial article presents signal processing techniques to tackle these challenges to assist human listening.

Paper Details

Authors:
Kaushik Sunder, Ee-Leng Tan
Submitted On:
23 February 2016 - 1:43pm
Short Link:
Type:

Document Files

SPM15slides_Natural Sound Rendering for Headphones.pdf

(112)

Subscribe

[1] Kaushik Sunder, Ee-Leng Tan, "Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides)", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/167. Accessed: Dec. 12, 2019.
@article{167-15,
url = {http://sigport.org/167},
author = {Kaushik Sunder; Ee-Leng Tan },
publisher = {IEEE SigPort},
title = {Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides)},
year = {2015} }
TY - EJOUR
T1 - Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides)
AU - Kaushik Sunder; Ee-Leng Tan
PY - 2015
PB - IEEE SigPort
UR - http://sigport.org/167
ER -
Kaushik Sunder, Ee-Leng Tan. (2015). Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides). IEEE SigPort. http://sigport.org/167
Kaushik Sunder, Ee-Leng Tan, 2015. Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides). Available at: http://sigport.org/167.
Kaushik Sunder, Ee-Leng Tan. (2015). "Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides)." Web.
1. Kaushik Sunder, Ee-Leng Tan. Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides) [Internet]. IEEE SigPort; 2015. Available from : http://sigport.org/167

Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array


We propose an analytical method of 2.5-dimensional exterior sound field reproduction by using a multipole loudspeaker array. The method reproduces the sound field modeled by expansion coefficients of spherical harmonics based on multipole superposition. We also present an analytical method for converting the expansion coefficients of spherical harmonics to weighting coefficients for multipole superposition.

Paper Details

Authors:
Kenta Imaizumi, Kimitaka Tsuitsuimi, Atsushi Nakadaira, Yoichi Haneda
Submitted On:
18 October 2019 - 1:47am
Short Link:
Type:
Event:
Presenter's Name:
Document Year:
Cite

Document Files

WASPAA2019_Poster_Imaizumi.pdf

(33)

Subscribe

[1] Kenta Imaizumi, Kimitaka Tsuitsuimi, Atsushi Nakadaira, Yoichi Haneda, "Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4880. Accessed: Dec. 12, 2019.
@article{4880-19,
url = {http://sigport.org/4880},
author = {Kenta Imaizumi; Kimitaka Tsuitsuimi; Atsushi Nakadaira; Yoichi Haneda },
publisher = {IEEE SigPort},
title = {Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array},
year = {2019} }
TY - EJOUR
T1 - Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array
AU - Kenta Imaizumi; Kimitaka Tsuitsuimi; Atsushi Nakadaira; Yoichi Haneda
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4880
ER -
Kenta Imaizumi, Kimitaka Tsuitsuimi, Atsushi Nakadaira, Yoichi Haneda. (2019). Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array. IEEE SigPort. http://sigport.org/4880
Kenta Imaizumi, Kimitaka Tsuitsuimi, Atsushi Nakadaira, Yoichi Haneda, 2019. Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array. Available at: http://sigport.org/4880.
Kenta Imaizumi, Kimitaka Tsuitsuimi, Atsushi Nakadaira, Yoichi Haneda. (2019). "Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array." Web.
1. Kenta Imaizumi, Kimitaka Tsuitsuimi, Atsushi Nakadaira, Yoichi Haneda. Analytical Method of 2.5D Exterior Sound Field Synthesis by Using Multipole Loudspeaker Array [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4880

3D localized sound zone generation with a planar omni-directional loudspeaker array


This paper provides a 3D localized sound zone generation method using a planar omni-directional loudspeaker array. In the proposed method, multiple co-centered circular arrays are arranged on the horizontal plane and an additional loudspeaker is located at the array’s center. The sound field produced by this center loudspeaker is then cancelled using the multiple circular arrays. A localized 3D sound zone can thus be generated inside a sphere with a maximum radius of that of the circular arrays because the residual sound field is contained within the sphere.

Paper Details

Authors:
Submitted On:
17 October 2019 - 1:25am
Short Link:
Type:
Event:
Presenter's Name:
Document Year:
Cite

Document Files

WASPAA_2019_okamoto.pdf

(34)

Subscribe

[1] , "3D localized sound zone generation with a planar omni-directional loudspeaker array", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4877. Accessed: Dec. 12, 2019.
@article{4877-19,
url = {http://sigport.org/4877},
author = { },
publisher = {IEEE SigPort},
title = {3D localized sound zone generation with a planar omni-directional loudspeaker array},
year = {2019} }
TY - EJOUR
T1 - 3D localized sound zone generation with a planar omni-directional loudspeaker array
AU -
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4877
ER -
. (2019). 3D localized sound zone generation with a planar omni-directional loudspeaker array. IEEE SigPort. http://sigport.org/4877
, 2019. 3D localized sound zone generation with a planar omni-directional loudspeaker array. Available at: http://sigport.org/4877.
. (2019). "3D localized sound zone generation with a planar omni-directional loudspeaker array." Web.
1. . 3D localized sound zone generation with a planar omni-directional loudspeaker array [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4877

MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION


The use of spatial information with multiple microphones can improve far-field automatic speech recognition (ASR) accuracy. However, conventional microphone array techniques degrade speech enhancement performance when there is an array geometry mismatch between design and test conditions. Moreover, such speech enhancement techniques do not always yield ASR accuracy improvement due to the difference between speech enhancement and ASR optimization objectives.

Paper Details

Authors:
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister
Submitted On:
10 May 2019 - 6:38pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

poster file

(40)

manuscript file

(41)

Subscribe

[1] Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister, "MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4420. Accessed: Dec. 12, 2019.
@article{4420-19,
url = {http://sigport.org/4420},
author = {Shiva Sundaram; Nikko Strom; Bjorn Hoffmeister },
publisher = {IEEE SigPort},
title = {MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION},
year = {2019} }
TY - EJOUR
T1 - MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION
AU - Shiva Sundaram; Nikko Strom; Bjorn Hoffmeister
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4420
ER -
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister. (2019). MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION. IEEE SigPort. http://sigport.org/4420
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister, 2019. MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION. Available at: http://sigport.org/4420.
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister. (2019). "MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION." Web.
1. Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister. MULTI-GEOMETRY SPATIAL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4420

FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION


Conventional far-field automatic speech recognition (ASR) systems typically employ microphone array techniques for speech enhancement in order to improve robustness against noise or reverberation. However, such speech enhancement techniques do not always yield ASR accuracy improvement because the optimization criterion for speech enhancement is not directly relevant to the ASR objective. In this work, we develop new acoustic modeling techniques that optimize spatial filtering and long short-term memory (LSTM) layers from multi-channel (MC) input based on an ASR criterion directly.

Paper Details

Authors:
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister
Submitted On:
10 May 2019 - 6:36pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

poster file

(36)

manuscript file

(35)

Subscribe

[1] Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister, "FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4419. Accessed: Dec. 12, 2019.
@article{4419-19,
url = {http://sigport.org/4419},
author = {Shiva Sundaram; Nikko Strom; Bjorn Hoffmeister },
publisher = {IEEE SigPort},
title = {FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION},
year = {2019} }
TY - EJOUR
T1 - FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION
AU - Shiva Sundaram; Nikko Strom; Bjorn Hoffmeister
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4419
ER -
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister. (2019). FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION. IEEE SigPort. http://sigport.org/4419
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister, 2019. FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION. Available at: http://sigport.org/4419.
Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister. (2019). "FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION." Web.
1. Shiva Sundaram, Nikko Strom, Bjorn Hoffmeister. FREQUENCY DOMAIN MULTI-CHANNEL ACOUSTIC MODELING FOR DISTANT SPEECH RECOGNITION [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4419

Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays


Although 2.5D sound field synthesis with a circular loudspeaker array can be used in a 3D sound field, a 2D sound field, instead of a 3D sound field, is assumed for a sound field recording with a circular microphone array. This paper presents a horizontal 3D sound field recording and 2.5D synthesis method used in 3D sound fields with multiple co-centered omni-directional circular microphone arrays and a circular loudspeaker array without vertical derivative measurements.

Paper Details

Authors:
Takuma Okamoto
Submitted On:
10 May 2019 - 3:08am
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

icassp_2019_okamoto_2.pdf

(56)

Subscribe

[1] Takuma Okamoto, "Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4279. Accessed: Dec. 12, 2019.
@article{4279-19,
url = {http://sigport.org/4279},
author = {Takuma Okamoto },
publisher = {IEEE SigPort},
title = {Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays},
year = {2019} }
TY - EJOUR
T1 - Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays
AU - Takuma Okamoto
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4279
ER -
Takuma Okamoto. (2019). Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays. IEEE SigPort. http://sigport.org/4279
Takuma Okamoto, 2019. Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays. Available at: http://sigport.org/4279.
Takuma Okamoto. (2019). "Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays." Web.
1. Takuma Okamoto. Horizontal 3D sound field recording and 2.5D synthesis with omni-directional circular arrays [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4279

Multipath Enabled Private Audio with Noise


We address the problem of privately communicating audio messages to multiple listeners in a reverberant room using a set of loudspeakers. We propose two methods based on emitting noise. In the first method, the loudspeakers emit noise signals that are appropriately filtered so that after echoing along multiple paths in the room, they sum up and descramble to yield distinct meaningful audio messages only at specific focusing spots, while being incoherent everywhere else.

Paper Details

Authors:
Anadi Chaman, Yu-Jeh Liu, Jonah Casebeer, Ivan Dokmanić
Submitted On:
9 May 2019 - 11:13pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

icassp_poster_v3.pdf

(47)

Subscribe

[1] Anadi Chaman, Yu-Jeh Liu, Jonah Casebeer, Ivan Dokmanić, "Multipath Enabled Private Audio with Noise", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4261. Accessed: Dec. 12, 2019.
@article{4261-19,
url = {http://sigport.org/4261},
author = {Anadi Chaman; Yu-Jeh Liu; Jonah Casebeer; Ivan Dokmanić },
publisher = {IEEE SigPort},
title = {Multipath Enabled Private Audio with Noise},
year = {2019} }
TY - EJOUR
T1 - Multipath Enabled Private Audio with Noise
AU - Anadi Chaman; Yu-Jeh Liu; Jonah Casebeer; Ivan Dokmanić
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4261
ER -
Anadi Chaman, Yu-Jeh Liu, Jonah Casebeer, Ivan Dokmanić. (2019). Multipath Enabled Private Audio with Noise. IEEE SigPort. http://sigport.org/4261
Anadi Chaman, Yu-Jeh Liu, Jonah Casebeer, Ivan Dokmanić, 2019. Multipath Enabled Private Audio with Noise. Available at: http://sigport.org/4261.
Anadi Chaman, Yu-Jeh Liu, Jonah Casebeer, Ivan Dokmanić. (2019). "Multipath Enabled Private Audio with Noise." Web.
1. Anadi Chaman, Yu-Jeh Liu, Jonah Casebeer, Ivan Dokmanić. Multipath Enabled Private Audio with Noise [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4261

Binaural Beamforming Based on Automatic Interferer Selection


Binaural cues are important for sound localization. In addition, spatially separated sound sources are more intelligible than when they are co-located. Binaural cue preservation in multi-microphone hearing assistive devices is therefore important for the user's listening experience and safety.
A number of linearly-constrained-minimum-variance (LCMV) based methods

Paper Details

Authors:
Costas A. Kokke, Richard C. Hendriks, Andreas I. Koutrouvelis
Submitted On:
9 May 2019 - 4:48pm
Short Link:
Type:
Event:
Presenter's Name:
Paper Code:
Document Year:
Cite

Document Files

ICASSP_poster.pdf

(182)

Subscribe

[1] Costas A. Kokke, Richard C. Hendriks, Andreas I. Koutrouvelis, "Binaural Beamforming Based on Automatic Interferer Selection", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4240. Accessed: Dec. 12, 2019.
@article{4240-19,
url = {http://sigport.org/4240},
author = {Costas A. Kokke; Richard C. Hendriks; Andreas I. Koutrouvelis },
publisher = {IEEE SigPort},
title = {Binaural Beamforming Based on Automatic Interferer Selection},
year = {2019} }
TY - EJOUR
T1 - Binaural Beamforming Based on Automatic Interferer Selection
AU - Costas A. Kokke; Richard C. Hendriks; Andreas I. Koutrouvelis
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4240
ER -
Costas A. Kokke, Richard C. Hendriks, Andreas I. Koutrouvelis. (2019). Binaural Beamforming Based on Automatic Interferer Selection. IEEE SigPort. http://sigport.org/4240
Costas A. Kokke, Richard C. Hendriks, Andreas I. Koutrouvelis, 2019. Binaural Beamforming Based on Automatic Interferer Selection. Available at: http://sigport.org/4240.
Costas A. Kokke, Richard C. Hendriks, Andreas I. Koutrouvelis. (2019). "Binaural Beamforming Based on Automatic Interferer Selection." Web.
1. Costas A. Kokke, Richard C. Hendriks, Andreas I. Koutrouvelis. Binaural Beamforming Based on Automatic Interferer Selection [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4240

post

Paper Details

Authors:
Submitted On:
9 May 2019 - 1:21pm
Short Link:
Type:
Event:

Document Files

電子代轉_T03115216_20190502.pdf

(48)

Subscribe

[1] , "post", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4226. Accessed: Dec. 12, 2019.
@article{4226-19,
url = {http://sigport.org/4226},
author = { },
publisher = {IEEE SigPort},
title = {post},
year = {2019} }
TY - EJOUR
T1 - post
AU -
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4226
ER -
. (2019). post. IEEE SigPort. http://sigport.org/4226
, 2019. post. Available at: http://sigport.org/4226.
. (2019). "post." Web.
1. . post [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4226

A Subband Energy Modification Method for Elevation Control in Median Plane


Elevation perception is crucial for binaural reproduction. A recent study proposed an elevation control method by modifying the energy of HRTFs in each auditory scale subband, such as the ERB and Mel subband. However, this subband division is designed based on auditory excitation patterns and may not be consistent with the elevation localization cues. To this end, this study proposes a novel subband division strategy which emphasizes the physiological information involved in elevation localization based on a statistical analysis of the HRTF.

Paper Details

Authors:
Dingding Yao, Junfeng Li, Huaxing Xu, Risheng Xia, Yonghong Yan
Submitted On:
9 May 2019 - 2:43am
Short Link:
Type:
Event:
Presenter's Name:
Document Year:
Cite

Document Files

Icassp2019_poster_ydd_e1.pdf

(48)

Subscribe

[1] Dingding Yao, Junfeng Li, Huaxing Xu, Risheng Xia, Yonghong Yan, "A Subband Energy Modification Method for Elevation Control in Median Plane", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4156. Accessed: Dec. 12, 2019.
@article{4156-19,
url = {http://sigport.org/4156},
author = {Dingding Yao; Junfeng Li; Huaxing Xu; Risheng Xia; Yonghong Yan },
publisher = {IEEE SigPort},
title = {A Subband Energy Modification Method for Elevation Control in Median Plane},
year = {2019} }
TY - EJOUR
T1 - A Subband Energy Modification Method for Elevation Control in Median Plane
AU - Dingding Yao; Junfeng Li; Huaxing Xu; Risheng Xia; Yonghong Yan
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4156
ER -
Dingding Yao, Junfeng Li, Huaxing Xu, Risheng Xia, Yonghong Yan. (2019). A Subband Energy Modification Method for Elevation Control in Median Plane. IEEE SigPort. http://sigport.org/4156
Dingding Yao, Junfeng Li, Huaxing Xu, Risheng Xia, Yonghong Yan, 2019. A Subband Energy Modification Method for Elevation Control in Median Plane. Available at: http://sigport.org/4156.
Dingding Yao, Junfeng Li, Huaxing Xu, Risheng Xia, Yonghong Yan. (2019). "A Subband Energy Modification Method for Elevation Control in Median Plane." Web.
1. Dingding Yao, Junfeng Li, Huaxing Xu, Risheng Xia, Yonghong Yan. A Subband Energy Modification Method for Elevation Control in Median Plane [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4156

Pages