
Spatial and Multichannel Audio

Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides)


With the strong growth of assistive and personal listening devices, natural sound rendering over headphones is becoming a necessity for prolonged listening in multimedia and virtual reality applications. Natural sound rendering aims to recreate sound scenes with spatial and timbral quality as close to natural as possible, so as to achieve a truly immersive listening experience. However, rendering natural sound over headphones poses many challenges. This tutorial article presents signal processing techniques that tackle these challenges to assist human listening.

Paper Details

Authors:
Kaushik Sunder, Ee-Leng Tan
Submitted On:
23 February 2016 - 1:43pm

Document Files

SPM15slides_Natural Sound Rendering for Headphones.pdf

(359 downloads)


[1] Kaushik Sunder, Ee-Leng Tan, "Natural Sound Rendering for Headphones: Integration of signal processing techniques (slides)", IEEE SigPort, 2015. [Online]. Available: http://sigport.org/167. Accessed: Apr. 27, 2017.

Time of Arrival Disambiguation Using the Linear Radon Transform


Echo labeling, the challenging task of assigning acoustic reflections to image sources, is equivalent to the highly important disambiguation task in room geometry inference. A method using the Radon transform, an image processing tool, is proposed to address this challenge. The method relies on acoustic wavefront detection in room impulse response stacks, obtained with a uniform linear array of loudspeakers and one microphone. We show in our experiments that the proposed method can both label and detect echoes.
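
As a rough illustration of the idea of detecting wavefronts in an impulse response stack, the sketch below applies a discrete linear Radon transform (slant stack) to a synthetic stack. It is a minimal sketch and not the authors' implementation: the function names, the integer-sample slopes, and the assumption that a wavefront is approximately a straight line across the channels are all illustrative choices made here.

```python
def linear_radon(stack, slopes):
    """Slant-stack equal-length channels along lines t = tau + p * m.

    stack  : list of channels (one per loudspeaker), each a list of samples
    slopes : candidate slopes p, in samples per channel (assumed parameterisation)
    Returns a grid indexed as out[slope_index][tau].
    """
    n_ch, n_t = len(stack), len(stack[0])
    out = []
    for p in slopes:
        row = []
        for tau in range(n_t):
            acc = 0.0
            for m in range(n_ch):
                t = tau + round(p * m)
                if 0 <= t < n_t:  # skip samples that fall outside the record
                    acc += stack[m][t]
            row.append(acc)
        out.append(row)
    return out


def synthetic_stack(n_ch, n_t, tau0, slope):
    """Hypothetical test data: one unit-amplitude echo per channel on a straight line."""
    stack = [[0.0] * n_t for _ in range(n_ch)]
    for m in range(n_ch):
        stack[m][tau0 + slope * m] = 1.0
    return stack


def peak(grid):
    """Return (row, column) of the largest entry in a 2-D list."""
    _, i, j = max((v, i, j) for i, row in enumerate(grid) for j, v in enumerate(row))
    return i, j


slopes = list(range(-3, 4))
stack = synthetic_stack(n_ch=8, n_t=64, tau0=10, slope=2)
radon = linear_radon(stack, slopes)
p_idx, tau_idx = peak(radon)
print(slopes[p_idx], tau_idx)  # prints "2 10"
```

Each echo that lines up across the channels collapses to a single peak in the (slope, intercept) plane, which is what makes grouping arrivals by wavefront possible in this toy setting.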

Paper Details

Authors:
Youssef El Baba, Andreas Walther, Emanuël Habets
Submitted On:
5 April 2017 - 11:44am

Document Files

LRT-SL talk.pdf

(16 downloads)


[1] Youssef El Baba, Andreas Walther, Emanuël Habets, "Time of Arrival Disambiguation Using the Linear Radon Transform", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1790. Accessed: Apr. 27, 2017.

Active Speech Control using Wave-Domain Processing with a Linear Wall of Dipole Secondary Sources


In this paper, we investigate the effects of compensating for wave-domain filtering delay in an active speech control system. An active control system utilising wave-domain processed basis functions is evaluated for a linear array of dipole secondary sources. The target control soundfield is matched in a least squares sense using orthogonal wavefields to a predicted future target soundfield. Filtering is implemented using a block-based short-time signal processing approach which induces an inherent delay.

Paper Details

Authors:
Jacob Donley, Christian Ritz, W. Bastiaan Kleijn
Submitted On:
12 March 2017 - 1:34am

Document Files

High Quality Poster PDF presented at ICASSP 2017

(15 downloads)


[1] Jacob Donley, Christian Ritz, W. Bastiaan Kleijn, "Active Speech Control using Wave-Domain Processing with a Linear Wall of Dipole Secondary Sources", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1746. Accessed: Apr. 27, 2017.

EVALUATION OF A COMPLEMENTARY HEARING AID FOR SPATIAL SOUND SEGREGATION

Paper Details

Authors:
Luca Giuliani, Luca Brayda, Sara Sansalone, Stefania Repetto, Michele Ricchetti
Submitted On:
8 March 2017 - 7:27pm

Document Files

EVALUATION OF A COMPLEMENTARY HEARING AID FOR SPATIAL SOUND.pptx

(16 downloads)


[1] Luca Giuliani, Luca Brayda, Sara Sansalone, Stefania Repetto, Michele Ricchetti, "EVALUATION OF A COMPLEMENTARY HEARING AID FOR SPATIAL SOUND SEGREGATION", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1712. Accessed: Apr. 27, 2017.

Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network

Paper Details

Authors:
Pasi Pertilä, Tuomas Virtanen
Submitted On:
4 March 2017 - 8:59am

Document Files

icassp-2017-poster.pdf

(32 downloads)


[1] Pasi Pertilä, Tuomas Virtanen, "Sound Event Detection Using Spatial Features and Convolutional Recurrent Neural Network", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1624. Accessed: Apr. 27, 2017.

DNN-based Speech Mask Estimation for Eigenvector Beamforming

Paper Details

Authors:
Submitted On:
2 March 2017 - 7:40am

Document Files

Lecture Talk

(34 downloads)


[1] "DNN-based Speech Mask Estimation for Eigenvector Beamforming", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1583. Accessed: Apr. 27, 2017.

Channel estimation for crosstalk cancellation in wireless acoustic networks


In this paper we deal with the estimation of the room impulse response (RIR) between each loudspeaker and each microphone of a wireless acoustic network of two nodes when used to implement a crosstalk canceller. The nodes of the network are commercial devices connected via standard wireless links, presenting low computational requirements and non-ideal synchronization between them. Moreover, the nodes can exchange information, but they cannot share their signals due to the high throughput and perfect synchronism that would be required.
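
The abstract does not state which estimator is used, so the following is only a generic sketch of identifying an impulse response from a known excitation and the recorded microphone signal. It swaps in a normalised LMS (NLMS) adaptive filter, a standard system-identification technique, and assumes a noise-free, perfectly synchronised recording, which is exactly the idealisation the paper says does not hold in practice. The names `nlms_identify`, `convolve` and `true_rir` are hypothetical.

```python
import random


def nlms_identify(x, y, n_taps, mu=0.5, eps=1e-8):
    """Estimate an FIR response h such that y[n] ~ sum_k h[k] * x[n - k] using NLMS."""
    h = [0.0] * n_taps
    buf = [0.0] * n_taps  # most recent input sample first
    for xn, yn in zip(x, y):
        buf = [xn] + buf[:-1]
        y_hat = sum(hk * bk for hk, bk in zip(h, buf))
        err = yn - y_hat
        norm = sum(bk * bk for bk in buf) + eps  # eps avoids division by zero
        h = [hk + mu * err * bk / norm for hk, bk in zip(h, buf)]
    return h


def convolve(x, h):
    """Causal FIR filtering of x by h, truncated to len(x) samples."""
    return [
        sum(h[k] * x[n - k] for k in range(len(h)) if n - k >= 0)
        for n in range(len(x))
    ]


rng = random.Random(0)                  # fixed seed for repeatability
true_rir = [1.0, 0.5, -0.3, 0.1]        # toy "room" response (assumption)
x = [rng.gauss(0.0, 1.0) for _ in range(2000)]  # white excitation signal
y = convolve(x, true_rir)               # idealised microphone signal
est = nlms_identify(x, y, len(true_rir))
print([round(v, 3) for v in est])
```

With clock offsets or drift between nodes, the recorded signal would no longer be a fixed convolution of the excitation, which is why the non-ideal synchronisation mentioned above matters for any estimator of this kind.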

Paper Details

Authors:
G. Pinero, P.A. Naylor
Submitted On:
2 March 2017 - 6:32am

Document Files

Channel estimation in WASNs

(19 downloads)


[1] G. Pinero, P.A. Naylor, "Channel estimation for crosstalk cancellation in wireless acoustic networks", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1581. Accessed: Apr. 27, 2017.

INTERAURAL TIME DELAY PERSONALISATION USING INCOMPLETE HEAD SCANS


When using a set of generic head-related transfer functions (HRTFs) for spatial sound rendering, personalisation can be considered to minimise localisation errors. This typically involves tuning the characteristics of the HRTFs or a parametric model according to the listener’s anthropometry. However, measuring anthropometric features directly remains a challenge in practical applications, and the mapping between anthropometric and acoustic features is an open research problem.
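
To make the link between anthropometry and an acoustic cue concrete, the sketch below uses the classical spherical-head (Woodworth) approximation, in which the interaural time delay for a distant source depends only on head radius and azimuth. This is a textbook model chosen here for illustration, not the method of the paper; the default head radius of 0.0875 m and speed of sound of 343 m/s are assumed values.

```python
import math


def woodworth_itd(azimuth_rad, head_radius_m=0.0875, speed_of_sound=343.0):
    """Spherical-head ITD in seconds: (a / c) * (theta + sin(theta)).

    Valid for a far-field source at azimuth theta in [0, pi/2], measured
    from straight ahead towards one ear.
    """
    if not 0.0 <= azimuth_rad <= math.pi / 2:
        raise ValueError("azimuth must lie in [0, pi/2] radians")
    return head_radius_m / speed_of_sound * (azimuth_rad + math.sin(azimuth_rad))


# ITD for a source directly to the side, for two hypothetical head radii.
for radius in (0.080, 0.095):
    itd_us = woodworth_itd(math.pi / 2, head_radius_m=radius) * 1e6
    print(f"radius {radius:.3f} m -> ITD {itd_us:.0f} microseconds")
```

In this model a larger head radius scales the delay proportionally, which gives a sense of why a listener-specific size estimate, for example from a head scan, could reduce localisation errors.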

Paper Details

Authors:
Hannes Gamper, David Johnston, Ivan J. Tashev
Submitted On:
1 March 2017 - 12:09pm

Document Files

Poster: ITD personalisation using incomplete head scans

(30 downloads)


[1] Hannes Gamper, David Johnston, Ivan J. Tashev, "INTERAURAL TIME DELAY PERSONALISATION USING INCOMPLETE HEAD SCANS", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1450. Accessed: Apr. 27, 2017.

FREQUENCY-BASED CUSTOMIZATION OF MULTIZONE SOUND SYSTEM DESIGN

Paper Details

Authors:
Bhaskar D Rao
Submitted On:
25 March 2016 - 11:53am

Document Files

Nasim Radmanesh_ICASSP2016.pdf

(110 downloads)


[1] Bhaskar D Rao, "FREQUENCY-BASED CUSTOMIZATION OF MULTIZONE SOUND SYSTEM DESIGN", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1047. Accessed: Apr. 27, 2017.

FREQUENCY-BASED CUSTOMIZATION OF MULTIZONE SOUND SYSTEM DESIGN

Paper Details

Authors:
Bhaskar D Rao
Submitted On:
25 March 2016 - 11:45am

Document Files

Nasim Radmanesh_ICASSP2016.pdf

(109 downloads)


[1] Bhaskar D Rao, "FREQUENCY-BASED CUSTOMIZATION OF MULTIZONE SOUND SYSTEM DESIGN", IEEE SigPort, 2016. [Online]. Available: http://sigport.org/1046. Accessed: Apr. 27, 2017.
