Audio and Acoustic Signal Processing

Towards Wireless Acoustic Sensor Networks for Location Estimation and Counting of Multiple Speakers in Real-life Conditions

Paper Details

Authors:
Anastasios Alexandridis, Nikolaos Stefanakis, Athanasios Mouchtaris
Submitted On:
2 March 2017 - 3:35pm

Document Files

ICASSP2017_Presentation.pdf

(122 downloads)

[1] Anastasios Alexandridis, Nikolaos Stefanakis, Athanasios Mouchtaris, "Towards Wireless Acoustic Sensor Networks for Location Estimation and Counting of Multiple Speakers in Real-life Conditions", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1594. Accessed: Dec. 15, 2017.

FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION

Paper Details

Submitted On:
2 March 2017 - 2:18pm

Document Files

ICASSP2017_poster.pdf

(107 downloads)

[1] "FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1593. Accessed: Dec. 15, 2017.

FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION

Paper Details

Submitted On:
2 March 2017 - 2:18pm

Document Files

ICASSP2017_poster.pdf

(103 downloads)

[1] "FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1592. Accessed: Dec. 15, 2017.

Faster-than-Nyquist Spatiotemporal Symbol-level Precoding in the Downlink of Multiuser MISO Channels

Paper Details

Authors:
Maha ALODEH, Danilo SPANO, Symeon CHATZINOTAS, Bjorn OTTERSTEN
Submitted On:
1 March 2017 - 7:55am

Document Files

Icassp_poster.pdf

(702 downloads)

[1] Maha ALODEH, Danilo SPANO, Symeon CHATZINOTAS, Bjorn OTTERSTEN, "Faster-than-Nyquist Spatiotemporal Symbol-level Precoding in the Downlink of Multiuser MISO Channels", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1550. Accessed: Dec. 15, 2017.

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification


This paper presents supervised feature learning approaches for speaker identification that rely on nonnegative matrix factorisation. Recent studies have shown that group nonnegative matrix factorisation and task-driven supervised dictionary learning can help perform effective feature learning for audio classification problems.

Paper Details

Authors:
Victor Bisot, Slim Essid, Gaël Richard
Submitted On:
1 March 2017 - 4:34am

Document Files

Slide for the presentation

(100 downloads)

[1] Victor Bisot, Slim Essid, Gaël Richard, "Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1539. Accessed: Dec. 15, 2017.

CODING OF FINE GRANULAR AUDIO SIGNALS USING HIGH RESOLUTION ENVELOPE PROCESSING (HREP)


High Resolution Envelope Processing (HREP) is a new tool for improved perceptual coding of audio signals that predominantly consist of many dense transient events, such as applause or raindrop sounds. Such signals have traditionally been very difficult for perceptual audio codecs to code, particularly at low bit rates. Based on the gain control principle, HREP acts as a pre-/post-processor pair to perceptual audio codecs and preserves the temporal fine structure and subjective quality of applause-like signals.

Paper Details

Authors:
Florin Ghido, Sascha Disch, Jürgen Herre, Franz Reutelhuber, Alexander Adami
Submitted On:
1 March 2017 - 4:15am

Document Files

ICASSP 2017 HREP Poster

(105 downloads)

[1] Florin Ghido, Sascha Disch, Jürgen Herre, Franz Reutelhuber, Alexander Adami, "CODING OF FINE GRANULAR AUDIO SIGNALS USING HIGH RESOLUTION ENVELOPE PROCESSING (HREP)", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1538. Accessed: Dec. 15, 2017.

MULTILAYER SENSOR NETWORK FOR INFORMATION PRIVACY


A sensor network wishes to transmit information to a fusion center so that the fusion center can detect a public hypothesis while being prevented from inferring a private hypothesis. We propose a multilayer sensor network structure in which each sensor first applies a nonlinear fusion function to the information it receives from sensors in the previous layer, and then applies a linear weighting matrix to distort the information it sends to sensors in the next layer.

Paper Details

Authors:
Xin He, Wee Peng Tay
Submitted On:
1 March 2017 - 1:57am

Document Files

ICASSP17_xin.pdf

(120 downloads)

[1] Xin He, Wee Peng Tay, "MULTILAYER SENSOR NETWORK FOR INFORMATION PRIVACY", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1521. Accessed: Dec. 15, 2017.

BIOLOGICALLY INSPIRED SPEECH EMOTION RECOGNITION


Conventional feature-based classification methods do not apply well to automatic recognition of speech emotions, mostly because the precise set of spectral and prosodic features that is required to identify the emotional state of a speaker has not been determined yet. This paper presents a method that operates directly on the speech signal, thus avoiding the problematic step of feature extraction.

Paper Details

Submitted On:
16 March 2017 - 10:05am

Document Files

ICASSP2017_Lotfidereshgi (poster) V2.pdf

(86 downloads)

[1] "BIOLOGICALLY INSPIRED SPEECH EMOTION RECOGNITION", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1516. Accessed: Dec. 15, 2017.

ROBUST AUTOMATIC RECOGNITION OF SPEECH WITH BACKGROUND MUSIC


This paper addresses the task of Automatic Speech Recognition (ASR) with music in the background, where recognition accuracy may deteriorate significantly. To improve the robustness of ASR in this task, e.g. for broadcast news transcription or subtitle creation, we adopt two approaches: 1) multi-condition training of the acoustic models and 2) denoising autoencoders followed by acoustic model training on the preprocessed data. In the latter case, two types of autoencoders are considered: fully connected and convolutional.

Paper Details

Authors:
Jiri Malek, Jindrich Zdansky, Petr Cerva
Submitted On:
28 February 2017 - 9:22am

Document Files

posterICASSP2017_MalekZdanskyCerva.pdf

(117 downloads)

[1] Jiri Malek, Jindrich Zdansky, Petr Cerva, "ROBUST AUTOMATIC RECOGNITION OF SPEECH WITH BACKGROUND MUSIC", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1511. Accessed: Dec. 15, 2017.

Deductive Refinement of Species Labelling in Weakly Labelled Birdsong Recordings


Many approaches to classifying bird species from their sound provide labels for a recording as a whole. However, a more precise classification of each bird vocalization would be of great importance to the use and management of sound archives and to bird monitoring. In this work, we introduce a two-step technique that first automatically detects all bird vocalizations and then classifies them using ‘weakly’ labelled recordings.

Paper Details

Authors:
Veronica Morfi, Dan Stowell
Submitted On:
28 February 2017 - 7:05am

Document Files

main.pdf

(320 downloads)

[1] Veronica Morfi, Dan Stowell, "Deductive Refinement of Species Labelling in Weakly Labelled Birdsong Recordings", IEEE SigPort, 2017. [Online]. Available: http://sigport.org/1505. Accessed: Dec. 15, 2017.
