Audio and Acoustic Signal Processing

Towards Wireless Acoustic Sensor Networks for Location Estimation and Counting of Multiple Speakers in Real-life Conditions

ICASSP2017_Presentation.pdf (282)

Categories: Audio and Acoustic Signal Processing

7 Views

FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION

ICASSP2017_poster.pdf (587)

Categories: Audio and Acoustic Signal Processing

7 Views

FULLY COMPLEX DEEP NEURAL NETWORK FOR PHASE-INCORPORATING MONAURAL SOURCE SEPARATION

ICASSP2017_poster.pdf (260)

Categories: Audio and Acoustic Signal Processing

10 Views

Faster-than-Nyquist Spatiotemporal Symbol-level Precoding in the Downlink of Multiuser MISO Channels

Icassp_poster.pdf (2052)

Categories: Audio and Acoustic Signal Processing

13 Views

Supervised group nonnegative matrix factorisation with similarity constraints and applications to speaker identification

This paper presents supervised feature learning approaches for speaker identification that rely on nonnegative matrix factorisation. Recent studies have shown that group nonnegative matrix factorisation and task-driven supervised dictionary learning can help perform effective feature learning for audio classification problems.

ICASSP2017_rserizel.pdf

Slide for the presentation (256)

Categories: Audio and Acoustic Signal Processing

4 Views

CODING OF FINE GRANULAR AUDIO SIGNALS USING HIGH RESOLUTION ENVELOPE PROCESSING (HREP)

High Resolution Envelope Processing (HREP) is a new tool for improved perceptual coding of audio signals that predominantly consist of many dense transient events, such as applause and raindrop sounds. These signals have traditionally been very difficult for perceptual audio codecs to code, particularly at low bit rates. Based on the gain control principle, HREP acts as a pre-/post-processor pair to perceptual audio codecs and preserves the temporal fine structure and subjective quality of applause-like signals.
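
The gain control principle behind a pre-/post-processor pair can be illustrated with a toy example. The sketch below is not the HREP algorithm: the block size, the square-root flattening rule, and the function names `block_gains` and `apply_gains` are all assumptions made for illustration, and the codec between the two stages is left out. It only shows that a pre-processor can flatten the temporal envelope with per-block gains and that a post-processor can undo this when it knows the same gains.

```python
import math

def block_gains(signal, block=4, eps=1e-9):
    """Return one gain per block (hypothetical rule, not taken from HREP).

    Blocks louder than the overall RMS level get a gain below 1,
    quieter blocks get a gain above 1, which flattens the envelope.
    """
    ref = math.sqrt(sum(x * x for x in signal) / len(signal)) + eps
    gains = []
    for start in range(0, len(signal), block):
        chunk = signal[start:start + block]
        rms = math.sqrt(sum(x * x for x in chunk) / len(chunk)) + eps
        gains.append(math.sqrt(ref / rms))  # square root: partial flattening only
    return gains

def apply_gains(signal, gains, block=4, invert=False):
    """Multiply each sample by its block gain, or divide when invert=True."""
    out = []
    for i, x in enumerate(signal):
        g = gains[i // block]
        out.append(x / g if invert else x * g)
    return out

# A quiet block, a loud transient block, then another quiet block.
signal = [0.0, 0.1, -0.1, 0.05, 1.0, -0.9, 0.8, -0.7, 0.05, 0.0, -0.05, 0.1]

gains = block_gains(signal)                            # side information
flattened = apply_gains(signal, gains)                 # pre-processor output
restored = apply_gains(flattened, gains, invert=True)  # post-processor output
```

In this toy setting `restored` matches `signal` up to floating-point error because nothing lossy happens between the two stages. With a real codec in between, the round trip would only be approximate.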

201702_AudioLabs_IIS_Kongressplakat_Disch_A0_web.pdf

ICASSP 2017 HREP Poster (435)

Categories: Audio and Acoustic Signal Processing

63 Views

MULTILAYER SENSOR NETWORK FOR INFORMATION PRIVACY

A sensor network wishes to transmit information to a fusion center to allow it to detect a public hypothesis, but at the same time prevent it from inferring a private hypothesis. We propose a multilayer sensor network structure, where each sensor first applies a nonlinear fusion function to the information it receives from sensors in a previous layer, and then a linear weighting matrix to distort the information it sends to sensors in the next layer.
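
The per-layer data flow described above (nonlinear fusion of what a sensor receives, then linear weighting of what it sends) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' design: the `tanh`-of-sum fusion function, the scalar messages, the single weight matrix per layer, and the names `fuse` and `layer_forward` are all hypothetical, and the weights are fixed by hand instead of being optimised for privacy.

```python
import math

def fuse(received):
    """Hypothetical nonlinear fusion function: squash the sum of the inputs."""
    return math.tanh(sum(received))

def layer_forward(received_per_sensor, weights):
    """Propagate one layer.

    received_per_sensor[j] is the list of values sensor j received from
    the previous layer. weights[j][k] is the (assumed scalar) weight
    sensor j applies to the message it sends to sensor k in the next layer.
    Returns, for each next-layer sensor k, the list of values it receives.
    """
    fused = [fuse(r) for r in received_per_sensor]
    n_next = len(weights[0])
    return [
        [weights[j][k] * fused[j] for j in range(len(fused))]
        for k in range(n_next)
    ]

# Two sensors in the current layer, two sensors in the next layer.
first_layer_inputs = [[0.5, -0.2], [1.0]]
weights = [[1.0, 0.5],
           [-1.0, 2.0]]

next_inputs = layer_forward(first_layer_inputs, weights)
```

Stacking several calls to `layer_forward`, each with its own weight matrix, gives the multilayer structure; the output of the last layer is what the fusion center would see.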

ICASSP17_xin.pdf (303)

Categories: Audio and Acoustic Signal Processing

9 Views

BIOLOGICALLY INSPIRED SPEECH EMOTION RECOGNITION

Conventional feature-based classification methods do not apply well to automatic recognition of speech emotions, mostly because the precise set of spectral and prosodic features that is required to identify the emotional state of a speaker has not been determined yet. This paper presents a method that operates directly on the speech signal, thus avoiding the problematic step of feature extraction.

ICASSP2017_Lotfidereshgi (poster) V2.pdf (418)

Categories: Audio and Acoustic Signal Processing

44 Views

ROBUST AUTOMATIC RECOGNITION OF SPEECH WITH BACKGROUND MUSIC

This paper addresses the task of Automatic Speech Recognition (ASR) with music in the background, where the accuracy of recognition may deteriorate significantly. To improve the robustness of ASR in this task, e.g. for broadcast news transcription or subtitle creation, we adopt two approaches: 1) multi-condition training of the acoustic models and 2) denoising autoencoders followed by acoustic model training on the preprocessed data. In the latter case, two types of autoencoders are considered: the fully connected and the convolutional network.

posterICASSP2017_MalekZdanskyCerva.pdf (356)

Categories: Audio and Acoustic Signal Processing

12 Views

Deductive Refinement of Species Labelling in Weakly Labelled Birdsong Recordings

Many approaches to classifying bird species from their sound have been used to provide labels for the whole of a recording. However, a more precise classification of each bird vocalization would be of great importance to the use and management of sound archives and bird monitoring. In this work, we introduce a two-step technique that first automatically detects all bird vocalizations and then, with the use of ‘weakly’ labelled recordings, classifies them.

main.pdf (698)

Categories: Audio and Acoustic Signal Processing

10 Views