Audio and Acoustic Signal Processing

MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK

Read more about MULTI-SCALE OBJECT DETECTION WITH FEATURE FUSION AND REGION OBJECTNESS NETWORK
Log in to post comments

WenjieGuan-3304-2018_ICASSP_POSTER.pdf

WenjieGuan-3304-2018_ICASSP_POSTER.pdf (619)

Categories:: Audio and Acoustic Signal Processing

9 Views

Whole Sentence Neural Language Model

Read more about Whole Sentence Neural Language Model
Log in to post comments

Recurrent neural networks have become increasingly popular for the task of language modeling achieving impressive gains in state-of-the-art speech recognition and natural language processing (NLP) tasks. Recurrent models exploit word dependencies over a much longer context window (as retained by the history states) than what is feasible with n-gram language models.

whole-sent-v3.pdf

whole sentence neural language model (709)

Categories:: Audio and Acoustic Signal Processing

100 Views

Signboard Saliency Detection in Street Videos

Read more about Signboard Saliency Detection in Street Videos
Log in to post comments

ICASSP_onkar.pdf

ICASSP_onkar.pdf (553)

Categories:: Audio and Acoustic Signal Processing

5 Views

Acoustic Reflector Localization and Classification

Read more about Acoustic Reflector Localization and Classification
Log in to post comments

The process of understanding acoustic properties of environments is important for several applications, such as spatial audio, augmented reality and source separation. In this paper, multichannel room impulse responses are recorded and transformed into their direction of arrival (DOA)-time domain, by employing a superdirective beamformer. This domain can be represented as a 2D image. Hence, a novel image processing method is proposed to analyze the DOA-time domain, and estimate the reflection times of arrival and DOAs. The main acoustically reflective objects are then localized.

Remaggietal_ICASSP2018.pdf

Remaggietal_ICASSP2018.pdf (464)

Categories:: Audio and Acoustic Signal Processing

8 Views

CLASSIFICATION OF CORALS IN REFLECTANCE AND FLUORESCENCE IMAGES USING CONVOLUTIONAL NEURAL NETWORK REPRESENTATIONS

Coral species, with complex morphology and ambiguous boundaries, pose a great challenge for automated classification. CNN activations, which are extracted from fully connected layers of deep networks (FC features), have been successfully used as powerful universal representations in many visual tasks. In this paper, we investigate the transferability and combined performance of FC features and CONV features (extracted

ICASSP2018 Poster.pdf

ICASSP2018 Poster.pdf (407)

Categories:: Audio and Acoustic Signal Processing

44 Views

Determined Blind Source Separation via Proximal Splitting Algorithm

Read more about Determined Blind Source Separation via Proximal Splitting Algorithm
Log in to post comments

The state-of-the-art algorithms of determined blind source separation (BSS) methods based on the independent component analysis

2018.04.20_ICASSPポスター_PDS_ICA.pdf

2018.04.20_ICASSPポスター_PDS_ICA.pdf (401)

Categories:: Audio and Acoustic Signal Processing

35 Views

Phase Corrected Total Variation for Audio Signals

Read more about Phase Corrected Total Variation for Audio Signals
Log in to post comments

In optimization-based signal processing, the so-called prior term models the desired signal, and therefore its design is the key factor to achieve a good performance. For audio signals, the time-directional total variation applied to a spectrogram in combination with phase correction has been proposed recently to model sinusoidal components of the signal. Although it is a promising prior, its applicability might be restricted to some extent because of the mismatch of the assumption to the signal.

2018.04.20_ICASSPポスター_iPC_TV.pdf

2018.04.20_ICASSPポスター_iPC_TV.pdf (430)

Categories:: Audio and Acoustic Signal Processing

54 Views

A NOVEL LSTM-BASED SPEECH PREPROCESSOR FOR SPEAKER DIARIZATION IN REALISTIC MISMATCH CONDITIONS

icassp_poster.pdf

icassp_poster.pdf (656)

Categories:: Audio and Acoustic Signal Processing

9 Views

On Sequential Random Distortion Testing of Non-Stationary Processes

Read more about On Sequential Random Distortion Testing of Non-Stationary Processes
Log in to post comments

Random distortion testing (RDT) addresses the problem of testing whether or not a random signal deviates by more than a specified tolerance from a fixed value. The test is non-parametric in the sense that the distribution of the signal under each hypothesis is assumed to be unknown. The signal is observed in independent and identically distributed (i.i.d) additive noise. The need to control the probabilities of false alarm and missed de- tection while reducing the number of samples required to make a decision leads to the SeqRDT approach.

ICASSP18_Slides.pdf

ICASSP18_Slides.pdf (445)

Categories:: Audio and Acoustic Signal Processing

9 Views

AUTOMATIC TEMPORAL SEGMENTATION OF HAND MOVEMENTS FOR HAND POSITIONS RECOGNITION IN FRENCH CUED SPEECH

In the context of Cued Speech (CS) recognition, the recognition
of lips and hand movements is a key task. As we know, a good
temporal segmentation is necessary for the supervised recog-
nition system. However, lips and hand streams cannot share
the same temporal segmentation since they are not synchro-
nized. In this work, we propose a hand preceding model to
predict temporal segmentations of hand movements automati-
cally by exploring the relationship between hand preceding time

Poster_ICASSP_36032018.pdf

Poster_ICASSP_36032018.pdf (495)

Categories:: Audio and Acoustic Signal Processing

33 Views

Audio and Acoustic Signal Processing

Pages