ICASSP 2018

ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics, and will provide a fantastic opportunity to network with like-minded professionals from around the world.

AN ATTENUATION ADAPTED PULSE COMPRESSION TECHNIQUE TO ENHANCE THE BANDWIDTH AND THE RESOLUTION USING ULTRAFAST ULTRASOUND IMAGING

Recent studies suggest that Resolution Enhancement Compression (REC), which employs frequency- and amplitude-modulated transmitted signals, can significantly improve imaging quality over Classical Pulsed (CP) ultrasonic imaging techniques. However, the performance of coded excitation methods degrades drastically deeper into the tissue, where attenuation effects become more significant. In this work, a technique (REC-Opt) that overcomes the effects of attenuation on REC imaging is proposed.

PosterICASSP_Benane.pdf (551)

Categories: Medical imaging

12 Views

MULTISTREAM DIARIZATION FUSION USING THE MINIMUM VARIANCE BAYESIAN INFORMATION CRITERION

Poster_ICASSP_2018.pdf (508)

Categories: Speaker Recognition and Characterization (SPE-SPKR)

6 Views

Distributed Model Construction in Radio Interferometric Calibration

lofar74.pdf

poster (425)

Categories: Sensor Array Processing

7 Views

Advancing Acoustic-to-Word CTC Model

The acoustic-to-word model based on the connectionist temporal classification (CTC) criterion has been shown to be a natural end-to-end (E2E) model that directly targets words as output units. However, the word-based CTC model suffers from the out-of-vocabulary (OOV) issue: it can model only a limited number of words in the output layer and maps all remaining words to an OOV output node. Hence, such a word-based CTC model can recognize only the frequent words modeled by the network output nodes.
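
The OOV limitation described above can be illustrated with a minimal sketch of how word-level targets might be prepared. The function names, the `<OOV>` token string, and the toy transcripts are assumptions made for illustration; the abstract does not describe the authors' actual data pipeline.

```python
from collections import Counter


def build_vocab(transcripts, max_words):
    """Keep only the max_words most frequent words (hypothetical helper)."""
    counts = Counter(word for line in transcripts for word in line.split())
    return {word for word, _ in counts.most_common(max_words)}


def to_targets(transcript, vocab, oov_token="<OOV>"):
    """Map every word outside the vocabulary to a single OOV output unit."""
    return [w if w in vocab else oov_token for w in transcript.split()]


# Toy training transcripts (made up for this example).
transcripts = ["play the song", "play the next song", "pause the podcast"]
vocab = build_vocab(transcripts, max_words=3)

# "podcast" is infrequent, so it collapses into the OOV unit.
print(to_targets("play the podcast", vocab))
```

Because every infrequent word shares one output unit, a model trained on such targets has no way to tell those words apart at recognition time, which is the issue the abstract refers to.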

AdvanceCTC_poster.pdf (556)

Categories: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

9 Views

DEVELOPING FAR-FIELD SPEAKER SYSTEM VIA TEACHER-STUDENT LEARNING

In this study, we develop the keyword spotting (KWS) and acoustic model (AM) components in a far-field speaker system. Specifically, we use teacher-student (T/S) learning to adapt a well-trained close-talk production AM to far-field conditions using parallel close-talk and simulated far-field data. We also use T/S learning to compress a large KWS model into a small one that fits the device's computational budget. Because it requires no transcription, T/S learning can exploit untranscribed data to boost model performance in both AM adaptation and KWS model compression.
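
A minimal sketch of why T/S learning needs no transcripts: in a common formulation, the student is trained to match the teacher's output distribution frame by frame, so the teacher's posteriors act as soft labels. The abstract does not state the exact loss used, so the cross-entropy form below, the function names, and the toy logits are all assumptions.

```python
import math


def softmax(logits):
    """Numerically stable softmax over one frame of logits."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ts_loss(teacher_logits, student_logits):
    """Frame-averaged cross-entropy between teacher soft labels and student outputs.

    Assumed setup: the teacher sees close-talk frames and the student sees the
    parallel far-field frames. No transcription enters the computation.
    """
    loss = 0.0
    for t_frame, s_frame in zip(teacher_logits, student_logits):
        p_teacher = softmax(t_frame)
        p_student = softmax(s_frame)
        loss -= sum(pt * math.log(ps) for pt, ps in zip(p_teacher, p_student))
    return loss / len(teacher_logits)


# Toy logits for a single frame with three output classes (made up).
teacher = [[2.0, 0.5, -1.0]]
untrained_student = [[0.0, 0.0, 0.0]]

# The loss is smallest when the student reproduces the teacher's distribution.
print(ts_loss(teacher, teacher) < ts_loss(teacher, untrained_student))
```

The same loss shape applies to both uses mentioned in the abstract: for adaptation the two models see different inputs (close-talk versus far-field), while for compression they would see the same input but differ in size.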

speaker_poster.pdf (484)

Categories: Acoustic Modeling for Automatic Speech Recognition (SPE-RECO)

6 Views

Exploring CTC-network derived features with conventional hybrid system

icassp2018.pdf (750)

Categories: Audio Processing Systems

116 Views

EVALUATING MULTIEXPOSURE FUSION USING IMAGE INFORMATION

mefa0.pdf (553)

Categories: Image Formation

14 Views

CONTENT-BASED REPRESENTATIONS OF AUDIO USING SIAMESE NEURAL NETWORKS

In this paper, we focus on the problem of content-based retrieval for audio, which aims to retrieve all semantically similar audio recordings for a given audio clip query. This problem is similar to query by example for audio, which aims to retrieve media samples from a database that are similar to a user-provided example. We propose a novel approach that encodes the audio into a vector representation using Siamese Neural Networks. The goal is to obtain an encoding similar for files belonging to the same audio

ICASSP2018_Pranay.pdf (543)

Categories: Content-Based Audio Processing

4 Views

Document Quality Estimation using Spatial Frequency Response

Current Document Image Quality Assessment (DIQA) algorithms directly relate Optical Character Recognition (OCR) accuracy to the quality of the document in order to build supervised learning frameworks. This direct correlation has two major limitations: (a) OCR may be affected by factors independent of the quality of the capture, and (b) it cannot account for blur variations within an image. An alternative is to quantify the quality of capture using human judgement; however, this is subjective and prone to error.