GlobalSIP 2017

The fifth IEEE Global Conference on Signal and Information Processing (GlobalSIP) will be held in Montreal, Quebec, Canada on November 14-16, 2017. GlobalSIP is a flagship IEEE Signal Processing Society conference. It focuses on signal and information processing with an emphasis on up-and-coming signal processing themes. The conference features world-class plenary speeches, distinguished Symposium talks, tutorials, exhibits, oral and poster sessions, and panels. Visit website.

Efficient Segmentation-Aided Text Detection for Intelligent Robots_Poster

Read more about Efficient Segmentation-Aided Text Detection for Intelligent Robots_Poster
Log in to post comments

Scene text detection is a critical prerequisite for many fascinating applications for vision-based intelligent robots. Existing methods detect texts either using the local information only or casting it as a semantic segmentation problem. They tend to produce a large number of false alarms or cannot separate individual words accurately. In this work, we present an elegant segmentation-aided text detection solution that predicts the word-level bounding boxes using an end-to-end trainable deep convolutional neural network.

Poster-Conference.pdf

GlobalSIP2017_Segmentation-aided_Text_Detection (551)

Categories:: Image/Video Processing
Pattern recognition and classification (MLR-PATT)

8 Views

OBJECTIVE ASSESSMENT OF ENVELOPE ENHANCEMENT ALGORITHMS FOR ASSISTIVE HEARING DEVICES

Read more about OBJECTIVE ASSESSMENT OF ENVELOPE ENHANCEMENT ALGORITHMS FOR ASSISTIVE HEARING DEVICES
Log in to post comments

Individuals with auditory neuropathy spectrum disorder (ANSD) or auditory processing disorders (APDs) often suffer from temporal processing deficits leading to degraded speech perception. The situation becomes worse in the presence of background noise. Evidence exists that the exaggeration of speech envelope may enhance intelligibility, although a comprehensive evaluation of envelope enhancement algorithms is lacking.

Moshgelani_SIP.pdf

Moshgelani_SIP.pdf (486)

Categories:: Biomedical signal processing

5 Views

Influence of Audio Bandwidth Reduction on Speech Emotion Recognition by Human Subjects

Audio bandwidth reduction is the very first processing step in any speech and audio coder. Its negative influence on perceived speech quality and intelligibility has been thoroughly studied and is therefore very well documented. This paper examines whether it also has an influence on speech emotion recognition by human subjects. Several standard telephony bandwidths, from fullband down to narrowband, are considered. Listening test results show that recognition accuracy decreases with audio bandwidth.

GlobalSIP-2017_Lahaie_Presentation.pdf

GlobalSIP-2017_Lahaie_Presentation.pdf (729)

Categories:: Speech Coding (SPE-CODI)

38 Views

DESIGN OF BINARY LDPC CODES FOR SLEPIAN-WOLF CODING OF CORRELATED INFORMATION SOURCES

Read more about DESIGN OF BINARY LDPC CODES FOR SLEPIAN-WOLF CODING OF CORRELATED INFORMATION SOURCES
Log in to post comments

In this paper, we propose a scheme for distributed source coding, using low-density parity-check (LDPC) codes to compress close to the Slepian-Wolf limit for correlated binary sources. First, we develop a conventional Belief Propagation (BP) algorithm LDPC decoder which takes the syndrome information into account. Subsequently, modelling the correlation between the sources as a binary symmetric channel (BSC), we replace the received probabilities in the conventional channel with the cross over probability.

Eleruja_Slides.pdf

Eleruja_Slides.pdf (508)

Categories:: Communication Systems and Applications

19 Views

Convergence Analysis of Belief Propagation for Pairwise Linear Gaussian Models

Read more about Convergence Analysis of Belief Propagation for Pairwise Linear Gaussian Models
Log in to post comments

Globalsip17_JianDu_CMU.pdf

Globalsip17_JianDu_CMU.pdf (604)

Categories:: Graphical and kernel methods (MLR-GRKN)

10 Views

FAST NEAR INFRARED FUSION-BASED ADAPTIVE ENHANCEMENT OF VISIBLE

Read more about FAST NEAR INFRARED FUSION-BASED ADAPTIVE ENHANCEMENT OF VISIBLE
Log in to post comments

Visible (VS) and near infra-red (NIR) band sensors provide digital
images that capture complementary spectral radiations from a
scene. Since NIR radiations propagate well through haze, mist, or
fog, the captured NIR image contains better scene details compared
to the VS image in such cases. However, NIR radiations are material
dependent and provide little information about color or texture of
the scene’s objects. To exploit the complementary details provided
by VS and NIR images, we propose a fusion approach that adaptively

Elliethy_Aly_1351.pdf

presentation (529)

Categories:: Image/Video Processing

26 Views

FAST NEAR INFRARED FUSION-BASED ADAPTIVE ENHANCEMENT OF VISIBLE

Read more about FAST NEAR INFRARED FUSION-BASED ADAPTIVE ENHANCEMENT OF VISIBLE
Log in to post comments

Ellithey_Aly_1351.pdf

presentation (465)

Categories:: Image/Video Processing

11 Views

DATA CENSORING WITH SET-MEMBERSHIP ALGORITHMS

Read more about DATA CENSORING WITH SET-MEMBERSHIP ALGORITHMS
Log in to post comments

In this paper, we use the set-membership normalized least-mean-square (SM-NLMS) algorithm to censor the data set in big data applications. First, we use the distribution of the noise signal and the excess of the steady-state mean-square error (EMSE) to estimate the threshold for the desired update rate in the single threshold SM-NLMS (ST-SM-NLMS) algorithm. Then, we introduce the double threshold SM-NLMS (DT-SM-NLMS) algorithm which defines an acceptable
range of the error signal. This algorithm censors the data with very low and very high output estimation error.

Diniz.pdf

DATA CENSORING WITH SET-MEMBERSHIP ALGORITHMS (538)

Categories:: Adaptive Signal Processing

25 Views

Hybrid eye center localization using cascaded regression and robust circle fitting

Read more about Hybrid eye center localization using cascaded regression and robust circle fitting
Log in to post comments

We propose a new cascaded regressor for eye center detection. Previous methods start from a face or an eye detector and use either advanced features or powerful regressors for eye center localization, but not both. Instead, we detect the eyes more accurately using an existing facial feature alignment method. We improve the robustness of localization by using both advanced features and powerful regression machinery. Finally, unlike most other methods that do not refine the regression results, we make the localization more accurate by adding a robust circle fitting post-processing step.

GlobalSIP2017_IrisDetection.pdf

GlobalSIP2017_IrisDetection.pdf (514)

Categories:: Image/Video Processing

14 Views

Heart Sound Segmentation using Switching Linear Dynamical Models

Read more about Heart Sound Segmentation using Switching Linear Dynamical Models
Log in to post comments

Localization of exact positions of the fundamental heart sounds (FHS) is an essential step towards automatic analysis of heart sound phonocardiogram (PCG) recordings, the automatic segmentation allows for data-driven classification of heart pathological events. Current approach using probabilistic models such as hidden Markov models (HMMs) has improved accuracy of heart sound segmentation.

poster_1413.pdf

poster_1413.pdf (582)

Categories:: Biomedical signal processing

12 Views

Pages