ICASSP is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The 2019 conference will feature world-class presentations by internationally renowned speakers and cutting-edge session topics, and will provide a fantastic opportunity to network with like-minded professionals from around the world.
- Read more about DEVELOPMENT AND EVALUATION OF JAPANESE TEXT-TO-SPEECH MIDDLEWARE FOR 32-BIT MICROCONTROLLERS
- Read more about SIMPLE COOPERATIVE TRANSMISSION SCHEMES FOR UNDERLAY SPECTRUM SHARING USING SYMBOL-LEVEL PRECODING AND LOAD-CONTROLLED ARRAYS
The combination of coordinated multi-point (CoMP) and underlay spectrum sharing promises substantial spectral efficiency (SE) gains for future cellular networks. However, this concept has been largely overlooked in the literature. Moreover, none of the few relevant studies consider the use of “standard” transmission strategies to facilitate the adoption of the aforementioned communication paradigm by 5G networks.
- Read more about ICASSP 2019 SLIDES: SPEAKER CHANGE DETECTION USING FUNDAMENTAL FREQUENCY WITH APPLICATION TO MULTI-TALKER SEGMENTATION
This paper shows that time-varying pitch properties can be used advantageously within the segmentation step of a multi-talker diarization system. First, a study is conducted to verify that changes in pitch are strong indicators of changes in the speaker. It is then highlighted that an individual’s pitch varies smoothly and can therefore be predicted by means of a Kalman filter. Subsequently, it is shown that if the pitch is not predictable, this is most likely due to a change in the speaker.
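The idea in this abstract (predict the pitch with a Kalman filter and treat a failed prediction as a likely speaker change) can be illustrated with a minimal sketch. This is not the authors' implementation: the random-walk pitch model, the noise variances `q` and `r`, the `threshold`, and the function name `detect_pitch_changes` are all assumptions chosen for illustration.

```python
def detect_pitch_changes(f0, q=4.0, r=25.0, threshold=9.0):
    """Flag frames where a scalar Kalman filter fails to predict the pitch.

    f0        -- per-frame pitch values in Hz (voiced frames only)
    q, r      -- assumed process / measurement noise variances (Hz^2)
    threshold -- assumed limit on the normalized squared innovation
    """
    if not f0:
        return []
    x, p = f0[0], r              # state estimate and its variance
    changes = []
    for i, z in enumerate(f0[1:], start=1):
        p_pred = p + q           # predict: random-walk model keeps x unchanged
        s = p_pred + r           # innovation variance
        nu = z - x               # innovation (prediction error)
        if nu * nu / s > threshold:
            # Pitch was not predictable: mark a candidate speaker change
            # and restart the filter from the new observation.
            changes.append(i)
            x, p = z, r
            continue
        k = p_pred / s           # Kalman gain
        x = x + k * nu           # update state estimate
        p = (1.0 - k) * p_pred   # update estimate variance
    return changes


# Hypothetical pitch track: one talker near 120 Hz, then another near 210 Hz.
track = [120, 121, 119, 120, 122, 121, 210, 211, 209, 210]
print(detect_pitch_changes(track))  # [6]
```

A real system would also need to handle unvoiced frames and pitch-tracking errors such as octave jumps, which this sketch ignores.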
- Read more about NEUROMORPHIC VISION SENSING FOR CNN-BASED ACTION RECOGNITION
Neuromorphic vision sensing (NVS) hardware is now gaining traction as a low-power/high-speed visual sensing technology that circumvents the limitations of conventional active pixel sensing (APS) cameras. While object detection and tracking models have been investigated in conjunction with NVS, there is currently little work on NVS for higher-level semantic tasks, such as action recognition.
- Read more about Bilinear Representation for Language-based Image Editing Using Conditional Generative Adversarial Networks
The task of Language-Based Image Editing (LBIE) aims at generating a target image by editing the source image based on the given language description. The main challenge of LBIE is to disentangle the semantics in image and text and then combine them to generate realistic images. Therefore, the editing performance is heavily dependent on the learned representation.
- Read more about Segmentation, Classification, and Visualization of Orca Calls using Deep Learning
Audiovisual media are increasingly used to study the communication and behavior of animal groups, e.g. by placing microphones in the animals' habitat, which results in huge datasets containing only a small number of animal interactions. The Orcalab has recorded orca whales since 1973 using stationary underwater hydrophones and has made the recordings publicly available on the Orchive. There exist over 15 000 manually extracted orca/noise annotations and about 20 000 h of unseen audio data. To analyze the behavior and communication of killer whales, we need to interpret the different call types.
- Read more about Secure MIMO Interference Channel with Confidential Messages and Delayed CSIT
Presentation slides from ICASSP 2019.
- Read more about ATOM SELECTION IN CONTINUOUS DICTIONARIES: RECONCILING POLAR AND SVD APPROXIMATIONS
This work deals with an efficient atom selection procedure in a continuous dictionary, as required for instance in a Frank-Wolfe approach within a BLASSO problem for the one-dimensional deconvolution problem. We show that efficient maximization of a correlation between any given vector and an atom sweeping a continuous dictionary can be performed through a particular piece-wise linear approximation of dictionaries: the polar approximation. We finally identify the polar approximation as being optimal in a mean square error
- Read more about DEEP LEARNING THE EEG MANIFOLD FOR PHONOLOGICAL CATEGORIZATION FROM ACTIVE THOUGHTS
Speech-related Brain-Computer Interfaces (BCI) aim primarily at finding an alternative vocal communication pathway for people with speaking disabilities. As a step towards full decoding of imagined speech from active thoughts, we present a BCI system for subject-independent classification of phonological categories exploiting a novel deep learning based