ICASSP 2021 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2021 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.
- Read more about SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION
- Log in to post comments
Saliency-driven image and video coding for humans has gained importance in the recent past. In this paper, we propose such a saliency-driven coding framework for the video coding for machines task using the latest video coding standard Versatile Video Coding (VVC). To determine the salient regions before encoding, we employ the real-time-capable object detection network You Only Look Once (YOLO) in combination with a novel decision criterion. To measure the coding quality for a machine, the state-of-the-art object segmentation network Mask R-CNN was applied to the decoded frame.
- Categories:
- Read more about SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION
- Log in to post comments
Saliency-driven image and video coding for humans has gained importance in the recent past. In this paper, we propose such a saliency-driven coding framework for the video coding for machines task using the latest video coding standard Versatile Video Coding (VVC). To determine the salient regions before encoding, we employ the real-time-capable object detection network You Only Look Once (YOLO) in combination with a novel decision criterion. To measure the coding quality for a machine, the state-of-the-art object segmentation network Mask R-CNN was applied to the decoded frame.
- Categories:
- Read more about Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset
- Log in to post comments
Emotional voice conversion aims to transform emotional prosody in speech while preserving the linguistic content and speaker identity. Prior studies show that it is possible to disentangle emotional prosody using an encoder-decoder network conditioned on discrete representation, such as one-hot emotion labels. Such networks learn to remember a fixed set of emotional styles.
icassp_poster.pdf
icassp_slides.pdf
- Categories:
- Read more about Presentation Slides: SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING
- 1 comment
- Log in to post comments
- Categories:
- Read more about Poster : SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING
- 1 comment
- Log in to post comments
- Categories:
- Read more about Acute Lymphoblastic Leukemia detection based on adaptive unsharpening and Deep Learning
- Log in to post comments
Computer Aided Diagnosis (CAD) systems are increasingly utilizing image analysis and Deep Learning (DL) techniques, due to their high accuracy in several medical imaging fields, including the detection of Acute Lymphoblastic (or Lymphocytic) Leukemia (ALL) from peripheral blood samples. However, no method in the literature has specifically analyzed the focus quality of ALL images or proposed a technique for sharpening the samples in an adaptive way for the purpose of classification.
- Categories:
- Read more about Presentation Slides: SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING
- Log in to post comments
- Categories:
- Read more about SELF-INFERENCE OF OTHERS' POLICIES FOR HOMOGENEOUS AGENTS IN COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING
- Log in to post comments
- Categories:
- Read more about An Adaptive Multi-Scale and Multi-Level Features Fusion Network with Perceptual Loss for Change Detection
- Log in to post comments
Change detection plays a vital role in monitoring and analyzing temporal changes in Earth observation tasks. This paper proposes a novel adaptive multi-scale and multi-level features fusion network for change detection in very-high-resolution bi-temporal remote sensing images. The proposed approach has three advantages. Firstly, it excels in abstracting high-level representations empowered by a highly effective feature extraction module.
MFPNet_poster.pdf
- Categories:
- Read more about POLA: Online Time Series Prediction by Adaptive Learning Rates
- Log in to post comments
Online prediction for streaming time series data has practical use for many real-world applications where downstream decisions depend on accurate forecasts for the future. Deployment in dynamic environments requires models to adapt quickly to changing data distributions without overfitting. We propose POLA (Predicting Online by Learning rate Adaptation) to automatically regulate the learning rate of recurrent neural network models to adapt to changing time series patterns across time.
POLA_slides.pdf
POLA_poster.pdf
- Categories: