ICASSP 2021

ICASSP 2021 - IEEE International Conference on Acoustics, Speech and Signal Processing is the world’s largest and most comprehensive technical conference focused on signal processing and its applications. The ICASSP 2021 conference will feature world-class presentations by internationally renowned speakers, cutting-edge session topics and provide a fantastic opportunity to network with like-minded professionals from around the world. Visit website.

POINT OF CARE IMAGE ANALYSIS FOR COVID-19

Read more about POINT OF CARE IMAGE ANALYSIS FOR COVID-19
Log in to post comments

Early detection of COVID-19 is key in containing the pandemic. Disease detection and evaluation based on imaging is fast and cheap and therefore plays an important role in COVID-19 handling. COVID-19 is easier to detect in chest CT, however, it is expensive, non-portable, and difficult to disinfect, making it unfit as a point-of-care (POC) modality. On the other hand, chest X-ray (CXR) and lung ultrasound (LUS) are widely used, yet, COVID-19 findings in these modalities are not always very clear.

COVID19_ICASSP_only_slides_for_pdf.pdf

COVID19_ICASSP_only_slides_for_pdf.pdf (270)

Categories:: Pattern recognition and classification (MLR-PATT)

8 Views

SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION

Read more about SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION
Log in to post comments

Saliency-driven image and video coding for humans has gained importance in the recent past. In this paper, we propose such a saliency-driven coding framework for the video coding for machines task using the latest video coding standard Versatile Video Coding (VVC). To determine the salient regions before encoding, we employ the real-time-capable object detection network You Only Look Once (YOLO) in combination with a novel decision criterion. To measure the coding quality for a machine, the state-of-the-art object segmentation network Mask R-CNN was applied to the decoded frame.

poster_ICASSP_2021.pdf

Poster for ICASSP 2021 (302)

Categories:: Image/Video Coding

4 Views

SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION

Read more about SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION
Log in to post comments

poster_ICASSP_2021.pdf

Poster for ICASSP 2021 (302)

Categories:: Image/Video Coding
Machine Learning for Signal Processing

44 Views

Seen and Unseen emotional style transfer for voice conversion with a new emotional speech dataset

Emotional voice conversion aims to transform emotional prosody in speech while preserving the linguistic content and speaker identity. Prior studies show that it is possible to disentangle emotional prosody using an encoder-decoder network conditioned on discrete representation, such as one-hot emotion labels. Such networks learn to remember a fixed set of emotional styles.

icassp_poster.pdf

Poster (370)

icassp_slides.pdf

Slides (369)

Categories:: Audio Analysis and Synthesis
Speech Synthesis and Generation, including TTS (SPE-SYNT)

39 Views

Presentation Slides: SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING

ICASSP_2021_Slides_5466.pdf

ICASSP_2021_Slides_5466.pdf (314)

Categories:: Other applications of machine learning (MLR-APPL)

10 Views

Poster : SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING

ICASSP_2021_Poster_5466.pdf

Poster of SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING presented at ICASSP 2021 (268)

Categories:: Other applications of machine learning (MLR-APPL)

13 Views

Acute Lymphoblastic Leukemia detection based on adaptive unsharpening and Deep Learning

Computer Aided Diagnosis (CAD) systems are increasingly utilizing image analysis and Deep Learning (DL) techniques, due to their high accuracy in several medical imaging ﬁelds, including the detection of Acute Lymphoblastic (or Lymphocytic) Leukemia (ALL) from peripheral blood samples. However, no method in the literature has speciﬁcally analyzed the focus quality of ALL images or proposed a technique for sharpening the samples in an adaptive way for the purpose of classiﬁcation.

Genovese_ICASSP_2021_Slides_v3.pdf

Presentation slides (277)

Categories:: Medical image analysis

44 Views

Presentation Slides: SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING

ICASSP_2021_Slides_5466.pdf

Slides of SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING presented at ICASSP 2021 (232)

Categories:: Other applications of machine learning (MLR-APPL)

3 Views

SELF-INFERENCE OF OTHERS' POLICIES FOR HOMOGENEOUS AGENTS IN COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING

main.pdf

main.pdf (257)

Categories:: Sequential learning; sequential decision methods (MLR-SLER)

6 Views

An Adaptive Multi-Scale and Multi-Level Features Fusion Network with Perceptual Loss for Change Detection

Change detection plays a vital role in monitoring and analyzing temporal changes in Earth observation tasks. This paper proposes a novel adaptive multi-scale and multi-level features fusion network for change detection in very-high-resolution bi-temporal remote sensing images. The proposed approach has three advantages. Firstly, it excels in abstracting high-level representations empowered by a highly effective feature extraction module.

MFPNet_Slides.pdf

Presentation (230)

MFPNet_poster.pdf

Poster (247)

Categories:: Multimodal signal processing
Image/Video Processing

27 Views

Pages