Multimedia Signal Processing

ENCODER-RECURRENT DECODER NETWORK FOR SINGLE IMAGE DEHAZING

Read more about ENCODER-RECURRENT DECODER NETWORK FOR SINGLE IMAGE DEHAZING
Log in to post comments

ICASSP2020.pdf

ICASSP2020.pdf (625)

Categories:: Image/Video Processing
Multimedia Signal Processing

62 Views

Convolutional Neural Network based Fast Intra Mode Prediction for H.266/FVC Video Coding

Convolutional Neural Network based Fast Intra Mode Prediction.pdf

Convolutional Neural Network based Fast Intra Mode Prediction.pdf (560)

Categories:: Multimedia Signal Processing

114 Views

Segmentation of Text-Lines and Words from JPEG Compressed Printed Text Documents Using DCT Coefficients

Segmenting a document image into text-lines and words finds applications in many research areas of DIA(Document Image Analysis) such as OCR, Word Spotting, and document retrieval. However, carrying out segmentation operation directly in the compressed document images is still an unexplored and challenging research area. Since JPEG is most widely accepted compression algorithm, this research paper attempts to segment a JPEG compressed printed text document image into text-lines and words, without fully decompressing the image.

dccv.pdf

DCC2020 Paper ID 181 (507)

Categories:: Image, Video, and Multidimensional Signal Processing
Machine Learning for Signal Processing
Multimedia Signal Processing

59 Views

Multimodal active speaker detection and virtual cinematography for video conferencing

Read more about Multimodal active speaker detection and virtual cinematography for video conferencing
Log in to post comments

Active speaker detection (ASD) and virtual cinematography (VC) can significantly improve the remote user experience of a video conference by automatically panning, tilting and zooming of a video conferencing camera: users subjectively rate an expert video cinematographer’s video significantly higher than unedited video. We describe a new automated ASD and VC that performs within 0.3 MOS of an expert cinematographer based on subjective ratings with a 1-5 scale.

ICASSP 2020 ASD.pdf

ICASSP 2020 ASD.pdf (1257)

Categories:: Multimedia Signal Processing

37 Views

A Fast Iterative Method for Removing Sparse Noise from Sparse Signals

Read more about A Fast Iterative Method for Removing Sparse Noise from Sparse Signals
Log in to post comments

Reconstructing a signal corrupted by impulsive noise is of high importance in several applications, including impulsive noise removal from images, audios and videos, and separating texts from images. Investigating this problem, in this paper we propose a new method to reconstruct a noise-corrupted signal where both signal and noise are sparse but in different domains. We apply our algorithm for impulsive noise (Salt-and-Pepper Noise (SPN) and Random-Valued Impulsive Noise (RVIN) removal from images and compare our results with other notable algorithms in the literature.

Sahar Sadrizadeh.pdf

Sahar Sadrizadeh.pdf (475)

Categories:: Multimedia Signal Processing

48 Views

Dynamic Guidance For Depth Map Restoration

Read more about Dynamic Guidance For Depth Map Restoration
Log in to post comments

Dynamic Guidance For Depth Map Restoration-RanZhu(paper id 80).pdf

Dynamic Guidance For Depth Map Restoration-RanZhu(paper id 80).pdf (552)

Categories:: Multimedia Signal Processing

26 Views

RATE-DISTORTION DRIVEN SEPARATION OF DIFFUSE AND SPECULAR COMPONENTS IN MULTIVIEW IMAGERY

In this work we explore an overcomplete representation of
multiview imagery for the purpose of compression. We
present a rate-distortion (R-D) driven approach to decompose
multiview datasets into two additive parts which can
be interpreted as being the diffuse and specular components.
We apply different transforms to each component such that
the compressibility of input data is improved. We describe
a framework which performs the R-D optimized separation
in a registered domain to avoid the complexity of warping

ICIP2019_Haghighat_Poster.pdf

ICIP2019_Haghighat_Poster.pdf (567)

Categories:: Multimedia Signal Processing

15 Views

Influence of viewpoint on visual saliency models for volumetric content

Read more about Influence of viewpoint on visual saliency models for volumetric content
Log in to post comments

In order to predict where humans look in a 3D immersive en- vironment, saliency can be computed using either 3D saliency models or view-based approaches (2D projection). In fact, building a 3D complete model is still a challenging task that is not investigated enough in the research field while 2D imag- ing approaches have been extensively studied and have shown solid performances.

Lecture_ICIP_2019.pdf

Lecture_ICIP_2019.pdf (444)

Categories:: Multimedia Signal Processing

37 Views

HAND GRAPH REPRESENTATIONS FOR UNSUPERVISED SEGMENTATION OF COMPLEX ACTIVITIES

Read more about HAND GRAPH REPRESENTATIONS FOR UNSUPERVISED SEGMENTATION OF COMPLEX ACTIVITIES
Log in to post comments

Analysis of hand skeleton data can be used to understand patterns in manipulation and assembly tasks. This paper introduces a graphbased representation of hand skeleton data and proposes a method to perform unsupervised temporal segmentation of a sequence of subtasks in order to evaluate the efficiency of an assembly task. We explore the properties of different choices of hand graphs and their spectral decomposition. A comparative performance of these graphs is presented in the context of complex activity segmentation.