- Beyond Keypoint Coding: Temporal Evolution Inference with Compact Feature Representation for Talking Face Video Compression
We propose a talking face video compression framework that implicitly transforms the temporal evolution of faces into a compact feature representation. More specifically, this temporal evolution, which is complex, non-linear and difficult to extrapolate, is modelled in an end-to-end inference framework based on very compact features. Learning a dense motion map from the compact feature representation enables high-quality rendering of the face videos.
- Compressing Cipher Images by Using Semi-tensor Product Compressed Sensing and Pre-mapping
As a new signal processing technology, compressed sensing (CS) has been shown to be a promising solution for compressing cipher images. However, previous CS-based schemes are unsatisfactory in terms of rate-distortion (R-D) performance. To solve this problem, this paper proposes an image encryption-then-compression (ETC) scheme that uses semi-tensor product CS (STP-CS) and pre-mapping. In the proposed scheme, the original image is encrypted with a scrambling operation. After encryption, the cipher image is compressed in three steps.
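As a rough illustration of the two ingredients named in the abstract, the sketch below scrambles pixel positions with a keyed permutation and then takes semi-tensor product measurements, computed as (Φ ⊗ I_t)x for an m×n matrix Φ and a signal of length n·t. The function names, the key-seeded shuffle, and the toy matrix are all assumptions made for illustration; the abstract does not say which scrambling the authors use, and the three compression steps and the pre-mapping are not described in it, so they are not modelled here.

```python
import random


def scramble(pixels, key):
    """Permute pixel positions with a key-seeded shuffle.

    Hypothetical stand-in for the paper's scrambling operation;
    not a secure cipher.
    """
    order = list(range(len(pixels)))
    random.Random(key).shuffle(order)
    return [pixels[i] for i in order], order


def unscramble(cipher, order):
    """Invert scramble() given the permutation it returned."""
    plain = [0] * len(cipher)
    for dst, src in enumerate(order):
        plain[src] = cipher[dst]
    return plain


def stp_measure(phi, x):
    """Semi-tensor product measurement y = (phi kron I_t) x.

    phi is an m x n list of rows and x has length n * t, so the
    result has length m * t. The Kronecker product is never built:
    y[i*t + k] = sum_j phi[i][j] * x[j*t + k].
    """
    m, n = len(phi), len(phi[0])
    if len(x) % n != 0:
        raise ValueError("len(x) must be a multiple of the column count of phi")
    t = len(x) // n
    return [
        sum(phi[i][j] * x[j * t + k] for j in range(n))
        for i in range(m)
        for k in range(t)
    ]


# Toy example: 8 "pixels", a 2 x 4 measurement matrix, 4 measurements.
image = [1, 2, 3, 4, 5, 6, 7, 8]
cipher, order = scramble(image, key=2022)
phi = [[1, 0, 1, 0],
       [0, 1, 0, 1]]
measurements = stp_measure(phi, cipher)
```

The appeal of the semi-tensor product here is that a 2×4 matrix measures a length-8 signal, so the stored measurement matrix is smaller than the 4×8 matrix a conventional CS measurement of the same signal would need.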
DCC 2022.pdf
- Semantic Neural Rendering-based Video Coding: Towards Ultra-Low Bitrate Video Conferencing
DCC-Pre.pptx
- Pathology Image Compression Based on JPEG2000, Multi-Resolutional Human Perception and the Region of Interest Predictions
Efficient remote browsing of pathology images in telemedicine requires efficient image compression. In this work, we establish a visibility threshold (VT) model that jointly considers multiple resolutions and different visual qualities. Based on this model, we propose an image coding method under the JPEG2000 standard for whole-slide pathology images (WSIs), which adapts to the required resolutions and visual qualities.
- Coarse-to-fine Prediction With Local and Nonlocal Correlations for Intra Coding
Recently, many efforts have been devoted to learning non-linear predictions from neighboring samples with deep neural networks. However, existing methods mainly generate predictions from local reference samples and ignore nonlocal self-similarity.
- Attribute-Decomposable Motion Compression Network for 3D MoCap Data
Motion capture (MoCap) data is a fundamental asset for digital entertainment. The growing number of 3D applications makes MoCap data compression increasingly important. In this paper, we propose an end-to-end attribute-decomposable motion compression network using the autoencoder architecture. Specifically, the algorithm consists of an LSTM-based encoder-decoder for compression and decompression. The encoder module decomposes human motion into multiple uncorrelated semantic attributes, including action content, arm space, and motion mirror.
- Improved Deep Image Compression with Joint Optimization of Cross Channel Context Model And Generalized Loop Filter
- Keyframe Insertion for Fast Channel Switching & Packet-Loss Repair in Low-Delay Live Streaming
Keyframe insertion is a solution for fast channel switching and packet-loss repair in low-delay live streaming.
This work lists the requirements of keyframe insertion in three generations of video coding standards (H.264/AVC, H.265/HEVC, and H.266/VVC), and analyzes the quality impact.
- An Improved Multi-reference Frame Loop Filter Algorithm Based on Transformer for VVC
Deep learning methods have achieved good results at the in-loop filtering stage of Versatile Video Coding (VVC). The Multi-Frame In-Loop Filter (MIF) algorithm for HEVC is one network that effectively uses multiple reference frames to enhance reconstruction quality. However, its reference frame selection is inefficient, and its quality enhancement network is highly redundant and does not fully exploit the inter-frame correlations of the video sequence.
- A fast geometric prediction merge mode decision algorithm based on CU gradient for VVC
Geometric prediction merge mode (GPM) is a new inter-prediction tool introduced in Versatile Video Coding (VVC) that partitions coding units (CUs) into non-rectangular blocks to improve coding performance. To reduce the large computational redundancy of the geometric prediction merge mode with motion vector refinement (GPM with MMVD), this paper proposes a new decision algorithm based on CU gradient.
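The abstract does not say how the CU gradient is computed or how it drives the decision, so the following is only a minimal sketch of what a gradient-based early skip could look like. It assumes Sobel operators for the gradient and a single hypothetical threshold on the mean absolute gradient; the names `mean_abs_gradients`, `should_test_gpm`, and the default threshold of 8.0 are illustrative and are not taken from the paper or from any VVC reference software.

```python
def mean_abs_gradients(block):
    """Mean absolute Sobel gradients (horizontal, vertical) of a CU.

    block is a list of rows of luma samples, at least 3 x 3.
    Only interior samples are used, so no padding is needed.
    """
    h, w = len(block), len(block[0])
    if h < 3 or w < 3:
        raise ValueError("block must be at least 3 x 3")
    gx_total = 0
    gy_total = 0
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            up, mid, down = block[y - 1], block[y], block[y + 1]
            gx = (up[x + 1] + 2 * mid[x + 1] + down[x + 1]) \
                - (up[x - 1] + 2 * mid[x - 1] + down[x - 1])
            gy = (down[x - 1] + 2 * down[x] + down[x + 1]) \
                - (up[x - 1] + 2 * up[x] + up[x + 1])
            gx_total += abs(gx)
            gy_total += abs(gy)
    count = (h - 2) * (w - 2)
    return gx_total / count, gy_total / count


def should_test_gpm(block, threshold=8.0):
    """Hypothetical rule: evaluate GPM only for CUs with enough edge energy."""
    gx, gy = mean_abs_gradients(block)
    return gx + gy >= threshold


# A flat CU has no edges; a CU with a vertical edge has a strong
# horizontal gradient.
flat_cu = [[100] * 8 for _ in range(8)]
edge_cu = [[0] * 4 + [255] * 4 for _ in range(8)]
```

The intuition behind this kind of rule is that a non-rectangular split is only likely to pay off when the CU contains an edge, so smooth CUs can skip the costly GPM search. The actual criterion in the paper may differ.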