- Convolutional Neural Network for Image Compression with Application to JPEG Standard
In this paper, the authors present a novel convolutional neural network structure for lossy image compression, intended for use as part of the JPEG standard image compression stream. The network is trained on randomly selected images from a high-quality dataset of human faces, and its effectiveness is verified experimentally on standard test images.
- Fast GLCM-based Intra Block Partition for VVC
In the latest video coding standard, Versatile Video Coding (H.266/VVC), a new quadtree with nested multi-type tree (QTMT) coding block structure is introduced. QTMT significantly improves coding performance, but the more complex block partitioning structure brings a greater computational burden. To solve this problem, a fast intra block partition pruning algorithm is proposed. It uses the gray level co-occurrence matrix (GLCM) to calculate texture direction information for coding units, and terminates the horizontal or vertical split of the binary tree and the ternary tree in advance.
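As a rough illustration of the idea, the following sketch computes a GLCM for horizontal and vertical neighbour offsets and compares their contrast to guess the dominant texture direction. The function names, the 8-level quantization, the `ratio` threshold, and the rule mapping texture direction to a skipped split are all assumptions made for this example; the abstract does not give the paper's actual features or decision rule.

```python
import numpy as np

def glcm(block, dy, dx, levels=8):
    """Normalized gray level co-occurrence matrix for the offset (dy, dx)."""
    q = (block.astype(np.int64) * levels) // 256   # quantize 8-bit samples
    h, w = q.shape
    a = q[:h - dy, :w - dx].ravel()                # reference pixels
    b = q[dy:, dx:].ravel()                        # neighbours at the offset
    m = np.zeros((levels, levels), dtype=np.float64)
    np.add.at(m, (a, b), 1.0)                      # count co-occurring pairs
    return m / m.sum()

def contrast(m):
    """GLCM contrast: large when neighbouring pixels differ strongly."""
    i, j = np.indices(m.shape)
    return float(((i - j) ** 2 * m).sum())

def splits_to_skip(block, ratio=2.0):
    """Hypothetical rule: skip the split direction that cuts across the texture."""
    c_h = contrast(glcm(block, 0, 1))  # variation between horizontal neighbours
    c_v = contrast(glcm(block, 1, 0))  # variation between vertical neighbours
    if c_v > ratio * c_h:
        return "vertical"    # texture runs horizontally (assumed rule)
    if c_h > ratio * c_v:
        return "horizontal"  # texture runs vertically (assumed rule)
    return None              # no clear direction: test both splits

# A block of horizontal stripes varies only between vertical neighbours.
stripes = np.zeros((8, 8), dtype=np.uint8)
stripes[1::2, :] = 255
print(splits_to_skip(stripes))  # -> vertical
```

A real encoder would apply such a test per coding unit before the rate-distortion search, so that the binary-tree and ternary-tree splits in the skipped direction are never evaluated.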
- Lossy Compression for Integrating Event Sensors
- Multiscale Point Cloud Geometry Compression
Recent years have witnessed the growth of point cloud based applications for both immersive media and 3D sensing for autonomous driving, because point clouds offer a realistic and fine-grained representation of 3D objects and scenes. However, compressing sparse, unstructured, and high-precision 3D points for efficient communication is a challenging problem. In this paper, leveraging the sparse nature of the point cloud, we propose a multiscale end-to-end learning framework that hierarchically reconstructs the 3D Point Cloud Geometry (PCG) via progressive re-sampling.
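The multiscale part of this idea can be pictured with a small sketch that repeatedly halves the resolution of a voxelized point cloud. This shows only the geometric down-scaling under the assumption of integer voxel coordinates; the learned encoder, entropy model, and reconstruction networks of the paper are not reproduced, and `downsample` and `multiscale` are names invented for this example.

```python
import numpy as np

def downsample(voxels):
    """Halve the resolution: occupied voxels in the same 2x2x2 cell merge into one."""
    return np.unique(voxels // 2, axis=0)

def multiscale(voxels, num_scales=3):
    """Return the occupied voxels at the original scale and num_scales coarser ones."""
    scales = [np.unique(voxels, axis=0)]
    for _ in range(num_scales):
        scales.append(downsample(scales[-1]))
    return scales

points = np.array([[0, 0, 0], [1, 1, 1], [2, 2, 2], [7, 7, 7]])
for level, s in enumerate(multiscale(points)):
    print(level, len(s))  # occupied voxel count shrinks at each coarser scale
```

A decoder working in the opposite direction would start from the coarsest scale and decide, level by level, which child voxels are occupied.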
- SLFC: Scalable Light Field Coding
Light field imaging enables post-processing capabilities such as refocusing, changing the view perspective, and depth estimation. Because light field images are represented by multiple views, they contain a huge amount of data, which makes compression unavoidable. Although there are proposals for compressing light field images efficiently, their main focus is on encoding efficiency. Important functionalities such as viewpoint and quality scalability, random access, and uniform quality distribution have not been addressed adequately.
- JQF: Optimal JPEG Quantization Table Fusion by Simulated Annealing on Texture Images and Predicting Textures
JPEG has been a widely used lossy image compression codec for nearly three decades. The JPEG standard allows the use of customized quantization tables; however, finding an optimal quantization table at an acceptable computational cost is still a challenging problem. This work tries to resolve the trade-off between computational cost and image-specific optimality by introducing a new concept of texture mosaic images.
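To make the search problem concrete, here is a minimal sketch of simulated annealing over an 8x8 quantization table. The cost function (quantization error plus a weighted fraction of non-zero coefficients), the linear cooling schedule, and all names and parameters are assumptions chosen for illustration; the paper's texture mosaic images and its actual objective are not reproduced here.

```python
import math
import random
import numpy as np

def cost(table, coeffs, lam=10.0):
    """Toy rate-distortion proxy: quantization MSE plus weighted non-zero ratio."""
    q = np.round(coeffs / table)                   # quantized DCT coefficients
    distortion = np.mean((coeffs - q * table) ** 2)
    rate = np.mean(q != 0)                         # crude stand-in for bit rate
    return float(distortion + lam * rate)

def anneal(coeffs, table, steps=500, t0=1.0, seed=0):
    """Perturb one table entry per step; accept worse tables with falling probability."""
    rng = random.Random(seed)
    current = table.copy()
    cur_cost = cost(current, coeffs)
    best, best_cost = current.copy(), cur_cost
    for k in range(steps):
        temp = t0 * (1.0 - k / steps) + 1e-9       # linear cooling (assumed)
        cand = current.copy()
        i, j = rng.randrange(8), rng.randrange(8)
        # Keep entries within 1..255, the range of 8-bit quantization values.
        cand[i, j] = min(255, max(1, cand[i, j] + rng.choice((-1, 1))))
        c = cost(cand, coeffs)
        if c < cur_cost or rng.random() < math.exp((cur_cost - c) / temp):
            current, cur_cost = cand, c
            if c < best_cost:
                best, best_cost = cand.copy(), c
    return best, best_cost

# Synthetic stand-in for the 8x8 DCT blocks of an image.
blocks = np.random.default_rng(0).normal(0.0, 50.0, size=(16, 8, 8))
start = np.full((8, 8), 16, dtype=np.int64)
table, final_cost = anneal(blocks, start)
print(cost(start, blocks), final_cost)
```

Because every step re-evaluates the cost over all blocks, the run time grows with both image size and step count, which is the computational burden the abstract refers to.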
- An Efficient QP Variable Convolutional Neural Network Based In-loop Filter for Intra Coding
In this paper, a novel QP variable convolutional neural network based in-loop filter is proposed for VVC intra coding. To avoid training and deploying multiple networks, we develop an efficient QP attention module (QPAM) that captures compression noise levels for different QPs and emphasizes meaningful features along the channel dimension. We then embed QPAM into the residual block and, based on it, design a network architecture that can be controlled for different QPs.
- On Intra Video Coding and In-Loop Filtering for Neural Object Detection Networks
Classical video coding, which targets humans as the final user, is a widely investigated field of study for visual content, and common video codecs are all optimized for the human visual system (HVS). But are these assumptions and optimizations still valid when the compressed video stream is analyzed by a machine? To answer this question, we compared the performance of two state-of-the-art neural detection networks when fed with degraded input images coded with HEVC and VVC in an autonomous driving scenario using intra coding.
- Memory Assessment of Versatile Video Coding
This work presents a memory assessment of the next-generation Versatile Video Coding (VVC). The memory analyses adopt the state-of-the-art High-Efficiency Video Coding (HEVC) as a baseline. The goal is to offer insights and observations on how much the memory requirements of VVC are aggravated compared to HEVC. The adopted methodology consists of two sets of experiments: (1) an overall memory profiling and (2) an inter-prediction specific memory analysis. The memory profiling results show that VVC accesses up to 13.4x more memory than HEVC.