- Read more about Fast Inverse Mapping of Face GANs
- Log in to post comments
Generative adversarial networks (GANs) synthesize realistic images from random latent vectors. While many studies have explored various training configurations and architectures for GANs, the problem of inverting the generator of GANs has been inadequately investigated. We train a ResNet architecture to map given faces to latent vectors that can be used to generate faces nearly identical to the target. We use a perceptual loss to embed face details in the recovered latent vector while maintaining visual quality using a pixel loss.
- Categories:
- Read more about Efficient Real-Time Video Stabilization with A Novel Least Squares Formulation
- Log in to post comments
We present a novel video stabilization algorithm (LSstab) that removes unwanted motions in real-time. LSstab is based on a novel least squares formulation of the smoothing cost function to alleviate the undesirable camera jitter. A recursive least square solver is derived to minimize the smoothing cost function with an O(N) computation complexity. LSstab is evaluated using a suite of publicly available videos against the state of the art video stabilization methods. Results show LSstab reaches comparable or better performance, achieving real-time processing speed when a GPU is used.
- Categories:
- Read more about SA-Net: Shuffle Attention for Deep Convolutional Neural Networks
- Log in to post comments
Attention mechanisms, which enable a neural network to accurately focus on all the relevant elements of the input, have become an essential component to improve the performance of deep neural networks. There are mainly two attention mechanisms widely used in computer vision studies, spatial attention and channel attention, which aim to capture the pixel-level pairwise relationship and channel dependency, respectively. Although fusing them together may achieve better performance than their individual implementations, it will inevitably increase the computational overhead.
poster.pdf
- Categories:
- Read more about CANET: CONTEXT-AWARE LOSS FOR DESCRIPTOR LEARNING
- Log in to post comments
- Categories:
- Read more about An Empirical Analysis of Recurrent Learning Algorithms In Neural Lossy Image Compression Systems.
- 1 comment
- Log in to post comments
Prior work on image compression has focused on optimizing models to achieve better reconstruction at lower bit rates. These approaches are focused on creating sophisticated architectures that enhance encoder or decoder performance. In some cases, there is the desire to jointly optimize both along with a designed form of entropy encoding. In some instances, these approaches result in the creation of many redundant components, which may or may not be useful.
- Categories:
In this paper, we concentrate on the super-resolution (SR) of compressed screen content video, in an effort to address the real-world challenges by considering the underlying characteristics of screen content. Firstly, we propose a new dataset for the SR of screen content video with different distortion levels. Meanwhile, we design an efficient SR structure that could capture the characteristics of compressed screen content video and manipulate the inner-connections in consecutive compressed low-resolution frames, facilitating the high-quality recovery of the high-resolution counter-part.
- Categories:
- Read more about Reducing Image Compression Artifacts for Deep Neural Networks
- Log in to post comments
Existing compression artifacts reduction methods aim to restore images on pixel-level, which can improves human visual experience. However, in many applications, large-scale images are collected not for visual examination by human. Instead, they are used for many high-level vision tasks usually by Deep Neural Networks (DNN). One fundamental problem here is whether existing artifacts reduction methods can help DNNs improve the performance of the high-level tasks. In this paper, we find that these methods have limited performance improvements to high-level tasks, even bring negative effects.
- Categories:
- Read more about HH-CompWordNet: Holistic Handwritten Word Recognition in the Compressed Domain
- Log in to post comments
Holistic word recognition in handwritten documents is an important research topic in the field of Document Image Analysis. For some applications, given strong language models, it can be more robust and computationally less expensive than character segmentation and recognition. This paper presents HH-CompWordNet, a novel approach to applying a Convolutional Neural Network (CNN) to directly to the DCT coefficients of the compressed domain word images.
- Categories: