- Read more about Invert-and-project (IVP)-A Lossless Compression Method of Multi-scale JPEG Images via DCT Coefficients Prediction
- Log in to post comments
JPEG is a versatile and widely used format for images. Based an elegant design that enables the joint works of basis transformation (gross-scale decorrelation) and entropy coding (fine-scale coding), the resulting JPEG image can maintain virtually all visible features of an image while reducing its size to one tens of the original raw data.
- Categories:
- Read more about Progressive-Granularity Retrieval via Hierarchical Feature Alignment for Person Re-Identification
- Log in to post comments
- Categories:
- Read more about Medical image retrieval based on depth hash
- Log in to post comments
- Categories:
- Read more about Fast Coding of Haar Wavelet Trees
- Log in to post comments
Tarter_DCC.pdf
- Categories:
- Read more about Describe me if you can! Characterized instance-level human parsing
- Log in to post comments
Several computer vision applications such as person search or online fashion rely on human description. The use of instance-level human parsing (HP) is therefore relevant since it localizes semantic attributes and body parts within a person. But how to characterize these attributes? To our knowledge, only some single-HP datasets describe attributes with some color, size and/or pattern characteristics. There is a lack of dataset for multi-HP in the wild with such characteristics.
- Categories:
- Read more about Interpretable representation learning on natural image datasets via reconstruction in visual-semantic embedding space
- Log in to post comments
Unsupervised learning of disentangled representations is a core task for discovering interpretable factors of variation in an image dataset. We propose a novel method that can learn disentangled representations with semantic explanations on natural image datasets. In our method, we guide the representation learning of a variational autoencoder (VAE) via reconstruction in a visual-semantic embedding (VSE) space to leverage the semantic information of image data and explain the learned latent representations in an unsupervised manner.
- Categories:
- Read more about Semantic Role Aware Correlation Transformer For Text To Video Retrieval
- Log in to post comments
With the emergence of social media, voluminous video clips are uploaded every day, and retrieving the most relevant visual content with a language query becomes critical. Most approaches aim to learn a joint embedding space for plain textual and visual contents without adequately exploiting their intra-modality structures and inter-modality correlations.
- Categories:
- Read more about Semantic-Preserving Metric Learning for Video-Text Retrieval (Poster)
- Log in to post comments
- Categories:
- Read more about CHANNEL SHUFFLE RECONSTRUCTION NETWORK FOR IMAGE COMPRESSIVE SENSING
- Log in to post comments
- Categories: