Image/Video Storage, Retrieval

Interpretable representation learning on natural image datasets via reconstruction in visual-semantic embedding space

Unsupervised learning of disentangled representations is a core task for discovering interpretable factors of variation in an image dataset. We propose a novel method that can learn disentangled representations with semantic explanations on natural image datasets. In our method, we guide the representation learning of a variational autoencoder (VAE) via reconstruction in a visual-semantic embedding (VSE) space to leverage the semantic information of image data and explain the learned latent representations in an unsupervised manner.

Nakagawa_ICIP_Slide.pdf

Poster (353)

Categories:: Image/Video Storage, Retrieval

70 Views

Semantic Role Aware Correlation Transformer For Text To Video Retrieval

Read more about Semantic Role Aware Correlation Transformer For Text To Video Retrieval
Log in to post comments

With the emergence of social media, voluminous video clips are uploaded every day, and retrieving the most relevant visual content with a language query becomes critical. Most approaches aim to learn a joint embedding space for plain textual and visual contents without adequately exploiting their intra-modality structures and inter-modality correlations.

burak_poster.pdf

burak_poster.pdf (271)

Categories:: Image/Video Storage, Retrieval

13 Views

Semantic-Preserving Metric Learning for Video-Text Retrieval (Poster)

Read more about Semantic-Preserving Metric Learning for Video-Text Retrieval (Poster)
Log in to post comments

1006_ICIP2021_poster.pdf

1006_ICIP2021_poster.pdf (270)

Categories:: Image/Video Storage, Retrieval

7 Views

CHANNEL SHUFFLE RECONSTRUCTION NETWORK FOR IMAGE COMPRESSIVE SENSING

Read more about CHANNEL SHUFFLE RECONSTRUCTION NETWORK FOR IMAGE COMPRESSIVE SENSING
Log in to post comments

Li.pdf

Li.pdf (449)

Categories:: Image/Video Storage, Retrieval

40 Views

LIGHTWEIGHT IMAGE SUPER-RESOLUTION RECONSTRUCTION WITH HIERARCHICAL FEATURE-DRIVEN NETWORK

Li.pdf

Li.pdf (320)

Categories:: Image/Video Storage, Retrieval

45 Views

Video Embed: This video provider is not currently supported.

Read more about Video Embed: This video provider is not currently supported.
Log in to post comments

Scale-invariant siamese network for person re-identification(paper code 3023).pdf

Scale-invariant siamese network for person re-identification(paper code 3023).pdf (295)

Categories:: Image/Video Storage, Retrieval

26 Views

DEEP SMOOTHED PROJECTED LANDWEBER NETWORK FOR BLOCK-BASED IMAGE COMPRESSIVE SENSING

Read more about DEEP SMOOTHED PROJECTED LANDWEBER NETWORK FOR BLOCK-BASED IMAGE COMPRESSIVE SENSING
Log in to post comments

ICIP_presentation.pdf

ICIP_presentation.pdf (593)

Categories:: Image/Video Storage, Retrieval

59 Views

Attention Boosted Deep Networks for Video Classficaition

Read more about Attention Boosted Deep Networks for Video Classficaition
Log in to post comments

Video classification can be performed by summarizing image contents of individual frames into one class by deep neural networks, e.g., CNN and LSTM. Human interpretation of video content is influenced by the attention mechanism. In other words, video class can be more attentively decided by certain information than others. In this paper, we propose to integrate the attention mechanism into deep networks for video classification.

Attention Boosted Deep Networks for Video Classficaition.pdf

Attention Boosted Deep Networks for Video Classficaition.pdf (1152)

Categories:: Image/Video Storage, Retrieval

30 Views

Deep Multi-Region Hashing

Read more about Deep Multi-Region Hashing
Log in to post comments

DMRH.pdf

paper slide: Deep Multi-Region Hashing (340)

Categories:: Image/Video Storage, Retrieval

9 Views

QUANTIZED TENSOR ROBUST PRINCIPAL COMPONENT ANALYSIS

Read more about QUANTIZED TENSOR ROBUST PRINCIPAL COMPONENT ANALYSIS
Log in to post comments

High-dimensional data structures, known as tensors, are fundamental in many applications, including multispectral imaging and color video processing. Compression of such huge amount of multidimensional data collected over time is of paramount importance, necessitating the process of quantization of measurements into discrete values. Furthermore, noise and issues related to the acquisition and transmission of signals frequently lead to unobserved, lost or corrupted measurements.

QTRPCA_presentation.pdf

QTRPCA_presentation.pdf (373)

Categories:: Image/Video Storage, Retrieval

22 Views

Image/Video Storage, Retrieval

Pages