Image, Video, and Multidimensional Signal Processing

Single-Shot Real-Time Multiple-Path Time-of-Flight Depth Imaging for Multi-Aperture and Macro-Pixel Sensors

Multiple-Path Interference (MPI) is a major drawback of Time-of-Flight (ToF) sensors. MPI occurs when a ToF pixel receives more than one light bounce from the scene. Current methods that resolve more than one return per pixel rely on sequentially acquiring large amounts of data and are too computationally expensive to deliver depth images in real time. These factors have so far precluded the development of a multiple-path ToF camera. In this work we consider two hardware alternatives that can be used to acquire all necessary raw data in a single shot.
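
To illustrate why a second light bounce corrupts a ToF depth reading, the sketch below uses the common continuous-wave phasor model, in which each return contributes a complex term whose phase is proportional to its depth. This model, the 20 MHz modulation frequency, the path amplitudes, and the function names `tof_phasor` and `single_path_depth` are assumptions made for illustration; the abstract does not specify the sensor model or the reconstruction method.

```python
import cmath
import math

C = 299_792_458.0  # speed of light in m/s


def tof_phasor(paths, freq_hz):
    """Sum the complex returns of all (amplitude, depth_m) paths at one modulation frequency."""
    return sum(a * cmath.exp(-1j * 4 * math.pi * freq_hz * d / C) for a, d in paths)


def single_path_depth(phasor, freq_hz):
    """Depth that a single-path pipeline would report from the measured phase."""
    phase = (-cmath.phase(phasor)) % (2 * math.pi)
    return C * phase / (4 * math.pi * freq_hz)


freq = 20e6  # hypothetical modulation frequency in Hz

# One direct return at 2.0 m: the phase maps back to the true depth.
depth_direct = single_path_depth(tof_phasor([(1.0, 2.0)], freq), freq)

# Direct return plus a weaker indirect bounce at 3.5 m: the phases mix
# and the reported depth is pulled away from the true 2.0 m.
depth_mixed = single_path_depth(tof_phasor([(1.0, 2.0), (0.5, 3.5)], freq), freq)

print(f"direct only: {depth_direct:.3f} m")
print(f"with MPI:    {depth_mixed:.3f} m")
```

With a single return the estimate matches the true depth; with the extra bounce it lands between the two path lengths, which is the bias that multiple-path methods aim to remove.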

heredia_ICASSP2020_kagawa_v3_final.pdf (457)

Categories: Image, Video, and Multidimensional Signal Processing

105 Views

COMPARE LEARNING: BI-ATTENTION NETWORK FOR FEW-SHOT LEARNING

Learning from few labeled examples is a key challenge for visual recognition, as deep neural networks tend to overfit when trained on only a few samples. Metric learning, one family of few-shot learning methods, addresses this challenge by first learning a deep distance metric that determines whether a pair of images belongs to the same category, then applying the trained metric to instances from a separate test set with limited labels. This approach makes the most of the few available samples and effectively limits overfitting.
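
The metric-based idea can be sketched as a nearest-neighbor lookup over a small labeled support set. This is a minimal illustration, not the bi-attention network from the paper: a fixed Euclidean distance stands in for the learned deep metric, and the two-dimensional embeddings, labels, and function names are hypothetical.

```python
import math


def euclidean(u, v):
    """Plain Euclidean distance; a stand-in for a learned deep distance metric."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def classify(query, support, distance=euclidean):
    """Return the label of the support example closest to the query embedding."""
    label, _ = min(
        ((lbl, distance(query, emb)) for lbl, emb in support),
        key=lambda pair: pair[1],
    )
    return label


# Hypothetical embeddings for a few labeled support images of unseen classes.
support = [
    ("cat", [0.9, 0.1]),
    ("cat", [0.8, 0.2]),
    ("dog", [0.1, 0.9]),
]

prediction = classify([0.7, 0.3], support)
print(prediction)
```

Because classification only compares the query against the few labeled examples, no classifier weights need to be fitted for the new categories, which is how this family of methods limits overfitting.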

ICASSP2020 presentation.pdf (293)

Categories: Image, Video, and Multidimensional Signal Processing

61 Views

EXPOSURE INTERPOLATION VIA HYBRID LEARNING

Deep-learning-based methods have become the dominant solutions to many image processing problems. A natural question is “Is there any room left for conventional methods on these problems?” In this paper, exposure interpolation is taken as an example to answer this question, and the answer is “Yes”. A new hybrid learning framework is introduced to interpolate a medium-exposure image from two images with a large exposure ratio, captured by an emerging high dynamic range (HDR) video capture device. The framework is built by fusing conventional and deep learning methods.

ICASSP2020HybridLearning.pdf (209)

Categories: Image, Video, and Multidimensional Signal Processing

35 Views

COMPLEX PAIRWISE ACTIVITY ANALYSIS VIA INSTANCE LEVEL EVOLUTION REASONING

Video activity analysis systems are often trained on large datasets. Activities and events in the real world do not occur in isolation; instead, they occur as interactions between related objects. This work introduces a novel method that jointly exploits relational information between pairs of objects and the temporal dynamics of each object. The proposed method relies on a new, simple architecture that is flexible and can be easily trained to detect relational activities and events using small datasets (hundreds of samples).

5348_complex_pairwise_activity_presentation.pdf (176)

Categories: Image, Video, and Multidimensional Signal Processing

14 Views

Key Action And Joint CTC-Attention Based Sign Language Recognition

Sign Language Recognition (SLR) translates sign language video into natural language. In practice, sign language video contains a large number of redundant frames, so the essential frames must be selected. However, unlike common video that describes actions, sign language video is a continuous and dense action sequence, which makes it difficult to capture the key actions corresponding to a meaningful sentence. In this paper, we propose to search for key actions hierarchically with a pyramid BiLSTM.

poster_video5717.pdf (257)

Categories: Image, Video, and Multidimensional Signal Processing

56 Views

Key Action And Joint CTC-Attention Based Sign Language Recognition

poster_video5717.pdf (247)

Categories: Image, Video, and Multidimensional Signal Processing

92 Views

Efficient Storage of Images onto DNA Using Vector Quantization

Archiving digital data is becoming very challenging, as conventional electronic devices wear out over time and put at risk any data stored on them. Data migration is therefore necessary every 5-10 years. A large percentage of this stored data is "cold", meaning that it is very rarely accessed but must be safely kept on backup drives for security and compliance reasons. Unfortunately, maintaining and replacing backup tape drives in big data centers is very expensive in terms of both money and energy.