Image, Video, and Multidimensional Signal Processing

M3VSNet: Unsupervised Multi-metric Multi-view Stereo Network

Current multi-view stereo (MVS) methods based on supervised learning perform impressively compared with traditional MVS methods. However, ground-truth depth maps for training are hard to obtain and cover only a limited range of scenarios. In this paper, we propose a novel unsupervised multi-metric MVS network, named M^3VSNet, for dense point cloud reconstruction without any supervision.

slide1091.pdf (316)

Categories: Image, Video, and Multidimensional Signal Processing

73 Views

SOLVING FOURIER PHASE RETRIEVAL WITH A REFERENCE IMAGE AS A SEQUENCE OF LINEAR INVERSE PROBLEMS

ICIP2021_Fahimeh_Poster.pdf

Poster for the paper titled "SOLVING FOURIER PHASE RETRIEVAL WITH A REFERENCE IMAGE AS A SEQUENCE OF LINEAR INVERS" (366)

Sequential Fourier Phase Retrieval ICIP 2021.pdf

Presentation Slides for the paper titled "SOLVING FOURIER PHASE RETRIEVAL WITH A REFERENCE IMAGE AS A SEQUENCE OF LINEAR INVERS" (505)

Categories: Image, Video, and Multidimensional Signal Processing
Other

20 Views

Adversarial Unsupervised Video Summarization Augmented with Dictionary Loss

Automated unsupervised video summarization by key-frame extraction consists in identifying representative video frames, best abridging a complete input sequence, and temporally ordering them to form a video summary, without relying on manually constructed ground-truth key-frame sets. State-of-the-art unsupervised deep neural approaches consider the desired summary to be a subset of the original sequence, composed of video frames that are sufficient to visually reconstruct the entire input.
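The selection step described above (pick representative frames, then restore temporal order) can be sketched as follows. This is only an illustration: the per-frame importance scores and the function name `select_key_frames` are hypothetical, and the abstract does not say how the network scores or chooses frames.

```python
def select_key_frames(frame_scores, summary_length):
    """Return the indices of the highest-scoring frames in temporal order.

    frame_scores: one hypothetical importance score per video frame.
    summary_length: number of key frames to keep in the summary.
    """
    # Rank frame indices from most to least important.
    ranked = sorted(
        range(len(frame_scores)),
        key=lambda index: frame_scores[index],
        reverse=True,
    )
    # Keep the top frames, then sort the indices so the summary
    # follows the temporal order of the original sequence.
    return sorted(ranked[:summary_length])


# Example with five frames and made-up scores.
summary = select_key_frames([0.1, 0.9, 0.3, 0.8, 0.2], summary_length=3)
print(summary)
```

The result is a subset of the original frame indices, which matches the view of a summary as a subset of the input sequence.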

Poster_ICIP_Kaseris.pdf (344)

Categories: Image, Video, and Multidimensional Signal Processing

31 Views

INTEGRATED GRAD-CAM: SENSITIVITY-AWARE VISUAL EXPLANATION OF DEEP CONVOLUTIONAL NETWORKS VIA INTEGRATED GRADIENT-BASED SCORING

Visualizing the features captured by Convolutional Neural Networks (CNNs) is one of the conventional approaches to interpreting the predictions made by these models in numerous image recognition applications. Grad-CAM is a popular solution that provides such a visualization by combining the activation maps obtained from the model. However, the average gradient-based terms deployed in this method underestimate the contribution of the representations discovered by the model to its predictions.
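For reference, the standard Grad-CAM combination mentioned above can be sketched in a few lines. This is a minimal pure-Python illustration of the baseline method, not the integrated variant proposed in the paper. The function name `grad_cam` and the nested-list inputs are assumptions made for the example; in practice the activations and gradients would come from a convolutional layer of a trained model.

```python
def grad_cam(activations, gradients):
    """Baseline Grad-CAM heatmap from K activation maps and their gradients.

    activations, gradients: lists of K maps, each an H x W nested list.
    The gradients are those of the class score with respect to each map.
    """
    num_maps = len(activations)
    height = len(activations[0])
    width = len(activations[0][0])

    # Weight of each map: the spatial average of its gradients.
    weights = [
        sum(sum(row) for row in grad_map) / (height * width)
        for grad_map in gradients
    ]

    # Weighted sum of the activation maps.
    heatmap = [[0.0] * width for _ in range(height)]
    for k in range(num_maps):
        for i in range(height):
            for j in range(width):
                heatmap[i][j] += weights[k] * activations[k][i][j]

    # ReLU keeps only regions with a positive influence on the class.
    return [[max(0.0, value) for value in row] for row in heatmap]
```

Because each weight is a plain average, positive and negative gradients within one map can cancel out, so that map receives a weight near zero however strong its activations are.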

ICASSP-IGCAM.pdf

Presentation slide deck of IGCAM XAI algorithm (1052)

IG-CAM_Poster.pdf

Poster of IGCAM XAI algorithm (364)

Categories: Image, Video, and Multidimensional Signal Processing

14 Views

ADA-SISE: ADAPTIVE SEMANTIC INPUT SAMPLING FOR EFFICIENT EXPLANATION OF CONVOLUTIONAL NEURAL NETWORKS

Explainable AI (XAI) is an active research area that aims to interpret a neural network’s decisions, ensuring transparency and trust in the task-specified learned models. Recently, perturbation-based model analysis has shown better interpretation, but back-propagation techniques still prevail because of their computational efficiency. In this work, we combine both approaches into a hybrid visual explanation algorithm and propose an efficient interpretation method for convolutional neural networks.

ICASSP-AdaSISE-slides.pdf

Presentation slide deck of Ada-SISE XAI algorithm (623)

4216.pdf

Poster of Ada-SISE XAI algorithm (348)

Categories: Image, Video, and Multidimensional Signal Processing
Neural network learning (MLR-NNLR)

11 Views

LIGHT FIELD STYLE TRANSFER WITH LOCAL ANGULAR CONSISTENCY

ICASSP2021_Poster.pdf (657)

Categories: Image, Video, and Multidimensional Signal Processing

9 Views

MULTI-GRANULARITY FEATURE INTERACTION AND RELATION REASONING FOR 3D DENSE ALIGNMENT AND FACE RECONSTRUCTION

In this paper, we propose a multi-granularity feature interaction and relation reasoning network (MFIRRN) that can recover a detail-rich 3D face and perform more accurate dense alignment in an unconstrained environment. Traditional 3DMM-based methods directly regress parameters, resulting in a lack of fine-grained details in the reconstructed 3D face. To address this, we use different branches to capture discriminative features at different granularities, especially local features at medium and fine granularities.