Image/Video Processing

SPATIO-TEMPORAL SLOWFAST NETWORK FOR ACTION RECOGNITION

Read more about SPATIO-TEMPORAL SLOWFAST NETWORK FOR ACTION RECOGNITION
Log in to post comments

SPATIO-TEMPORAL SLOWFAST NETWORK FOR ACTION RECOGNITION.pdf

SPATIO-TEMPORAL SLOWFAST NETWORK FOR ACTION RECOGNITION.pdf (568)

Categories:: Image/Video Processing

20 Views

Joint Demosaicking / Rectification of Fisheye Camera Images using Multi-color Graph Laplacian Regularization

ICIP_20_slides.pdf

ICIP_20_slides.pdf (277)

Categories:: Image/Video Processing

10 Views

Open-Set Metric Learning for Person Re-identification in The Wild

Read more about Open-Set Metric Learning for Person Re-identification in The Wild
Log in to post comments

Person re-identification in the wild needs to simultaneously (frame-wise) detect and re-identify persons and has wide utility in practical scenarios. However, such tasks come with an additional open-set re-ID challenge as all probe persons may not necessarily be present in the (frame-wise) dynamic gallery. Traditional or close-set re-ID systems are not equipped to handle such cases and raise several false alarms as a result. To cope with such challenges open-set metric learning (OSML), based on the concept of Large margin nearest neighbor (LMNN) approach, is proposed.

Presentation Paper 2396.pdf

Presentation Paper 2396.pdf (539)

Categories:: Image/Video Processing

98 Views

DEPTH MAPS FAST SCALABLE COMPRESSION BASED ON CODING UNIT DEPTH

Read more about DEPTH MAPS FAST SCALABLE COMPRESSION BASED ON CODING UNIT DEPTH
Log in to post comments

Présentation_Dorsaf Sebai.pdf

Présentation_Dorsaf Sebai.pdf (286)

Categories:: Image/Video Coding
Image/Video Processing

19 Views

3D Point Cloud Enhancement using Graph-Modelled Multiview Depth Measurements

Read more about 3D Point Cloud Enhancement using Graph-Modelled Multiview Depth Measurements
Log in to post comments

A 3D point cloud is often synthesized from depth measurements collected by sensors at different viewpoints. The acquired measurements are typically both coarse in precision and corrupted by noise. To improve quality, previous works denoise a synthesized 3D point cloud a posteriori, after projecting the imperfect depth data onto the 3D space. Instead, we enhance depth measurements on the sensed images a priori, exploiting inherent 3D geometric correlation across views, before synthesizing a 3D point cloud from the improved measurements.

1972.pdf

3D Point Cloud Enhancement using Graph-Modelled Multiview Depth Measurements (259)

Categories:: Image/Video Processing

10 Views

DEPTH ESTIMATION FROM SINGLE IMAGE AND SEMANTIC PRIOR

Read more about DEPTH ESTIMATION FROM SINGLE IMAGE AND SEMANTIC PRIOR
Log in to post comments

The multi-modality sensor fusion technique is an active research
area in scene understating. In this work, we explore
the RGB image and semantic-map fusion methods for depth
estimation. The LiDARs, Kinect, and TOF depth sensors are
unable to predict the depth-map at illuminate and monotonous
pattern surface. In this paper, we propose a semantic-to-depth
generative adversarial network (S2D-GAN) for depth estimation
from RGB image and its semantic-map. In the first stage,
the proposed S2D-GAN estimates the coarse level depthmap

Virtual_PPT_ICIP20.pdf

Virtual_PPT_ICIP20.pdf (303)

Categories:: Image/Video Processing

19 Views

ENHANCED IMAGE RECONSTRUCTION FROM QUARTER SAMPLING MEASUREMENTS USING AN ADAPTED VERY DEEP SUPER RESOLUTION NETWORK

ICIP2020_Enhanced image reconstruction.pptx

ICIP2020_Enhanced image reconstruction.pptx (303)

Categories:: Image/Video Processing

11 Views

Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset

Read more about Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset
Log in to post comments

Recently, there has been a growing interest in wearable sensors which provides new research perspectives for 360 ° video analysis. However, the lack of 360 ° datasets in literature hinders the research in this field. To bridge this gap, in this paper we propose a novel Egocentric (first-person) 360° Kinetic human activity video dataset (EgoK360). The EgoK360 dataset contains annotations of human activity with different sub-actions, e.g., activity Ping-Pong with four sub-actions which are pickup-ball, hit, bounce-ball and serve.

icip_presentation.pdf

icip_presentation.pdf (619)

Categories:: Image/Video Processing

17 Views

Sparse Directed Graph Learning for Head Movement Prediction in 360 Video Streaming

Read more about Sparse Directed Graph Learning for Head Movement Prediction in 360 Video Streaming
Log in to post comments

High-definition 360 videos encoded in fine quality are typically too large in size to stream in its entirety over bandwidth (BW)-constrained networks. One popular remedy is to interactively extract and send a spatial sub-region corresponding to a viewer's current field-of-view (FoV) in a head-mounted display (HMD) for more BW-efficient streaming. Due to the non-negligible round-trip-time (RTT) delay between server and client, accurate head movement prediction that foretells a viewer's future FoVs is essential.

2069.pdf

2069.pdf (352)

Categories:: Image/Video Processing

45 Views

Estimation of gaze region using two dimensional probabilistic maps constructed using convolutional neural networks

Predicting the gaze of a user can have important applications in hu- man computer interactions (HCI). They find applications in areas such as social interaction, driver distraction, human robot interaction and education. Appearance based models for gaze estimation have significantly improved due to recent advances in convolutional neural network (CNN). This paper proposes a method to predict the gaze of a user with deep models purely based on CNNs.

Jha_2019-poster.pdf

Jha_2019-poster.pdf (508)

Categories:: Image/Video Processing

36 Views

Image/Video Processing

Pages