This paper presents a novel approach for continuous dynamic hand gesture recognition from RGB video input. Our approach contains two main modules. First, the gesture spotting module pre-segments the video sequence containing continuous gestures into isolated gestures. Second, the gesture classification module classifies the segmented gestures. In the gesture spotting module, hand-palm motion and finger movements are fed into a Bidirectional Long Short-Term Memory (Bi-LSTM) network for gesture spotting.
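A minimal PyTorch sketch of such a Bi-LSTM spotter is shown below: per-frame hand-motion features go in, per-frame gesture/non-gesture probabilities come out. The feature size, hidden size, and layer count are illustrative assumptions, not the paper's settings.

```python
import torch
import torch.nn as nn

class BiLSTMSpotter(nn.Module):
    def __init__(self, feat_dim=48, hidden=128):
        super().__init__()
        self.lstm = nn.LSTM(feat_dim, hidden, num_layers=2,
                            batch_first=True, bidirectional=True)
        self.head = nn.Linear(2 * hidden, 2)  # gesture vs. non-gesture per frame

    def forward(self, x):            # x: (batch, frames, feat_dim)
        h, _ = self.lstm(x)          # (batch, frames, 2 * hidden)
        return self.head(h)          # per-frame logits

# Example: a 30-frame clip of palm-motion and finger-movement features.
clip = torch.randn(1, 30, 48)
logits = BiLSTMSpotter()(clip)
boundaries = logits.argmax(-1)       # frame-wise labels used to cut out isolated gestures
```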

Due to the large number and huge diversity of attributes, pedestrian attribute recognition in video surveillance scenarios is a challenging task in the field of computer vision. Unlike most previous works, which focus only on the extremely imbalanced attribute distribution, we put forward a multi-task convolutional neural network (MTCNN) based on a new grouping of attributes, which exploits the spatial correlations among attributes while preserving some independence of each attribute.
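As an illustration of the grouping idea, the sketch below uses a shared backbone feeding one head per attribute group; the backbone choice, group names, and attribute counts are placeholders rather than the paper's configuration.

```python
import torch
import torch.nn as nn
import torchvision.models as models

class GroupedMTCNN(nn.Module):
    def __init__(self, groups):
        super().__init__()
        backbone = models.resnet18(weights=None)
        self.backbone = nn.Sequential(*list(backbone.children())[:-1])  # shared, globally pooled features
        self.heads = nn.ModuleDict(
            {name: nn.Linear(512, n_attrs) for name, n_attrs in groups.items()})

    def forward(self, x):
        f = self.backbone(x).flatten(1)
        return {name: head(f) for name, head in self.heads.items()}

# Hypothetical grouping by body region; each group gets its own multi-label head.
model = GroupedMTCNN({'head': 4, 'upper_body': 10, 'lower_body': 8})
outputs = model(torch.randn(2, 3, 256, 128))   # pedestrian crops
# Each group can be trained with its own (e.g., weighted BCE) loss to handle imbalance.
```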

This paper explores the benefits of 3D face modeling for in-the-wild facial expression recognition (FER). Since in-the-wild 3D FER datasets are limited, we first construct 3D facial data from an available 2D dataset using recent advances in 3D face reconstruction. A 3D facial geometry representation is then extracted with a deep network. In addition, we take advantage of manipulating the 3D face, for example using 2D projected images of the 3D face as additional input for FER. These features are then fused with those of a typical 2D FER network.
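A hedged sketch of this kind of late fusion is given below: features from a standard 2D branch are concatenated with features computed from the reconstructed 3D face (here, its 2D projected image). The branch architectures and dimensions are placeholders, not the paper's networks.

```python
import torch
import torch.nn as nn

class FusedFER(nn.Module):
    def __init__(self, n_classes=7):
        super().__init__()
        self.branch_2d = nn.Sequential(nn.Conv2d(3, 32, 3, 2, 1), nn.ReLU(),
                                       nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.branch_3d = nn.Sequential(nn.Conv2d(3, 32, 3, 2, 1), nn.ReLU(),
                                       nn.AdaptiveAvgPool2d(1), nn.Flatten())
        self.classifier = nn.Linear(64, n_classes)

    def forward(self, img_2d, img_3d_proj):
        # Concatenate 2D appearance features with features of the projected 3D face.
        f = torch.cat([self.branch_2d(img_2d), self.branch_3d(img_3d_proj)], dim=1)
        return self.classifier(f)

logits = FusedFER()(torch.randn(1, 3, 112, 112), torch.randn(1, 3, 112, 112))
```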

This paper presents a novel shape descriptor to effectively and efficiently characterize local image statistics. The proposed descriptor, termed contour covariance (CC), characterizes covariance features driven by a moving point on the shape contour at multiple scales. To calculate the covariance matrices, three basic features, namely texture, intensity, and the distance map, are extracted from the object image. Based on the coefficients of the obtained covariance matrices, the proposed CC descriptor is compact yet informative, as well as invariant to rotation, translation, and scale.
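The NumPy/SciPy sketch below illustrates a contour-covariance style computation under simplifying assumptions: feature vectors (intensity, a simple gradient-magnitude texture cue, distance-map value) are sampled at contour points, and each local window is summarized by the coefficients of its covariance matrix. Window sizes and the texture feature are illustrative choices, not the paper's.

```python
import numpy as np
from scipy import ndimage

def contour_covariance(image, contour, scales=(8, 16, 32)):
    """image: 2-D grayscale array; contour: (N, 2) integer array of (row, col) points."""
    gy, gx = np.gradient(image.astype(float))
    texture = np.hypot(gx, gy)                       # gradient magnitude as a texture cue
    dist_map = ndimage.distance_transform_edt(image > image.mean())
    rows, cols = contour[:, 0], contour[:, 1]
    feats = np.stack([image[rows, cols],
                      texture[rows, cols],
                      dist_map[rows, cols]], axis=1)  # (N, 3) per-point features
    descriptor = []
    for s in scales:                                  # covariances over windows along the contour
        for i in range(0, len(feats) - s + 1, s):
            cov = np.cov(feats[i:i + s].T)            # 3x3 covariance matrix
            descriptor.extend(cov[np.triu_indices(3)])  # keep upper-triangular coefficients
    return np.asarray(descriptor)
```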

Rain removal aims to remove rain streaks from rainy images. The state-of-the-art methods are mostly based on Convolutional Neural Networks (CNNs). However, since a CNN is not equivariant to object rotation, these methods are unsuitable for dealing with tilted rain streaks. To tackle this problem, we propose the Deep Symmetry Enhanced Network (DSEN), which explicitly extracts rotation-equivariant features from rain images. In addition, we design a self-refining mechanism to remove accumulated rain streaks in a coarse-to-fine manner.
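For intuition, the snippet below is a minimal sketch of one way to obtain rotation-equivariant feature maps, a C4 (90-degree) group convolution built by sharing weights across rotated copies of the input. DSEN may use a different group or implementation; this is only illustrative.

```python
import torch
import torch.nn as nn

class C4EquivariantConv(nn.Module):
    def __init__(self, in_ch, out_ch):
        super().__init__()
        self.conv = nn.Conv2d(in_ch, out_ch, 3, padding=1)  # weights shared across rotations

    def forward(self, x):
        responses = []
        for k in range(4):                         # rotate input, convolve, rotate back
            xr = torch.rot90(x, k, dims=(2, 3))
            yr = self.conv(xr)
            responses.append(torch.rot90(yr, -k, dims=(2, 3)))
        return torch.stack(responses, dim=1)       # (batch, 4 orientations, out_ch, H, W)

feat = C4EquivariantConv(3, 16)(torch.randn(1, 3, 64, 64))
# Rotating the input by 90 degrees rotates each feature map and cyclically permutes
# the orientation axis, which is the equivariance property.
```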

Shearlet Transform (ST) is one of the most effective methods for Densely-Sampled Light Field (DSLF) reconstruction from a Sparsely-Sampled Light Field (SSLF). However, ST requires a precise disparity estimation of the SSLF. To this end, a state-of-the-art optical flow method, PWC-Net, is employed in this paper to estimate bidirectional disparity maps between neighboring views in the SSLF. Moreover, to take full advantage of optical flow and ST for DSLF reconstruction, a novel learning-based method, referred to as Flow-Assisted Shearlet Transform (FAST), is proposed.
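The sketch below illustrates only the bidirectional disparity step, assuming views of a horizontal-parallax light field and a pretrained optical-flow network exposed as a callable `flow_net` (PWC-Net in the paper; the callable, its interface, and the flow channel order here are hypothetical).

```python
import torch

def bidirectional_disparities(views, flow_net):
    """views: list of (3, H, W) tensors for neighboring SSLF views, ordered left to right."""
    fwd, bwd = [], []
    for left, right in zip(views[:-1], views[1:]):
        flow_lr = flow_net(left.unsqueeze(0), right.unsqueeze(0))   # assumed shape (1, 2, H, W)
        flow_rl = flow_net(right.unsqueeze(0), left.unsqueeze(0))
        # For purely horizontal camera motion, disparity is the horizontal flow component
        # (channel 0 assumed to be horizontal).
        fwd.append(flow_lr[:, 0])
        bwd.append(flow_rl[:, 0])
    return fwd, bwd

# The resulting disparity maps then parameterize the shearlet-based DSLF reconstruction.
```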

Smiling influences emotional state and may hold tremendous potential for clinical remediation of psychiatric disorders. A few researchers in image synthesis have worked on acting on the emotional state of subjects by automatically deforming their faces to synthesize a joyful expression. However, they apply the same deformation to every subject, while each person smiles differently. In this paper, we move towards a personalized synthesis of the joy expression.

Visual tracking is an important and challenging problem in the field of computer vision. In recent years, Siamese networks have been widely used for visual tracking due to their fast tracking speed, but many Siamese-network-based trackers are trained with either a pairwise loss or a triplet loss, which easily leads to over-fitting. In addition, hard samples among the training data are difficult to distinguish. In this paper, we propose a novel global similarity loss to train the network.
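The abstract does not spell out the global similarity loss, so the snippet below is only an illustrative contrast with pairwise/triplet training: instead of sampling pairs or triplets, every exemplar is scored against all candidates in the batch and supervised with a softmax cross-entropy over the full similarity matrix.

```python
import torch
import torch.nn.functional as F

def batch_global_similarity_loss(exemplar_feats, candidate_feats):
    """exemplar_feats, candidate_feats: (B, D); row i of each comes from the same target."""
    sims = F.normalize(exemplar_feats, dim=1) @ F.normalize(candidate_feats, dim=1).T
    labels = torch.arange(sims.size(0), device=sims.device)  # matching pairs lie on the diagonal
    return F.cross_entropy(sims / 0.1, labels)               # 0.1 is an illustrative temperature
```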

Partial occlusions in face images pose a great problem for most face recognition algorithms, because most of them minimize a second-order loss function, e.g., the mean square error (MSE), which magnifies the effect of the occluded parts. In this paper, we propose a kernel non-second-order loss function for sparse representation (KNS-SR) to recognize or restore partially occluded facial images, which takes advantage of both correntropy and non-second-order statistical measurements.
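As a simplified NumPy illustration of why correntropy-style losses help with occlusion, the sketch below re-weights residuals with a Gaussian kernel (the half-quadratic view of the correntropy loss), so occluded pixels with large residuals receive small weights. The sparsity term and the kernelization of the full KNS-SR formulation are omitted here for brevity.

```python
import numpy as np

def correntropy_weighted_fit(D, y, sigma=0.5, n_iter=10):
    """D: (d, k) dictionary of training faces; y: (d,) possibly occluded test face."""
    x = np.linalg.lstsq(D, y, rcond=None)[0]          # ordinary least-squares start
    for _ in range(n_iter):
        r = y - D @ x
        w = np.exp(-(r ** 2) / (2 * sigma ** 2))      # small weight on occluded pixels
        W = np.diag(w)
        x = np.linalg.solve(D.T @ W @ D + 1e-6 * np.eye(D.shape[1]), D.T @ W @ y)
    return x                                          # robust coding coefficients
```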

Multi-scale object recognition and accurate object localization are two major problems for semantic segmentation in high resolution aerial images. To handle these problems, we design a Context Fuse Module to aggregate multi-scale features and propose an Attention Mix Module to combine different level features for higher localization accuracy. We further employ a Residual Convolutional Module to refine features in all levels. Based on these modules, we construct a new end-to-end network for semantic labeling in aerial images.
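The sketch below shows one common way to aggregate multi-scale context, in the spirit of a context-fusion block: parallel branches with different dilation rates are concatenated and merged. The exact design of the paper's Context Fuse Module is not given here, so the rates and channel counts are placeholders.

```python
import torch
import torch.nn as nn

class ContextFuse(nn.Module):
    def __init__(self, ch, rates=(1, 2, 4, 8)):
        super().__init__()
        self.branches = nn.ModuleList(
            [nn.Conv2d(ch, ch, 3, padding=r, dilation=r) for r in rates])
        self.fuse = nn.Conv2d(ch * len(rates), ch, 1)   # merge the multi-scale responses

    def forward(self, x):
        return self.fuse(torch.cat([b(x) for b in self.branches], dim=1))

out = ContextFuse(64)(torch.randn(1, 64, 32, 32))   # same spatial size, context-fused features
```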
