Image, Video, and Multidimensional Signal Processing

SHOW, TRANSLATE AND TELL

Read more about SHOW, TRANSLATE AND TELL
Log in to post comments

Humans have an incredible ability to process and understand
information from multiple sources such as images,
video, text, and speech. Recent success of deep neural
networks has enabled us to develop algorithms which give
machines the ability to understand and interpret this information.
There is a need to both broaden their applicability and
develop methods which correlate visual information along
with semantic content. We propose a unified model which
jointly trains on images and captions, and learns to generate

STT_v5.pdf

STT_v5.pdf (387)

Categories:: Image, Video, and Multidimensional Signal Processing

19 Views

MULTI-TASK LEARNING WITH COMPRESSIBLE FEATURES FOR COLLABORATIVE INTELLIGENCE

Read more about MULTI-TASK LEARNING WITH COMPRESSIBLE FEATURES FOR COLLABORATIVE INTELLIGENCE
Log in to post comments

A promising way to deploy Artificial Intelligence (AI)-based services on mobile devices is to run a part of the AI model (a deep neural network) on the mobile itself, and the rest in the cloud. This is sometimes referred to as collaborative intelligence. In this framework, intermediate features from the deep network need to be transmitted to the cloud for further processing. We study the case where such features are used for multiple purposes in the cloud (multi-tasking) and where they need to be compressible in order to allow efficient transmission to the cloud.

ICIP_2019.pptx

ICIP_2019.pptx (438)

Categories:: Image, Video, and Multidimensional Signal Processing

50 Views

A COMPOUND NEURAL NETWORK FOR BRAIN TUMOR SEGMENTATION

Read more about A COMPOUND NEURAL NETWORK FOR BRAIN TUMOR SEGMENTATION
Log in to post comments

ICIP2019-eposter_zh-bupt-V1.pdf

ICIP2019-eposter_zh-bupt-V1.pdf (381)

Categories:: Image, Video, and Multidimensional Signal Processing

14 Views

DEEP UNSUPERVISED LEARNING FOR SIMULTANEOUS VISUAL ODOMETRY AND DEPTH ESTIMATION

Read more about DEEP UNSUPERVISED LEARNING FOR SIMULTANEOUS VISUAL ODOMETRY AND DEPTH ESTIMATION
1 comment
Log in to post comments

ICIP_poster__1.pptx

ICIP_poster__1.pptx (336)

Categories:: Image, Video, and Multidimensional Signal Processing

38 Views

IMPRESSION ESTIMATION FOR DEFORMED PORTRAITS WITH A LANDMARK-BASED RANKING NETWORK

Read more about IMPRESSION ESTIMATION FOR DEFORMED PORTRAITS WITH A LANDMARK-BASED RANKING NETWORK
Log in to post comments

In recent years, it has become a trend for people to manipulate their own portraits before posting them on a social networking service. However, it is difficult to get a desired portrait after manipulation without sufficient experience or skill. To obtain a simpler and more effective portrait manipulation technique, we consider an automated portrait manipulation method based on five impression words: clear, sweet, elegant, modern, and dynamic.

2019ICIPposter_miyata_verfinal3.pdf

2019ICIPposter_miyata_verfinal3.pdf (357)

Categories:: Image, Video, and Multidimensional Signal Processing

51 Views

MULTIMODAL LATENT FACTOR MODEL WITH LANGUAGE CONSTRAINT FOR PREDICATE DETECTION

Read more about MULTIMODAL LATENT FACTOR MODEL WITH LANGUAGE CONSTRAINT FOR PREDICATE DETECTION
Log in to post comments

poster-2.pdf

poster-2.pdf (381)

Categories:: Image, Video, and Multidimensional Signal Processing

17 Views

Multimodal Point Distribution Model for Anthropological Landmark Detection

Read more about Multimodal Point Distribution Model for Anthropological Landmark Detection
Log in to post comments

While current landmark detection algorithms offer a good approximation of the landmark locations, they are often unsuitable for the use in biological research. We present multimodal landmark detection approach, based on Point distribution model that detects a larger number of anthropologically relevant landmarks than the current landmark detection algorithms.
At the same time we show that improving detection accuracy of initial vertices, using image information, to which

ICIP_poster.pdf

ICIP_poster.pdf (383)

Categories:: Image, Video, and Multidimensional Signal Processing

38 Views

VISUAL LOCALIZATION USING SPARSE SEMANTIC 3D MAP

Read more about VISUAL LOCALIZATION USING SPARSE SEMANTIC 3D MAP
Log in to post comments

Accurate and robust visual localization under a wide range of viewing condition variations including season and illumination changes, as well as weather and day-night variations, is the key component for many computer vision and robotics applications. Under these conditions, most traditional methods would fail to locate the camera. In this paper we present a visual localization algorithm that combines structure-based method and image-based method with semantic information.