Sorry, you need to enable JavaScript to visit this website.

facebooktwittermailshare

SHOW, TRANSLATE AND TELL

Abstract: 

Humans have an incredible ability to process and understand
information from multiple sources such as images,
video, text, and speech. Recent success of deep neural
networks has enabled us to develop algorithms which give
machines the ability to understand and interpret this information.
There is a need to both broaden their applicability and
develop methods which correlate visual information along
with semantic content. We propose a unified model which
jointly trains on images and captions, and learns to generate
new captions given either an image or a caption query.
We evaluate our model on three different tasks namely crossmodal
retrieval, image captioning, and sentence paraphrasing.
Our model gains insight into cross-modal vector embeddings,
generalizes well on multiple tasks and is competitive to state
of the art methods on retrieval.

up
0 users have voted:

Paper Details

Authors:
Dheeraj Peri, Shagan Sah, Raymond Ptucha
Submitted On:
20 September 2019 - 7:51pm
Short Link:
Type:
Poster
Event:
Presenter's Name:
Ray Ptudcha
Paper Code:
2914
Document Year:
2019
Cite

Document Files

STT_v5.pdf

(8)

Subscribe

[1] Dheeraj Peri, Shagan Sah, Raymond Ptucha, "SHOW, TRANSLATE AND TELL", IEEE SigPort, 2019. [Online]. Available: http://sigport.org/4795. Accessed: Oct. 18, 2019.
@article{4795-19,
url = {http://sigport.org/4795},
author = {Dheeraj Peri; Shagan Sah; Raymond Ptucha },
publisher = {IEEE SigPort},
title = {SHOW, TRANSLATE AND TELL},
year = {2019} }
TY - EJOUR
T1 - SHOW, TRANSLATE AND TELL
AU - Dheeraj Peri; Shagan Sah; Raymond Ptucha
PY - 2019
PB - IEEE SigPort
UR - http://sigport.org/4795
ER -
Dheeraj Peri, Shagan Sah, Raymond Ptucha. (2019). SHOW, TRANSLATE AND TELL. IEEE SigPort. http://sigport.org/4795
Dheeraj Peri, Shagan Sah, Raymond Ptucha, 2019. SHOW, TRANSLATE AND TELL. Available at: http://sigport.org/4795.
Dheeraj Peri, Shagan Sah, Raymond Ptucha. (2019). "SHOW, TRANSLATE AND TELL." Web.
1. Dheeraj Peri, Shagan Sah, Raymond Ptucha. SHOW, TRANSLATE AND TELL [Internet]. IEEE SigPort; 2019. Available from : http://sigport.org/4795