Sorry, you need to enable JavaScript to visit this website.

Semantic Mapping of Natural Language Input to Database Entries via Convolutional Neural Networks

Citation Author(s):
Zachary Collins, Jim Glass
Submitted by:
Mandy Korpusik
Last updated:
3 March 2017 - 12:34pm
Document Type:
Presentation Slides
Event:
Presenters:
Mandy Korpusik
Paper Code:
2627
 

Natural language processing research has made major advances with the concept of representing words, sentences, paragraphs, and even documents by embedded vector representations. We apply this idea to the problem of relating foods, as expressed in natural language meal descriptions, to corresponding database entries. We generate fixed-length embeddings for U.S. Department of Agriculture (USDA) food database entries, as well as vector-based representations of natural language meal descriptions, through a convolutional neural network (CNN) architecture that predicts whether or not a USDA food item is present in the meal description. We compute dot products between each token in a meal description and a USDA food entry. By ranking the network's predicted average dot product between each possible database food entry and a meal description, we show it is possible to directly predict the USDA foods mentioned in a meal without requiring intermediate steps that would be used in a conventional database access application. We report the performance of this model on a binary verification task of over 48k meal descriptions, and show that this approach, when integrated with a Markov model, substantially outperforms our previous best multi-stage approach involving a conditional random field tagger, probabilistic segmentation, and database lookup.

up
0 users have voted: