Sorry, you need to enable JavaScript to visit this website.

PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEO USING 3D CONVOLUTIONAL NEURAL NETWORKS

Citation Author(s):
Shicheng Chen, Guorui Sheng, Pierre Roussel, Bruce Denby
Submitted by:
Chengrui Wu
Last updated:
12 April 2018 - 1:53pm
Document Type:
Poster
Document Year:
2018
Event:
Presenters:
Chengrui Wu
Paper Code:
1712
 

A 3-dimensional convolutional neural network is trained on unlabeled ultrasound video to predict an upcoming tongue image from previous ones. The network obtains results superior to those of simpler predictors, and provides a starting point for exploiting the higher-level representation of the tongue learned by the system in a variety of applications in speech research. This work is believed to be the first application of convolutional neural networks to unlabeled ultrasound video for the purpose of predicting tongue movement.

up
1 user has voted: Shicheng Chen