PREDICTING TONGUE MOTION IN UNLABELED ULTRASOUND VIDEO USING 3D CONVOLUTIONAL NEURAL NETWORKS
- Submitted by: Chengrui Wu
- Last updated: 12 April 2018 - 1:53pm
- Document Type: Poster
- Document Year: 2018
- Presenters: Chengrui Wu
- Paper Code: 1712
A 3-dimensional convolutional neural network is trained on unlabeled ultrasound video to predict an upcoming tongue image from previous ones. The network obtains results superior to those of simpler predictors, and provides a starting point for exploiting the higher-level representation of the tongue learned by the system in a variety of applications in speech research. This work is believed to be the first application of convolutional neural networks to unlabeled ultrasound video for the purpose of predicting tongue movement.
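The prediction task described above needs no labels, because each target frame comes from the video itself. The following is a minimal sketch of that framing, not the poster's implementation. The function names, the context length of 8 frames, the frame size, and the copy-last-frame baseline are all assumptions made for illustration; the poster does not say which simpler predictors were compared. A 3D convolutional network would take arrays shaped like `inputs` and be trained to output `targets`.

```python
import numpy as np


def make_prediction_pairs(video, context=8):
    """Slice a (T, H, W) video into (previous frames, next frame) pairs.

    `context` is a hypothetical choice; the poster does not state how many
    previous frames the network sees.
    """
    video = np.asarray(video, dtype=np.float32)
    if video.ndim != 3:
        raise ValueError("expected a (T, H, W) array")
    n_pairs = video.shape[0] - context
    if n_pairs <= 0:
        raise ValueError("video must have more than `context` frames")
    # inputs[i] holds frames i .. i+context-1; targets[i] is frame i+context.
    inputs = np.stack([video[i:i + context] for i in range(n_pairs)])
    targets = video[context:]
    return inputs, targets


def last_frame_baseline(inputs):
    """Assumed simple predictor: repeat the most recent frame."""
    return inputs[:, -1]


def mse(pred, target):
    """Mean squared pixel error between predicted and true frames."""
    return float(np.mean((pred - target) ** 2))


# Synthetic stand-in for ultrasound frames (random values, not real data).
rng = np.random.default_rng(0)
video = rng.random((20, 16, 16))
inputs, targets = make_prediction_pairs(video, context=8)
baseline_error = mse(last_frame_baseline(inputs), targets)
```

A learned predictor could then be scored with the same `mse` function against `baseline_error`, which gives one possible way to check whether it beats a simple predictor.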