Image and Video Analysis and Understanding

Key Action And Joint CTC-Attention Based Sign Language Recognition

Read more about Key Action And Joint CTC-Attention Based Sign Language Recognition
Log in to post comments

Sign Language Recognition (SLR) translates sign language video into natural language. In practice, sign language video, owning a large number of redundant frames, is necessary to be selected the essential. However, unlike common video that describes actions, sign language video is characterized as continuous and dense action sequence, which is difficult to capture key actions corresponding to meaningful sentence. In this paper, we propose to hierarchically search key actions by a pyramid BiLSTM.

poster_video5717.pdf

poster_video5717.pdf (420)

Categories:: Image, Video, and Multidimensional Signal Processing

60 Views

Key Action And Joint CTC-Attention Based Sign Language Recognition

Read more about Key Action And Joint CTC-Attention Based Sign Language Recognition
Log in to post comments

poster_video5717.pdf

poster_video5717.pdf (415)

Categories:: Image, Video, and Multidimensional Signal Processing

102 Views

Increasingly specialized ensemble of Convolutional Neural Networks for Fine-grained recognition

Fine-grained recognition focuses on the challenging task of automatically identifying the subtle differences between similar categories. Current state-of-the-art approaches require elaborated feature learning procedures, involving tuning several hyper-parameters, or rely on expensive human annotations such as objects or parts location. In this paper we propose a simple method for fine-grained recognition that exploits a nearly cost-free attention-based focus operation to construct an ensemble of increasingly specialized Convolutional Neural Networks.

simonelli.pdf

simonelli.pdf (518)

Categories:: Audio and Acoustic Signal Processing

31 Views