Dynamic Gesture Recognition

Source: arXiv
Abstract

Human-Machine Interaction (HMI) is an important research topic in machine learning that has been investigated in depth thanks to the growth of computing power in recent years. For the first time, it is possible to use machine learning to classify images and/or videos instead of relying on traditional computer vision algorithms. The aim of this paper is to combine a convolutional neural network (CNN) with a recurrent neural network (RNN) to recognize cultural/anthropological Italian sign language gestures from videos. The CNN extracts the important features of each frame, which are then passed to the RNN. The RNN stores temporal information inside the model, so that context from previous frames can improve prediction accuracy. Our novel approach works from RGB frames only and uses several data augmentation techniques and regularization methods to avoid overfitting and keep the generalization error small.
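The division of labor described in the abstract (per-frame feature extraction, then a recurrent pass over the frame sequence) can be illustrated with a minimal, untrained sketch in plain Python. Everything in it is an assumption made for illustration and not the authors' architecture: the function names (`conv_features`, `rnn_step`, `classify_clip`, `init_params`) are hypothetical, the "CNN" is a single 3x3 convolution layer with ReLU and global average pooling, the "RNN" is a basic Elman cell, frames are single-channel rather than RGB, and the weights are random.

```python
import math
import random


def conv_features(frame, kernels):
    """Stand-in for the CNN: 3x3 convolution, ReLU, global average pooling."""
    h, w = len(frame), len(frame[0])
    feats = []
    for k in kernels:
        total = 0.0
        for i in range(h - 2):
            for j in range(w - 2):
                s = sum(k[a][b] * frame[i + a][j + b]
                        for a in range(3) for b in range(3))
                total += max(s, 0.0)  # ReLU
        feats.append(total / ((h - 2) * (w - 2)))  # one value per kernel
    return feats


def rnn_step(x, h, w_xh, w_hh):
    """Elman update: h_t = tanh(W_xh x_t + W_hh h_{t-1})."""
    return [
        math.tanh(sum(w_xh[r][c] * x[c] for c in range(len(x)))
                  + sum(w_hh[r][c] * h[c] for c in range(len(h))))
        for r in range(len(h))
    ]


def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    s = sum(e)
    return [v / s for v in e]


def classify_clip(frames, params):
    """Run the feature extractor on each frame, carry the hidden state
    across frames, and classify from the final hidden state."""
    h = [0.0] * len(params["w_hh"])
    for frame in frames:
        x = conv_features(frame, params["kernels"])
        h = rnn_step(x, h, params["w_xh"], params["w_hh"])
    logits = [sum(row[c] * h[c] for c in range(len(h)))
              for row in params["w_out"]]
    return softmax(logits)


def random_matrix(rows, cols, rng):
    return [[rng.uniform(-0.5, 0.5) for _ in range(cols)] for _ in range(rows)]


def init_params(n_kernels, hidden, n_classes, seed=0):
    """Random (untrained) weights, for illustration only."""
    rng = random.Random(seed)
    return {
        "kernels": [random_matrix(3, 3, rng) for _ in range(n_kernels)],
        "w_xh": random_matrix(hidden, n_kernels, rng),
        "w_hh": random_matrix(hidden, hidden, rng),
        "w_out": random_matrix(n_classes, hidden, rng),
    }


# A toy "video": 5 frames of 8x8 single-channel pixels in [0, 1).
rng = random.Random(1)
clip = [[[rng.random() for _ in range(8)] for _ in range(8)] for _ in range(5)]
params = init_params(n_kernels=4, hidden=6, n_classes=3)
probs = classify_clip(clip, params)  # one probability per gesture class
```

Because the hidden state is carried from frame to frame, the final prediction depends on the whole sequence rather than on any single image, which is the role the abstract assigns to the RNN. A real system would replace both stages with trained networks.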

Costanza Maria Improta, Jonas Bokstaller

Computing technology, computer technology; science, scientific research; information dissemination, knowledge dissemination

Costanza Maria Improta, Jonas Bokstaller. Dynamic Gesture Recognition [EB/OL]. (2021-09-20) [2025-05-18]. https://arxiv.org/abs/2109.09396.
