TY - JOUR
T1 - A deep neural network for hand gesture recognition from RGB image in complex background
AU - Tsai, Tsung Han
AU - Ho, Yuan Chen
AU - Chi, Po Ting
AU - Chen, Ting Jia
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
PY - 2024/8
Y1 - 2024/8
N2 - Deep learning research has gained significant popularity recently, finding applications in various domains such as image preprocessing, segmentation, object recognition, and semantic analysis. Deep learning has gradually replaced traditional algorithms such as color-based methods, contour-based methods, and motion-based methods. In the context of hand gesture recognition, traditional algorithms heavily rely on depth information for accuracy, but their performance is often subpar. This paper introduces a novel approach using a deep neural network for hand gesture recognition, requiring only a single complementary metal oxide semiconductor (CMOS) camera to operate amidst complex backgrounds. The neural network design incorporates depthwise separable convolutional layers, dividing the model into segmentation and recognition components. As our proposed single-stage model, we avoid the use of the whole model and thus reduce the number of weights and calculations. Additionally, in the training phase, the data augmentation and iterative training strategy further increase recognition accuracy. The results show that the proposed work uses little parameter usage while still having a higher gesture recognition rate than the other works.
AB - Deep learning research has gained significant popularity recently, finding applications in various domains such as image preprocessing, segmentation, object recognition, and semantic analysis. Deep learning has gradually replaced traditional algorithms such as color-based methods, contour-based methods, and motion-based methods. In the context of hand gesture recognition, traditional algorithms heavily rely on depth information for accuracy, but their performance is often subpar. This paper introduces a novel approach using a deep neural network for hand gesture recognition, requiring only a single complementary metal oxide semiconductor (CMOS) camera to operate amidst complex backgrounds. The neural network design incorporates depthwise separable convolutional layers, dividing the model into segmentation and recognition components. As our proposed single-stage model, we avoid the use of the whole model and thus reduce the number of weights and calculations. Additionally, in the training phase, the data augmentation and iterative training strategy further increase recognition accuracy. The results show that the proposed work uses little parameter usage while still having a higher gesture recognition rate than the other works.
KW - Attention model
KW - Deep neural network
KW - Depthwise separable convolution
KW - Hand gesture recognition
KW - Hand segmentation
KW - Human–computer interaction
UR - http://www.scopus.com/inward/record.url?scp=85192081914&partnerID=8YFLogxK
U2 - 10.1007/s11760-024-03198-x
DO - 10.1007/s11760-024-03198-x
M3 - 期刊論文
AN - SCOPUS:85192081914
SN - 1863-1703
VL - 18
SP - 861
EP - 872
JO - Signal, Image and Video Processing
JF - Signal, Image and Video Processing
IS - Suppl 1
ER -