A deep neural network for hand gesture recognition from RGB image in complex background

Tsung Han Tsai, Yuan Chen Ho, Po Ting Chi, Ting Jia Chen

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

Deep learning research has gained significant popularity recently, finding applications in various domains such as image preprocessing, segmentation, object recognition, and semantic analysis. Deep learning has gradually replaced traditional algorithms such as color-based methods, contour-based methods, and motion-based methods. In the context of hand gesture recognition, traditional algorithms heavily rely on depth information for accuracy, but their performance is often subpar. This paper introduces a novel approach using a deep neural network for hand gesture recognition, requiring only a single complementary metal oxide semiconductor (CMOS) camera to operate amidst complex backgrounds. The neural network design incorporates depthwise separable convolutional layers, dividing the model into segmentation and recognition components. As our proposed single-stage model, we avoid the use of the whole model and thus reduce the number of weights and calculations. Additionally, in the training phase, the data augmentation and iterative training strategy further increase recognition accuracy. The results show that the proposed work uses little parameter usage while still having a higher gesture recognition rate than the other works.

Original languageEnglish
Pages (from-to)861-872
Number of pages12
JournalSignal, Image and Video Processing
Volume18
Issue numberSuppl 1
DOIs
StatePublished - Aug 2024

Keywords

  • Attention model
  • Deep neural network
  • Depthwise separable convolution
  • Hand gesture recognition
  • Hand segmentation
  • Human–computer interaction

Fingerprint

Dive into the research topics of 'A deep neural network for hand gesture recognition from RGB image in complex background'. Together they form a unique fingerprint.

Cite this