Abstract
This research investigated real-time fingertip detection in frames captured from the increasingly popular wearable device, smart glasses. Egocentric-view fingertip detection and character recognition can be used to create a novel way of entering text. We first employed Unity3D to build a synthetic dataset with pointing gestures from the first-person perspective. The obvious benefits of using synthetic data are that they eliminate the need for time-consuming and error-prone manual labeling and that they provide a large, high-quality dataset for a wide range of purposes. Following that, a modified Mask Regional Convolutional Neural Network (Mask R-CNN) is proposed, consisting of a region-based CNN for finger detection and a three-layer CNN for fingertip location. The process can be completed in 25 ms per frame for 640 × 480 RGB images, with an average error of 8.3 pixels. The speed is high enough to enable real-time “air-writing”, where users write characters in the air to input text or commands while wearing smart glasses. The characters can be recognized from the fingertip trajectories by a ResNet-based CNN. Experimental results demonstrate the feasibility of this novel methodology.
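The figures quoted in the abstract can be made concrete with a small sketch. It shows how a per-frame latency translates into a frame rate, how an average pixel error between predicted and labeled fingertip positions might be computed, and one plausible way to turn a fingertip trajectory into a small image for a character classifier. All function names and the rasterization step are assumptions made for illustration; the record does not describe how the authors preprocess trajectories or compute their error metric.

```python
import math


def frames_per_second(ms_per_frame):
    """Convert a per-frame latency in milliseconds to a frame rate."""
    return 1000.0 / ms_per_frame


def mean_pixel_error(predicted, ground_truth):
    """Mean Euclidean distance (in pixels) between paired (x, y) points."""
    distances = [math.dist(p, g) for p, g in zip(predicted, ground_truth)]
    return sum(distances) / len(distances)


def rasterize_trajectory(points, size=28):
    """Draw a fingertip trajectory onto a size x size binary grid.

    Hypothetical preprocessing: points are shifted and uniformly scaled to
    fit the grid, and consecutive points are joined by interpolated samples.
    """
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    min_x, min_y = min(xs), min(ys)
    # Use the larger extent so the aspect ratio is preserved.
    span = max(max(xs) - min_x, max(ys) - min_y) or 1.0
    scale = (size - 1) / span
    scaled = [((x - min_x) * scale, (y - min_y) * scale) for x, y in points]

    grid = [[0] * size for _ in range(size)]
    # Mark the first point so a single-point trajectory is still visible.
    grid[round(scaled[0][1])][round(scaled[0][0])] = 1
    for (x0, y0), (x1, y1) in zip(scaled, scaled[1:]):
        steps = max(1, math.ceil(max(abs(x1 - x0), abs(y1 - y0))))
        for i in range(steps + 1):
            t = i / steps
            col = round(x0 + t * (x1 - x0))
            row = round(y0 + t * (y1 - y0))
            grid[row][col] = 1
    return grid
```

With the latency stated in the abstract, `frames_per_second(25)` gives 40 frames per second. A grid produced by `rasterize_trajectory` could then be passed to an image classifier, although the network input actually used in the paper may differ.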
Original language | English |
---|---|
Article number | 4382 |
Journal | Sensors (Switzerland) |
Volume | 21 |
Issue number | 13 |
DOIs | |
State | Published - 1 Jul 2021 |
Keywords
- Air-writing
- Fingertip detection
- Region-based convolutional neural network
- Smart glasses
Fingerprint
Dive into the research topics of 'Egocentric-view fingertip detection for air writing based on convolutional neural networks†'. Together they form a unique fingerprint.
Projects
- 1 Finished
- A Deep-Learning-Based Visual Recognition Scheme for Taiwan Sign Language Training
  Su, P.-C. (PI)
  1/08/20 → 31/07/21
  Project: Research