Action recognition using three dimension convolution and long short term memory

Yu Cheng Liu, Jian Jiun Ding, Yao Jen Chang, Chien Yao Wang, Jia Ching Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

The convolutional neural network (CNN) is more and more popular in computer vision and widely used in acoustic signal processing, image classification, and image segmentation. In this work, an architecture which is a combination of the 3-D convolutional neural network and the long short term memory (LSTM) was proposed for action recognition. It stacks the consecutive video frames, extracts spatial and time features, and trains the input dataset to achieve good recognition performance. Moreover, the LSTM model based on the relations among the frames in different time is adopted to consider the information of past frames. Simulations show that the proposed algorithm outperforms other neural network based methods and has even better performance for action recognition.

Original languageEnglish
Title of host publication2017 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages83-84
Number of pages2
ISBN (Electronic)9781509040179
DOIs
StatePublished - 25 Jul 2017
Event4th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017 - Taipei, United States
Duration: 12 Jun 201714 Jun 2017

Publication series

Name2017 IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017

Conference

Conference4th IEEE International Conference on Consumer Electronics - Taiwan, ICCE-TW 2017
Country/TerritoryUnited States
CityTaipei
Period12/06/1714/06/17

Fingerprint

Dive into the research topics of 'Action recognition using three dimension convolution and long short term memory'. Together they form a unique fingerprint.

Cite this