TY - JOUR
T1 - Deep learning-based firework video pattern classification
AU - Kasthuri Arachchi, S. P.
AU - Shih, Timothy K.
AU - Lin, Chih Yang
AU - Wijayarathna, Gamini
N1 - Publisher Copyright:
© 2019 Taiwan Academic Network Management Committee. All rights reserved.
PY - 2019
Y1 - 2019
N2 - The video analytics technology has been a rapidly improving discipline in the past decade. With the recent developments in computer vision, we now have the ability to mine massive video data to obtain a clear understanding of what is happening in the world. Because of the remarkable successes of deep learning, currently, we are able to improve video analysis performance significantly than traditional statistical approaches. This study focuses on classifying the patterns of firework videos with various deep learning techniques using spatial and temporal features, beyond common types of pattern classifications. Among successful artificial neural networks, Convolutional Neural Networks (CNN) have demonstrated superiority on modeling high-level visual concepts, while Long Short-term Memory (LSTM) and Gated Recurrent Unit (GRU) units have shown great talent in modeling temporal dynamics in video-based pattern classification. Our basic models consist of CNN, LSTM, and GRU and, we did experiments by fine-tuning the parameters of layers and using different dropout values with sequence LSTM and GRU models. Our experimental results demonstrated that the model with a sequence of LSTM units and double dropout layers—one for input and another for hidden layers—outperforms the other experimental models with the training accuracy of 83.05%.
AB - The video analytics technology has been a rapidly improving discipline in the past decade. With the recent developments in computer vision, we now have the ability to mine massive video data to obtain a clear understanding of what is happening in the world. Because of the remarkable successes of deep learning, currently, we are able to improve video analysis performance significantly than traditional statistical approaches. This study focuses on classifying the patterns of firework videos with various deep learning techniques using spatial and temporal features, beyond common types of pattern classifications. Among successful artificial neural networks, Convolutional Neural Networks (CNN) have demonstrated superiority on modeling high-level visual concepts, while Long Short-term Memory (LSTM) and Gated Recurrent Unit (GRU) units have shown great talent in modeling temporal dynamics in video-based pattern classification. Our basic models consist of CNN, LSTM, and GRU and, we did experiments by fine-tuning the parameters of layers and using different dropout values with sequence LSTM and GRU models. Our experimental results demonstrated that the model with a sequence of LSTM units and double dropout layers—one for input and another for hidden layers—outperforms the other experimental models with the training accuracy of 83.05%.
KW - Convolution Neural Network
KW - Deep learning
KW - Fireworks
KW - Recurrent Neural Network
UR - http://www.scopus.com/inward/record.url?scp=85078187024&partnerID=8YFLogxK
U2 - 10.3966/160792642019122007002
DO - 10.3966/160792642019122007002
M3 - 期刊論文
AN - SCOPUS:85078187024
SN - 1607-9264
VL - 20
SP - 2033
EP - 2042
JO - Journal of Internet Technology
JF - Journal of Internet Technology
IS - 7
ER -