Improved convolutional neural network based scene classification using long short-term memory and label relations

Po Jen Chen, Jian Jiun Ding, Hung Wei Hsu, Chien Yao Wang, Jia Ching Wang

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Convolutional neural network (CNN) is more and more important in pattern recognition. In this work, we adopt label relations and long short-term memory (LSTM) to develop an accurate CNN-based scene classification algorithm. Traditional scene classification algorithms assume that labels are mutually exclusive. However, this is not reasonable when an image has a variety of objects and hence has multiple labels. In this work, we apply two label relations, which are exclusive and hierarchy relations, to improve the accuracy of multiple-label scene classification. For example, it is impossible that an image has both the labels of 'factory' and 'garden'. If the label 'factory' is assigned to an image, the probability that it has the label of 'garden' should be lowered. We also use image captioning to construct a scene classification model and propose an LSTM based method to further explore label relations and obtain more accurate results for scenic image labeling.

Original languageEnglish
Title of host publication2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages429-434
Number of pages6
ISBN (Electronic)9781538605608
DOIs
StatePublished - 5 Sep 2017
Event2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017 - Hong Kong, Hong Kong
Duration: 10 Jul 201714 Jul 2017

Publication series

Name2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017

Conference

Conference2017 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2017
Country/TerritoryHong Kong
CityHong Kong
Period10/07/1714/07/17

Keywords

  • Convolutional neural network
  • long short-term memory
  • machine learning
  • pattern recognition
  • scene classification

Fingerprint

Dive into the research topics of 'Improved convolutional neural network based scene classification using long short-term memory and label relations'. Together they form a unique fingerprint.

Cite this