TY - JOUR
T1 - Scene Classification, Data Cleaning, and Comment Summarization for Large-Scale Location Databases
AU - Cheng, Hsu Yung
AU - Yu, Chih Chang
N1 - Publisher Copyright:
© 2022 by the authors. Licensee MDPI, Basel, Switzerland.
PY - 2022/7/1
Y1 - 2022/7/1
N2 - This paper presents a framework that can automatically analyze the images and comments in user-uploaded location databases. The proposed framework integrates image processing and natural language processing techniques to perform scene classification, data cleaning, and comment summarization so that the cluttered information in user-uploaded databases can be presented in an organized way to users. For scene classification, RGB image features, segmentation features, and the features of discriminative objects are fused with an attention module to improve classification accuracy. For data cleaning, incorrect images are detected using a multilevel feature extractor and a multiresolution distance calculation scheme. Finally, a comment summarization scheme is proposed to overcome the problems of unstructured sentences and the improper usage of punctuation marks, which are commonly found in customer reviews. To validate the proposed framework, a system that can classify and organize scenes and comments for hotels is implemented and evalu-ated. Comparisons with existing related studies are also performed. The experimental results validate the effectiveness and superiority of the proposed framework.
AB - This paper presents a framework that can automatically analyze the images and comments in user-uploaded location databases. The proposed framework integrates image processing and natural language processing techniques to perform scene classification, data cleaning, and comment summarization so that the cluttered information in user-uploaded databases can be presented in an organized way to users. For scene classification, RGB image features, segmentation features, and the features of discriminative objects are fused with an attention module to improve classification accuracy. For data cleaning, incorrect images are detected using a multilevel feature extractor and a multiresolution distance calculation scheme. Finally, a comment summarization scheme is proposed to overcome the problems of unstructured sentences and the improper usage of punctuation marks, which are commonly found in customer reviews. To validate the proposed framework, a system that can classify and organize scenes and comments for hotels is implemented and evalu-ated. Comparisons with existing related studies are also performed. The experimental results validate the effectiveness and superiority of the proposed framework.
KW - deep learning
KW - image analysis
KW - image classification
KW - natural language processing
UR - http://www.scopus.com/inward/record.url?scp=85132260397&partnerID=8YFLogxK
U2 - 10.3390/electronics11131947
DO - 10.3390/electronics11131947
M3 - 期刊論文
AN - SCOPUS:85132260397
SN - 2079-9292
VL - 11
JO - Electronics (Switzerland)
JF - Electronics (Switzerland)
IS - 13
M1 - 1947
ER -