Scene Classification, Data Cleaning, and Comment Summarization for Large-Scale Location Databases

Hsu Yung Cheng, Chih Chang Yu

研究成果: 雜誌貢獻期刊論文同行評審


This paper presents a framework that can automatically analyze the images and comments in user-uploaded location databases. The proposed framework integrates image processing and natural language processing techniques to perform scene classification, data cleaning, and comment summarization so that the cluttered information in user-uploaded databases can be presented in an organized way to users. For scene classification, RGB image features, segmentation features, and the features of discriminative objects are fused with an attention module to improve classification accuracy. For data cleaning, incorrect images are detected using a multilevel feature extractor and a multiresolution distance calculation scheme. Finally, a comment summarization scheme is proposed to overcome the problems of unstructured sentences and the improper usage of punctuation marks, which are commonly found in customer reviews. To validate the proposed framework, a system that can classify and organize scenes and comments for hotels is implemented and evalu-ated. Comparisons with existing related studies are also performed. The experimental results validate the effectiveness and superiority of the proposed framework.

期刊Electronics (Switzerland)
出版狀態已出版 - 1 7月 2022


深入研究「Scene Classification, Data Cleaning, and Comment Summarization for Large-Scale Location Databases」主題。共同形成了獨特的指紋。