TY - GEN
T1 - Devising a Cross-Domain Model to Detect Fake Review Comments
AU - Wei, Chen Shan
AU - Hsu, Ping Yu
AU - Huang, Chen Wan
AU - Cheng, Ming Shien
AU - Prassida, Grandys Frieska
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020
Y1 - 2020
N2 - The online reviews not only have huge impact on consumer shopping behavior but also online stores’ marketing strategy. Positive reviews will have positive influence for consumer’s buying decision. Therefore, some sellers want to boost their sales volume. They will hire spammers to write undeserving positive reviews to promote their products. Currently, some of the researches related to detection of fake reviews based on the text feature, the model will reach to high accuracy. However, the same model test on the other dataset the accuracy decrease sharply. Relevant researches had gradually explored the identification of fake reviews across different domains, whether the model built using comprehensive methods such as text features or neural networks, encountering the decreasing of accuracy. On the other hand, the method didn’t explain why the model can be applied to cross-domain predictions. In our research, we using the fake reviews and truthful reviews from Ott et al. (2011) and Li, Ott, Cardie, and Hovy (2014) in the three domain (hotel, restaurant, doctor). The cross-domain detect model based on Stimuli Organism Response (S-O-R) combine LIWC (Linguistic Inquiry and Word Count), add word2vec quantization feature, overcoming the decreasing accuracy situation. According to the research result, in the method one SOR calculation of feature weight of reviews, the DNN classification algorithm accuracy is 63.6%. In the method two, calculation of frequent features of word vectors, the random forest classification algorithm accuracy is 73.75%.
AB - The online reviews not only have huge impact on consumer shopping behavior but also online stores’ marketing strategy. Positive reviews will have positive influence for consumer’s buying decision. Therefore, some sellers want to boost their sales volume. They will hire spammers to write undeserving positive reviews to promote their products. Currently, some of the researches related to detection of fake reviews based on the text feature, the model will reach to high accuracy. However, the same model test on the other dataset the accuracy decrease sharply. Relevant researches had gradually explored the identification of fake reviews across different domains, whether the model built using comprehensive methods such as text features or neural networks, encountering the decreasing of accuracy. On the other hand, the method didn’t explain why the model can be applied to cross-domain predictions. In our research, we using the fake reviews and truthful reviews from Ott et al. (2011) and Li, Ott, Cardie, and Hovy (2014) in the three domain (hotel, restaurant, doctor). The cross-domain detect model based on Stimuli Organism Response (S-O-R) combine LIWC (Linguistic Inquiry and Word Count), add word2vec quantization feature, overcoming the decreasing accuracy situation. According to the research result, in the method one SOR calculation of feature weight of reviews, the DNN classification algorithm accuracy is 63.6%. In the method two, calculation of frequent features of word vectors, the random forest classification algorithm accuracy is 73.75%.
KW - Fake reviews
KW - Stimuli-Organism-Response (S-O-R) framework
KW - word2vec
UR - http://www.scopus.com/inward/record.url?scp=85097042315&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-63119-2_58
DO - 10.1007/978-3-030-63119-2_58
M3 - 會議論文篇章
AN - SCOPUS:85097042315
SN - 9783030631185
T3 - Communications in Computer and Information Science
SP - 714
EP - 725
BT - Advances in Computational Collective Intelligence - 12th International Conference, ICCCI 2020, Proceedings
A2 - Hernes, Marcin
A2 - Wojtkiewicz, Krystian
A2 - Szczerbicki, Edward
PB - Springer Science and Business Media Deutschland GmbH
T2 - 12th International Conference on International Conference on Computational Collective Intelligence, ICCCI 2020
Y2 - 30 November 2020 through 3 December 2020
ER -