An evaluation of the formal concept analysis-based document vector on document clustering

Jihn Chang Jehng, Shihchieh Chou, Chin Yi Cheng, Jia Sheng Heh

研究成果: 書貢獻/報告類型會議論文篇章同行評審

2 引文 斯高帕斯(Scopus)

摘要

In conventional approaches, documents are represented by the vector whose dimensionalities are equivalent to the terms extracted from a document set. These approaches, called bag-of-term approaches, ignore the conceptual relationships between terms such as synonyms, hypernyms and hyponyms. In the past, researches have applied thesauri such as Word Net to solve this problem. However, thesauri such as Word Net are developed more for general purposes and are limited in specific domain. Therefore, an automatically built ontology for terms is desired. In our previous study, we proposed a method which applies formal concept analysis (FCA), an automatic ontology building method, to extract the term relationships from a document set, and then apply the extracted information as the ontology of terms to represent the documents as concept vectors. In order to evaluate the usability and effectiveness of the proposed method for information retrieval related applications, we employed the concept vectors generated for the documents to the document clustering. In this study, we apply bisecting k-means clustering and hierarchical agglomerative clustering as the platforms with which to evaluate our method.

原文???core.languages.en_GB???
主出版物標題Proceedings - 2011 International Conference on Computational Science and Its Applications, ICCSA 2011
頁面207-210
頁數4
DOIs
出版狀態已出版 - 2011
事件11th International Conference on Computational Science and Its Applications, ICCSA 2011 - Santander, Spain
持續時間: 20 6月 201123 6月 2011

出版系列

名字Proceedings - 2011 International Conference on Computational Science and Its Applications, ICCSA 2011

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???11th International Conference on Computational Science and Its Applications, ICCSA 2011
國家/地區Spain
城市Santander
期間20/06/1123/06/11

指紋

深入研究「An evaluation of the formal concept analysis-based document vector on document clustering」主題。共同形成了獨特的指紋。

引用此