Gaussian process based text categorization for healthy information

Sih Huei Chen, Yuan Shan Lee, Tzu Chiang Tai, Jia Ching Wang

研究成果: 書貢獻/報告類型會議論文篇章同行評審

2 引文 斯高帕斯(Scopus)

摘要

As the development of the medical technology, more and more people start to pay attention to their health. A large amount of health information can be easily obtained from the website now. Therefore, text categorization is important to analyze the information. In this work, we propose a system for text categorization that is based on a Gaussian process. Our proposed system involves the two parts- feature learning and classification. In the first part, we apply the latent Dirichlet allocation (LDA) to obtain the K latent topics proportion from each document. The K-dimensional vector is regarded as the feature of each document. In the classification part, a Gaussian process (GP) is utilized for the text categorization. 10 classes of text documents are categorized by the one-versus-one approach. The experimental results show that our proposed system performs well in text categorization, especially with the small size of training dataset.

原文???core.languages.en_GB???
主出版物標題Proceedings of 2015 International Conference on Orange Technologies, ICOT 2015
發行者Institute of Electrical and Electronics Engineers Inc.
頁面30-33
頁數4
ISBN(電子)9781467382373
DOIs
出版狀態已出版 - 22 6月 2016
事件3rd International Conference on Orange Technologies, ICOT 2015 - Hong Kong, Hong Kong
持續時間: 19 12月 201522 12月 2015

出版系列

名字Proceedings of 2015 International Conference on Orange Technologies, ICOT 2015

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???3rd International Conference on Orange Technologies, ICOT 2015
國家/地區Hong Kong
城市Hong Kong
期間19/12/1522/12/15

指紋

深入研究「Gaussian process based text categorization for healthy information」主題。共同形成了獨特的指紋。

引用此