OLERA: Semisupervised Web-data extraction with visual support

Chia Hui Chang, Shih Chien Kuo

Research output: Contribution to journal › Review article › peer-review

71 Citations (Scopus)

Abstract

This article describes OLERA (On-Line Extraction Rule Analysis), a semi-supervised information extraction (IE) system proposed by Chia-Hui Chang of National Central University, Taiwan, and Shih-Chien Kuo of Trend Micro, Taiwan. The system allows users to train extraction rules from semistructured Web pages with minimal effort and without detailed annotation of the training documents. OLERA offers visual interaction by displaying discovered records in a spreadsheet-like table for schema assignment. It performs well on program-generated Web pages, requiring only a few training pages and limited user intervention.

Original language: English
Pages (from-to): 56-64
Number of pages: 9
Journal: IEEE Intelligent Systems
Volume: 19
Issue number: 6
Publication status: Published - Nov 2004
