A knowledge-based approach to citation extraction

Min Yuh Day, Tzong Han Tsai, Cheng Lung Sung, Cheng Wei Lee, Shih Hung Wu, Chorng Shyong Ong, Wen Lian Hsu

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

28 Scopus citations

Abstract

Integration of the bibliographical information of scholarly publications available on the Internet is an important task in academic research. To accomplish this task, accurate reference metadata extraction for scholarly publications is essential for the integration of information from heterogeneous reference sources. In this paper, we propose a knowledge-based approach to literature mining and focus on reference metadata extraction methods for scholarly publications. We adopt an ontological knowledge representation framework called INFOMAP to automatically extract the reference metadata. The experimental results show that, by using INFOMAP, we can extract author, title, journal, volume, number (issue), year, and page information from different reference styles with a high degree of accuracy. The overall average field accuracy of citation extraction for a Bioinformatics dataset is 97.87% for six reference styles.

Original languageEnglish
Title of host publicationProceedings of the 2005 IEEE International Conference on Information Reuse and Integration, IRI - 2005
Pages50-55
Number of pages6
DOIs
StatePublished - 2005
Event2005 IEEE International Conference on Information Reuse and Integration, IRI - 2005 - Las Vegas, NV, United States
Duration: 15 Aug 200517 Aug 2005

Publication series

NameProceedings of the 2005 IEEE International Conference on Information Reuse and Integration, IRI - 2005
Volume2005

Conference

Conference2005 IEEE International Conference on Information Reuse and Integration, IRI - 2005
Country/TerritoryUnited States
CityLas Vegas, NV
Period15/08/0517/08/05

Keywords

  • Citation extraction
  • INFOMAP
  • Knowledge-based
  • Ontology

Fingerprint

Dive into the research topics of 'A knowledge-based approach to citation extraction'. Together they form a unique fingerprint.

Cite this