TY - JOUR
T1 - Using conditional random fields for result identification in biomedical abstracts
AU - Lin, Ryan T.K.
AU - Dai, Hong Jie
AU - Bow, Yue Yang
AU - Chiu, Justin Liang Te
AU - Tsai, Richard Tzong Han
PY - 2009
Y1 - 2009
N2 - The abstracts of biomedical papers usually contain three sections: objective, methods, and results-conclusion. The results-conclusion section is the most important because it usually describes the main contribution of a paper. Unfortunately, not all biomedical journals follow this three-section format. In this paper, we propose a machine learning (ML) based approach to automatically identify the results-conclusion section. The results-conclusion section identification problem is formulated as a sequence labeling task. Four feature sets, including Position, Named Entity, Tense, and Word Frequency, are employed with Conditional Random Fields (CRFs) as the underlying ML model. The experiment results show that the proposed approach can achieve F-measure, precision, and recall of 97.08%, 96.63% and 97.53%, respectively.
AB - The abstracts of biomedical papers usually contain three sections: objective, methods, and results-conclusion. The results-conclusion section is the most important because it usually describes the main contribution of a paper. Unfortunately, not all biomedical journals follow this three-section format. In this paper, we propose a machine learning (ML) based approach to automatically identify the results-conclusion section. The results-conclusion section identification problem is formulated as a sequence labeling task. Four feature sets, including Position, Named Entity, Tense, and Word Frequency, are employed with Conditional Random Fields (CRFs) as the underlying ML model. The experiment results show that the proposed approach can achieve F-measure, precision, and recall of 97.08%, 96.63% and 97.53%, respectively.
KW - Conditional random fields
KW - Result identification
KW - Sequence labeling
UR - http://www.scopus.com/inward/record.url?scp=70449109756&partnerID=8YFLogxK
U2 - 10.3233/ICA-2009-0321
DO - 10.3233/ICA-2009-0321
M3 - 期刊論文
AN - SCOPUS:70449109756
SN - 1069-2509
VL - 16
SP - 339
EP - 352
JO - Integrated Computer-Aided Engineering
JF - Integrated Computer-Aided Engineering
IS - 4
ER -