摘要
In this paper, we construct a biomedical semantic role labeling (SRL) system that can be used to facilitate relation extraction. First, we construct a proposition bank on top of the popular biomedical GENIA treebank following the PropBank annotation scheme. We only annotate the predicate-argument structures (PAS's) of thirty frequently used biomedical predicates and their corresponding arguments. Second, we use our proposition bank to train a biomedical SRL system, which uses a maximum entropy (ME) model. Thirdly, we automatically generate argument-type templates which can be used to improve classification of biomedical argument types. Our experimental results show that a newswire SRL system that achieves an F-score of 86.29% in the newswire domain can maintain an F-score of 64.64% when ported to the biomedical domain. By using our annotated biomedical corpus, we can increase that F-score by 22.9%. Adding automatically generated template features further increases overall F-score by 0.47% and adjunct arguments (AM) Fscore by 1.57%, respectively.
原文 | ???core.languages.en_GB??? |
---|---|
頁面 | 57-64 |
頁數 | 8 |
出版狀態 | 已出版 - 2006 |
事件 | HLT-NAACL 2006 Workshop on Linking Natural Language Processing and Biology: Towards Deeper Biological Literature Analysis, BioNLP 2006 - New York, United States 持續時間: 8 6月 2006 → … |
???event.eventtypes.event.conference???
???event.eventtypes.event.conference??? | HLT-NAACL 2006 Workshop on Linking Natural Language Processing and Biology: Towards Deeper Biological Literature Analysis, BioNLP 2006 |
---|---|
國家/地區 | United States |
城市 | New York |
期間 | 8/06/06 → … |