TY - JOUR
T1 - New challenges for biological text-mining in the next decade
AU - Dai, Hong Jie
AU - Chang, Yen Ching
AU - Tsai, Richard Tzong-Han
AU - Hsu, Wen Lian
N1 - Funding Information:
This work was supported by the National Science Council under Grant Nos. NSC 97-2218-E-155-001 and NSC96-2752-E-001-001-PAE, the Research Center for Humanities and Social Sciences, and the Thematic Program of Academia Sinica under Grant No. AS95ASIA02. © 2010 Springer Science + Business Media, LLC & Science Press, China
PY - 2010/1
Y1 - 2010/1
N2 - The massive flow of scholarly publications from traditional paper journals to online outlets has benefited biologists because online articles are easy to access. However, due to the sheer volume of available biological literature, researchers are finding it increasingly difficult to locate the information they need. As a result, recent biology contests, notably JNLPBA and BioCreAtIvE, have focused on evaluating various methods by which the literature may be navigated. Among these methods, text-mining technology has shown the most promise. With recent advances in text-mining technology and the fact that publishers are now making the full texts of articles available in XML format, text-mining systems (TMSs) can be adapted to accelerate literature curation, maintain the integrity of information, and ensure proper linkage of data to other resources. Even so, several new challenges have emerged in relation to full-text analysis, life-science terminology, complex relation extraction, and information fusion. These challenges must be overcome for text mining to be more effective. In this paper, we identify the challenges, discuss how they might be overcome, and consider the resources that may be helpful in achieving that goal.
AB - The massive flow of scholarly publications from traditional paper journals to online outlets has benefited biologists because online articles are easy to access. However, due to the sheer volume of available biological literature, researchers are finding it increasingly difficult to locate the information they need. As a result, recent biology contests, notably JNLPBA and BioCreAtIvE, have focused on evaluating various methods by which the literature may be navigated. Among these methods, text-mining technology has shown the most promise. With recent advances in text-mining technology and the fact that publishers are now making the full texts of articles available in XML format, text-mining systems (TMSs) can be adapted to accelerate literature curation, maintain the integrity of information, and ensure proper linkage of data to other resources. Even so, several new challenges have emerged in relation to full-text analysis, life-science terminology, complex relation extraction, and information fusion. These challenges must be overcome for text mining to be more effective. In this paper, we identify the challenges, discuss how they might be overcome, and consider the resources that may be helpful in achieving that goal.
KW - Bioinformatics database
KW - Mining method and algorithm
KW - Text mining
UR - http://www.scopus.com/inward/record.url?scp=75749095148&partnerID=8YFLogxK
U2 - 10.1007/s11390-010-9313-5
DO - 10.1007/s11390-010-9313-5
M3 - Journal article
AN - SCOPUS:75749095148
SN - 1000-9000
VL - 25
SP - 169
EP - 179
JO - Journal of Computer Science and Technology
JF - Journal of Computer Science and Technology
IS - 1
ER -