Zone Identification in Biology Articles as a Basis for Information Extraction

ADVERTISEMENT


(ZI) process in biology articles. Specifically, we illustrate the linguistic and other features of each zone based on our investigation of articles selected from four …

Abstract

Information extraction (IE) in the biomedical domain is now regarded as an essential technique for the dynamic management of factual information contained in archived journal articles and abstract collections. We aim to provide a technique serving as a basis for pin-pointing and organizing factual information related to experimental results. In this paper, we enhance the idea proposed in (Mizuta and Collier, 2004); annotating articles in terms of rhetorical zones with shallow nesting. We give a qualitative analysis of the zone identification (ZI) process in biology articles. Specifically, we illustrate the linguistic and other features of each zone based on our investigation of articles selected from four major online journals. We also discuss controversial cases and nested zones, and ZI using multiple features. In doing so, we provide a stronger theoretical and practical support for our framework toward automatic ZI.

Information extraction (IE) in the biomedical domain is now regarded as an essential technique for utilizing information contained in archived journal articles and abstract collections such as MEDLINE. Major domain databases often contain incomplete and inconsistent results. Also, a majority of the reported experimental results are only available in unstructured full-text format.
These being combined, scientists need to check with source journal articles to obtain and confirm factual information. Furthermore, they often need to start with document retrieval and face an overwhelming number of candidate articles. Thus, the significance of dynamic management of factual information, specifically an integration and update of experimental results, is self-evident. It would not only save researchers much time used for retrieval and redundant experiments but also help them use the information more effectively. Given the limitations of manual work in terms of both efficiency and accuracy, IE focusing on factual information is of critical importance.

Download Zone Identification in Biology Articles as a Basis for Information Extraction.pdf

Leave a Reply


Map: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67