Towards CST-enhanced summarization
Eighteenth national conference on Artificial intelligence
X-tract: Structure Extraction from Botanical Textual Descriptions
SPIRE '99 Proceedings of the String Processing and Information Retrieval Symposium & International Workshop on Groupware
Augmenting Naive Bayes Classifiers with Statistical Language Models
Information Retrieval
A common theory of information fusion from multiple text sources step one: cross-document structure
SIGDIAL '00 Proceedings of the 1st SIGdial workshop on Discourse and dialogue - Volume 10
The reusability of induced knowledge for the automatic semantic markup of taxonomic descriptions
Journal of the American Society for Information Science and Technology
Integrating and querying parallel leaf shape descriptions
ISWC'06 Proceedings of the 5th international conference on The Semantic Web
Hi-index | 0.00 |
Providing a single access point to an information system from multiple documents is helpful for biodiversity researchers as it is true in many fields. It not only saves the time for going back and forth from different sources but also provides the opportunity to generate new information out of the complementary information in different sources and levels of description. This paper investigates the potential of information fusion techniques in biodiversity area since the researchers in this domain desperately need information from different sources to verify their decision. In another sense, there are massive amounts of collections in this area. It is not easy or even possible for the researcher to manually collect information from different places. The proposed system contains 4 steps: Text segmentation and Taxonomic Name Identification, Organ-level and Sub-organ level Information Extraction, Relationship Identification, and Information fusion. Information fusion is based on the seven out of the twenty-four relationships in CST (Cross-document Sentence Theory). We argue that this kind of information fusion system might not only save the researchers the time for going back and forth from different sources but also provides the opportunity to generate new information out of the complementary information in different sources and levels.