Statistical Models for Text Segmentation
Machine Learning - Special issue on natural language learning
Advances in domain independent linear text segmentation
NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
Semantic annotation of biosystematics literature without training examples
Journal of the American Society for Information Science and Technology
From text to RDF triple store: an application for biodiversity literature
Proceedings of the 73rd ASIS&T Annual Meeting on Navigating Streams in an Information Ecosystem - Volume 47
Hi-index | 0.00 |
This paper describes a simple, unsupervised bootstrapping procedure that identifies morphological description segments from heterogeneous biodiversity document collections. While the procedure is used to preprocess biodiversity literature for semantic annotation of morphological descriptions in our project, it also can be used to crawl the Web for morphological descriptions for a biodiversity niche search engine.