Inductive learning algorithms and representations for text categorization
Proceedings of the seventh international conference on Information and knowledge management
Power to the people: end-user building of digital library collections
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
The Greenstone plugin architecture
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
Domain-Specific Keyphrase Extraction
IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Using Compression to Identify Acronyms in Text
DCC '00 Proceedings of the Conference on Data Compression
A new framework for building digital library collections
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Hi-index | 0.00 |
Flexible digital library systems need to be able to accept, or "import," documents and metadata in a variety of forms, and associate metadata with the appropriate documents. This paper analyzes the requirements of the import process for general digital libraries. The requirements include (a) format conversion for source documents, (b) the ability to incorporate existing conversion utilities, (c) provision for metadata to be specified in the document files themselves and/or in separate metadata files, (d) format conversion for metadata files, (e) provision for metadata to be computed from the document content, and (f) flexible ways of associating metadata with documents or sets of documents. We argue that these requirements are so open-ended that they are best met by an extensible architecture that facilitates the addition of new document formats and metadata facilities to existing digital library systems. An implementation of this architecture is briefly described.