Making a digital library: the contents of the CORE project

Authors:
Richard Entlich; Jan Olsen; Lorrin Garson; Michael Lesk; Lorraine Normore; Stuart Weibel
Affiliations:
Cornell Univ., Ithaca, NY; Cornell Univ., Ithaca, NY; American Chemical Society; Bellcore; Chemical Abstracts Service; OCLC, Dublin, OH
Venue:
ACM Transactions on Information Systems (TOIS)
Year:
1997


Abstract

The CORE (Chemical Online Retrieval Experiment) project is a library of primary journal articles in chemistry. Any library has an inside and an outside; in this article we describe the inside of the library and the methods for building the system and accumulating the database. A later article will describe the outside (user experiences). Among electronic-library projects, the CORE project is unusual in that it has both ASCII derived from typesetting and image data for all its pages, and among experimental electronic-library projects, it is unusually large. We describe here (a) the processes of scanning and analyzing about 400,000 pages of primary journal material, (b) the conversion of a similar amount of textual database material, (c) the linking of these two data sources, and (d) the indexing of the text material.