CiteSeer: an automatic citation indexing system
Proceedings of the third ACM conference on Digital libraries
Automatic classification of Web resources using Java and Dewey decimal classification
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Scalable collection summarization and selection
Proceedings of the fourth ACM conference on Digital libraries
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Predicting library of congress classifications from library of congress subject headings
Journal of the American Society for Information Science and Technology
Browsing and searching behavior in the renardus web service a study based on log analysis
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
A comparative study of two automatic document classification methods in a library setting
Journal of Information Science
Journal of Information Science
Hi-index | 0.00 |
With the significant growth in the number of available electronic documents on the Internet, intranets, and digital libraries, the need for developing effective methods and systems to index and organize E-documents is felt more than ever. In this paper we introduce a new method for automatic text classification for categorizing E-documents by utilizing classification metadata of books, journals and other library holdings, that already exists in online catalogues of libraries. The method is based on identifying all references cited in a given document and, using the classification metadata of these references as catalogued in a physical library, devising an appropriate class for the document itself according to a standard library classification scheme with the help of a weighting mechanism. We have demonstrated the application of the proposed method and assessed its performance by developing a prototype classification system for classifying electronic syllabus documents archived in the Irish National Syllabus Repository according to the well-known Dewey Decimal Classification (DDC) scheme.