ICDAR '01 Proceedings of the Sixth International Conference on Document Analysis and Recognition
Automated Detection and Segmentation of Table of Contents Page from Document Images
ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Structuring documents according to their table of contents
Proceedings of the 2005 ACM symposium on Document engineering
Implementation of Content Analysis System for Recognition of Journals_ Table of Contents
ICDAR '07 Proceedings of the Ninth International Conference on Document Analysis and Recognition - Volume 02
Digital paper bookmarks: collaborative structuring, indexing and tagging of paper documents
CHI '08 Extended Abstracts on Human Factors in Computing Systems
On the Reading of Tables of Contents
DAS '08 Proceedings of the 2008 The Eighth IAPR International Workshop on Document Analysis Systems
iBookmark: locative texts and place-based authoring
CHI '09 Extended Abstracts on Human Factors in Computing Systems
Analysis of Book Documents' Table of Content Based on Clustering
ICDAR '09 Proceedings of the 2009 10th International Conference on Document Analysis and Recognition
Table of contents recognition for converting PDF documents in e-book formats
Proceedings of the 10th ACM symposium on Document engineering
Touch-Bookmark: a lightweight navigation and bookmarking technique for e-books
CHI '11 Extended Abstracts on Human Factors in Computing Systems
Searching online book documents and analyzing book citations
Proceedings of the 2013 ACM symposium on Document engineering
Hi-index | 0.00 |
ABSTRACT The task of extracting document structures from a digital e-book is difficult and is an active area of research. On the other hand, many e-books already have a table of contents (TOC) at the beginning of the document. This may lead us to believe that adding bookmarks into digital document (e-book) based on the existing TOC would be trivial. In this paper, we highlight the challenges involved in this task of automatically adding bookmarks to an existing e-book based on the TOC that exists within the document. If we are able to reliably identify the specific locations of each TOC entry within the document, the algorithms can be easily extended to identify document structures within e-books that have TOC. We describe a tool we have built called Booky that tries to add automatic PDF bookmarks to existing PDF based e-books as they have TOC as part of the document content. The tool addresses most of the challenges that have been identified while still leaving a few tricky scenarios still open.