Compilers: principles, techniques, and tools
Compilers: principles, techniques, and tools
Clean up your Web pages with HP's HTML tidy
WWW7 Proceedings of the seventh international conference on World Wide Web 7
Speech and Language Processing: An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition
The Theory of Parsing, Translation, and Compiling
The Theory of Parsing, Translation, and Compiling
Offline Dictionary-Based Compression
DCC '99 Proceedings of the Conference on Data Compression
The harpy speech recognition system.
The harpy speech recognition system.
Identifying hierarchical structure in sequences: a linear-time algorithm
Journal of Artificial Intelligence Research
User modeling for personalized Web search with self-organizing map: Research Articles
Journal of the American Society for Information Science and Technology
Hi-index | 0.00 |
Automatically generated HTML, as produced by WYSIWYG programs, typically contains much repetitive and unnecessary markup. Thispaper identifies aspects of such HTML that may be altered whileleaving a semantically equivalent document, and proposes techniques to achieve optimizing modifications. These techniques include attribute re-arrangement via dynamic programming, the use of style classes, and dead-coderemoval. These techniques produce documents as small as 33% of original size. The size decreases obtained are still significant when the techniques are used in combination with conventional text-based compression.