The design and analysis of efficient lossless data compression systems
The design and analysis of efficient lossless data compression systems
Text Mining: A New Frontier for Lossless Compression
DCC '99 Proceedings of the Conference on Data Compression
An Open Interface for Probabilistic Models of Text
DCC '99 Proceedings of the Conference on Data Compression
Using Compression to Identify Acronyms in Text
DCC '00 Proceedings of the Conference on Data Compression
Correcting English Text Using PPM Models
DCC '98 Proceedings of the Conference on Data Compression
Switching Between Two Universal Source Coding Algorithms
DCC '98 Proceedings of the Conference on Data Compression
On-line stochastic processes in data compression
On-line stochastic processes in data compression
The context-tree weighting method: basic properties
IEEE Transactions on Information Theory
Universal Text Preprocessing for Data Compression
IEEE Transactions on Computers
A new ppm variant for chinese text compression
Natural Language Engineering
Identification of gene function using prediction by partial matching (PPM) language models
Proceedings of the 17th ACM conference on Information and knowledge management
A fast and efficient nearly-optimal adaptive Fano coding scheme
Information Sciences: an International Journal
A high performance centroid-based classification approach for language identification
Pattern Recognition Letters
Hi-index | 0.00 |
Abstract: This paper introduces a novel switching method which can be used to combine two or more PPM models. The work derives from our earlier work on modeling English and text mining, and the approach takes advantage of both to help improve the compression performance significantly. The performance of the combination of models is at least as good as (and in many cases significantly better than) the best performed of the individual models.