The nature of statistical learning theory
The nature of statistical learning theory
Picture Segmentation by a Tree Traversal Algorithm
Journal of the ACM (JACM)
SECRET: a scalable linear regression tree algorithm
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Hi-index | 0.00 |
This paper presents a framework named "Classifier Molding" that imitates arbitrary classifiers by linear regression trees so as to accelerate classification speed. This framework requires an accurate (but slow) classifier and large amount of training data. As an example of accurate classifier, we used the Compound Similarity Method (CSM) for Industrial Ink Jet Printer (IIJP) character recognition problem. The input-output relationship of trained CSM is imitated by a linear regression tree by providing a large amount of training data. For generating the training data, we developed a character pattern fluctuation method simulating the IIJP printing process. The learnt linear regression tree can be used as an accelerated classifier. Based on this classifier, we also developed Classification based Character Segmentation (CCS) method, which extracts character patterns from an image so as to maximize the total classification scores. Through extensive experiments, we confirmed that imitated classifiers are 1500 times faster than the original classifier without dropping the recognition rate and CCS method greatly corrects the segmentation errors of bottom-up segmentation method.