Handwritten Character Classification Using Nearest Neighbor in Large Databases

Authors:
S. J. Smith;M. O. Bourgoin;K. Sims;H. L. Voorhees
Affiliations:
-;-;-;-
Venue:
IEEE Transactions on Pattern Analysis and Machine Intelligence
Year:
1994

Citing 8
Cited 16

Toward memory-based reasoning

Communications of the ACM - Special issue on parallelism
On the Recognition of Printed Characters of Any Font and Size

IEEE Transactions on Pattern Analysis and Machine Intelligence
Computer recognition of totally unconstrained handwritten zip codes

International Journal of Pattern Recognition and Artificial Intelligence
Parallel distance transforms on pyramid machines: theory and implementation

Signal Processing
Instance-Based Learning Algorithms

Machine Learning
Trading MIPS and memory for knowledge engineering

Communications of the ACM
Algorithms for Graphics and Imag

Algorithms for Graphics and Imag
Roles of Knowledge in Motor Learning

Roles of Knowledge in Motor Learning

Off-Line, Handwritten Numeral Recognition by Perturbation Method

IEEE Transactions on Pattern Analysis and Machine Intelligence
Joint Induction of Shape Features and Tree Classifiers

IEEE Transactions on Pattern Analysis and Machine Intelligence
A structural/statistical feature based vector for handwritten character recognition

Pattern Recognition Letters
An Evaluation of Parallel Thinning Algorithms for Character Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Training Set Expansion in Handwritten Character Recognition

Proceedings of the Joint IAPR International Workshop on Structural, Syntactic, and Statistical Pattern Recognition
Fast and Accurate Handwritten Character Recognition Using Approximate Nearest Neighbours Search on Large Databases

Proceedings of the Joint IAPR International Workshops on Advances in Pattern Recognition
Recognition of Cursive Roman Handwriting - Past, Present and Future

ICDAR '03 Proceedings of the Seventh International Conference on Document Analysis and Recognition - Volume 1
Deformation Models for Image Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
An associative memory-based learning model with an efficient hardware implementation in FPGA

Expert Systems with Applications: An International Journal
Massive character recognition with a large ground-truthed database

Proceedings of the 2011 ACM Symposium on Applied Computing
The fast and the flexible: extended pseudo two-dimensional warping for face recognition

IbPRIA'11 Proceedings of the 5th Iberian conference on Pattern recognition and image analysis
An efficient feature extraction method for handwritten character recognition

SEMCCO'11 Proceedings of the Second international conference on Swarm, Evolutionary, and Memetic Computing - Volume Part II
Image warping for face recognition: From local optimality towards global optimization

Pattern Recognition
Training of an on-line handwritten Japanese character recognizer by artificial patterns

Pattern Recognition Letters
DIMO: distributed index for matching multimedia objects using MapReduce

Proceedings of the 5th ACM Multimedia Systems Conference
k-NN classification of handwritten characters via accelerated GAT correlation

Pattern Recognition

Quantified Score

Hi-index	0.15

Visualization

Abstract

Shows that systems built on a simple statistical technique and a large training database can be automatically optimized to produce classification accuracies of 99% in the domain of handwritten digits. It is also shown that the performance of these systems scale consistently with the size of the training database, where the error rate is cut by more than half for every tenfold increase in the size of the training set from 10 to 100,000 examples. Three distance metrics for the standard nearest neighbor classification system are investigated: a simple Hamming distance metric, a pixel distance metric, and a metric based on the extraction of penstroke features. Systems employing these metrics were trained and tested on a standard, publicly available, database of nearly 225,000 digits provided by the National Institute of Standards and Technology. Additionally, a confidence metric is both introduced by the authors and also discovered and optimized by the system. The new confidence measure proves to be superior to the commonly used nearest neighbor distance.