Stereo matching using intra- and inter-row dynamic programming
Pattern Recognition Letters
Improving memory performance of sorting algorithms
Journal of Experimental Algorithmics (JEA)
Matching and Retrieval of Distorted and Occluded Shapes Using Dynamic Programming
IEEE Transactions on Pattern Analysis and Machine Intelligence
Representation and Detection of Deformable Shapes
IEEE Transactions on Pattern Analysis and Machine Intelligence
Chip Multithreading: Opportunities and Challenges
HPCA '05 Proceedings of the 11th International Symposium on High-Performance Computer Architecture
Shape Matching and Object Recognition Using Low Distortion Correspondences
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
Stereo Correspondence by Dynamic Programming on a Tree
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 2 - Volume 02
Multiclass Object Recognition with Sparse, Localized Features
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 1
Design and Performance of a Fault-Tolerant Real-Time CORBA Event Service
ECRTS '06 Proceedings of the 18th Euromicro Conference on Real-Time Systems
SVM-KNN: Discriminative Nearest Neighbor Classification for Visual Category Recognition
CVPR '06 Proceedings of the 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition - Volume 2
Cache-Friendly implementations of transitive closure
Journal of Experimental Algorithmics (JEA)
Computer Vision and Image Understanding
Evaluating MapReduce for Multi-core and Multiprocessor Systems
HPCA '07 Proceedings of the 2007 IEEE 13th International Symposium on High Performance Computer Architecture
Groups of Adjacent Contour Segments for Object Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence
Amdahl's Law in the Multicore Era
Computer
A Multimodal Constellation Model for Object Category Recognition
MMM '09 Proceedings of the 15th International Multimedia Modeling Conference on Advances in Multimedia Modeling
Shape-Based Object Localization for Descriptive Classification
International Journal of Computer Vision
Object Categorization Based on Kernel Principal Component Analysis of Visual Words
WACV '08 Proceedings of the 2008 IEEE Workshop on Applications of Computer Vision
NPC'10 Proceedings of the 2010 IFIP international conference on Network and parallel computing
Object Recognition using Full Pixel Matching
CIT '10 Proceedings of the 2010 10th IEEE International Conference on Computer and Information Technology
ULCC: a user-level facility for optimizing shared cache performance on multicores
Proceedings of the 16th ACM symposium on Principles and practice of parallel programming
CAB: Cache Aware Bi-tier Task-Stealing in Multi-socket Multi-core Architecture
ICPP '11 Proceedings of the 2011 International Conference on Parallel Processing
Hi-index | 0.00 |
Image recognition is a very useful technique that can be applied in many areas. Two-Dimensional Continuous Dynamic Programming (2DCDP) is a pixel-level matching algorithm for object recognition. Compared with other methods, 2DCDP can offer a sufficiently high accuracy of recognition without training. In our previous work we use 2DCDP to implement image classification. However, we find the processing speed of 2DCDP is very slow. In this paper, we first analyze the performance issue of 2DCDP algorithm, and point out that large memory consumption is the performance bottleneck. Then, we improve 2DCDP algorithm and propose a new object recognition algorithm named Pixel-based Multi-Anchor (PMA) algorithm, which can locate anchor points that can be further used to locate the recognized area. Theoretical analysis expresses that our new algorithm can effectively reduce memory capacity requirement from O(n4) to O(n3), where n is the size of image. Furthermore, based on the understanding of multi-core architecture, we propose a fine-grained parallelism thread model to parallelize our PMA algorithm on mutli-core systems. Especially we take cache coherence problem into account, such that we further propose a coarse-grained parallelism thread model to optimize the PMA performance. Experimental results show that compared with the original 2DCDP algorithm, our new PMA algorithm can decrease the memory capacity requirement dramatically which improves the recognition speed. More important, PMA algorithm can processes efficiently big images that exceed the ability of original 2DCDP algorithm.