Biomedical image analysis on a cooperative cluster of GPUs and multicores
Proceedings of the 22nd annual international conference on Supercomputing
MICCAI '08 Proceedings of the 11th International Conference on Medical Image Computing and Computer-Assisted Intervention, Part II
Attaining High Performance in General-Purpose Computations on Current Graphics Processors
High Performance Computing for Computational Science - VECPAR 2008
Stroma classification for neuroblastoma on graphics processors
International Journal of Data Mining and Bioinformatics
Fusion of fuzzy statistical distributions for classification of thyroid ultrasound patterns
Artificial Intelligence in Medicine
Run-time optimizations for replicated dataflows on heterogeneous environments
Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
Optimizing dataflow applications on heterogeneous environments
Cluster Computing
Hi-index | 0.00 |
Neuroblastoma is one of the most malignant childhood cancers affecting infants mostly. The current prognosis is based on microscopic examination of slides by expert pathologists, a process that is error-prone, time consuming and may lead to inter- and intra-reader variations. Therefore, we are developing a Computer Aided Prognosis (CAP) system which provides computerized image analysis t o assist pathologist in their prognosis. Since this system operates on relatively largescale images and requires sophisticated algorithms, it takes a long time to process whole-slide images. I n this paper, we propose a novel and eficient approach for the execution of a CAP system for neuroblastoma prognosis, using the graphics processing unit (GPU). B y leveraging high memory bandwidth and strong floating point operation capabilities of the GPU, our goal is t o achieve order of magnitude reduction in the overall execution time as compared t o that on a CPU alone. The proposed approach was tested on a set of testing images with a promising accuracy of 99.4% and an execution performance gain factor up to 45 times compared t o C++ code running on the CPU.