Reliability-aware core partitioning in chip multiprocessors
Journal of Systems Architecture: the EUROMICRO Journal
Thread vulnerability in parallel applications
Journal of Parallel and Distributed Computing
To improve the reliability of the single-chip multiprocessor (CMP), this paper proposes a fault-tolerant CMP architecture that incorporates simultaneous multithreading (SMT) to detect transient faults and to recover automatically at the thread level. By adopting simple strategies and a small amount of extra hardware to implement fault tolerance, the architecture attains wider fault coverage and improves the performance of the fault-tolerant CMP.
Keywords: CMP, SMT, fault tolerance, thread
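As a rough illustration of the detect-and-recover idea summarized in the abstract, the following Python sketch runs a task redundantly, compares the two results, and re-executes on a mismatch. It is a software analogy under stated assumptions, not the hardware mechanism proposed in the paper: the names `run_redundant`, `inject_transient_fault`, and `max_retries` are hypothetical, and the abstract does not specify how comparison or recovery is actually performed.

```python
from typing import Callable, Optional


def run_redundant(task: Callable[[], int],
                  trailing: Optional[Callable[[], int]] = None,
                  max_retries: int = 3) -> int:
    """Run two copies of a task, compare results, re-execute on mismatch.

    Assumption: a transient fault corrupts at most one copy and does not
    persist across re-execution, so a retry eventually yields agreement.
    """
    trailing = trailing or task
    for _ in range(max_retries + 1):
        lead_result = task()        # leading copy
        trail_result = trailing()   # redundant (trailing) copy
        if lead_result == trail_result:
            return lead_result      # results agree: no fault detected
        # Mismatch: treat as a detected transient fault and re-execute.
    raise RuntimeError("results still disagree after retries")


def inject_transient_fault(task: Callable[[], int],
                           faulty_call: int) -> Callable[[], int]:
    """Wrap a task so that only its `faulty_call`-th invocation is corrupted."""
    calls = {"n": 0}

    def wrapped() -> int:
        calls["n"] += 1
        result = task()
        # Corrupt the result on exactly one invocation (hypothetical fault model).
        return result + 1 if calls["n"] == faulty_call else result

    return wrapped


def compute() -> int:
    """Stand-in workload for one thread's computation."""
    return sum(range(10))


if __name__ == "__main__":
    flaky = inject_transient_fault(compute, faulty_call=1)
    print(run_redundant(compute, flaky))
```

In this sketch, recovery is simply re-running the whole task; a real design would more plausibly roll back to a checkpoint of thread state, which is outside what the abstract describes.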