Loading...

Low-cost scan-chain-based technique to recover multiple errors in TMR systems

Ebrahimi, M ; Sharif University of Technology | 2013

752 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/TVLSI.2012.2213102
  3. Publisher: 2013
  4. Abstract:
  5. In this paper, we present a scan-chain-based multiple error recovery technique for triple modular redundancy (TMR) systems (SMERTMR). The proposed technique reuses scan-chain flip-flops fabricated for testability purposes to detect and correct faulty modules in the presence of single or multiple transient faults. In the proposed technique, the manifested errors are detected at the modules' outputs, while the latent faults are detected by comparing the internal states of the TMR modules. Upon detection of any mismatch, the faulty modules are located and the state of a fault-free module is copied into the faulty modules. In case of detecting a permanent fault, the system is degraded to a master/checker configuration by disregarding the faulty module. FPGA-based fault injection experiments reveal that SMERTMR has the error detection and recovery coverage of 100% and 99.7% in the presence of single and two faulty modules, respectively, while imposing negligible area and performance overheads on the traditional TMR systems
  6. Keywords:
  7. Roll-forward error recovery ; Error detection and recovery ; Fault-tolerant designs ; Permanent faults ; Recovery techniques ; Roll-forward ; Scan chain ; Transient faults ; Triple modular redundancy ; Errors ; Fault tolerant computer systems ; Fault detection
  8. Source: IEEE Transactions on Very Large Scale Integration (VLSI) Systems ; Volume 21, Issue 8 , 2013 , Pages 1454-1468 ; 10638210 (ISSN)
  9. URL: http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6307891