Loading...

Enhancing Reliability of STT-MRAM Caches by Eliminating Read Disturbance Accumulation

Cheshmikhani, E ; Sharif University of Technology | 2019

471 Viewed
  1. Type of Document: Article
  2. DOI: 10.23919/DATE.2019.8714946
  3. Publisher: Institute of Electrical and Electronics Engineers Inc , 2019
  4. Abstract:
  5. Spin-Transfer Torque Magnetic RAM (STT-MRAM) as one of the most promising replacements for SRAMs in on-chip cache memories benefits from higher density and scalability, near-zero leakage power, and non-volatility, but its reliability is threatened by high read disturbance error rate. Error-Correcting Codes (ECCs) are conventionally suggested to overcome the read disturbance errors in STT-MRAM caches. By employing aggressive ECCs and checking out a cache block on every read access, a high level of cache reliability is achieved. However, to minimize the cache access time in modern processors, all blocks in the target cache set are simultaneously read in parallel for tags comparison operation and only the requested block is sent out, if any, after checking its ECC. These extra cache block reads without checking their ECCs until requesting the blocks by the processor cause the accumulation of read disturbance error, which significantly degrades the cache reliability. In this paper, we first introduce and formulate the read disturbance accumulation phenomenon and reveal that this accumulation due to conventional parallel accesses of cache blocks significantly increases the cache error rate. Then, we propose a simple yet effective scheme, so-called Read Error Accumulation Preventer cache (REAP-cache) to completely eliminate the accumulation of read disturbances without compromising the cache performance. Our evaluations show that the proposed REAP-cache extends the cache Mean Time To Failure (MTTF) by 171x, while increases the cache area by less than 1% and energy consumption by only 2.7%
  6. Keywords:
  7. Cache ; Error Rate ; Error-Correcting Code (ECC) ; Read Disturbance ; STT-MRAM ; Cache memory ; Energy utilization ; Error correction ; Magnetic leakage ; Magnetic recording ; Parallel processing systems ; Reliability ; Static random access storage ; Cache ; Error correcting code ; MRAM devices
  8. Source: 22nd Design, Automation and Test in Europe Conference and Exhibition, DATE 2019, 25 March 2019 through 29 March 2019 ; Pages 854-859 , 2019 , Pages 854-859 ; 9783981926323 (ISBN)
  9. URL: https://ieeexplore.ieee.org/document/8714946/authors#authors