Loading...
				
	
				
	
								
					
				
				
	
				
															
TA-LRW: A replacement policy for error rate reduction in stt-mram caches
Cheshmikhani, E ; Sharif University of Technology | 2019
				
											540
									
				Viewed
			
		- Type of Document: Article
- DOI: 10.1109/TC.2018.2875439
- Publisher: IEEE Computer Society , 2019
- Abstract:
- As technology process node scales down, on-chip SRAM caches lose their efficiency because of their low scalability, high leakage power, and increasing rate of soft errors. Among emerging memory technologies, Spin-Transfer Torque Magnetic RAM (STT-MRAM) is known as the most promising replacement for SRAM-based cache memories. The main advantages of STT-MRAM are its non-volatility, near-zero leakage power, higher density, soft-error immunity, and higher scalability. Despite these advantages, high error rate in STT-MRAM cells due to retention failure, write failure, and read disturbance threatens the reliability of cache memories built upon STT-MRAM technology. The error rate is significantly increased in higher temperature, which further affects the reliability of STT-MRAM-based cache memories. The major source of heat generation and temperature increase in STT-MRAM cache memories is write operations, which are managed by cache replacement policy. To the best of our knowledge, none of previous studies have attempted to mitigate heat generation and high temperature of STT-MRAM cache memories using replacement policy. In this paper, we first analyze the cache behavior in conventional Least-Recently Used (LRU) replacement policy and demonstrate that the majority of consecutive write operations (more than 66 percent) are committed to adjacent cache blocks. These adjacent write operations cause accumulated heat and increased temperature, which significantly increase the cache error rate. To eliminate heat accumulation and the adjacency of consecutive writes, we propose a cache replacement policy, named Thermal-Aware Least-Recently Written (TA-LRW), to smoothly distribute the generated heat by conducting consecutive write operations in distant cache blocks. TA-LRW guarantees the distance of at least three blocks for each two consecutive write operations in an 8-way associative cache. This distant write scheme reduces the temperature-induced error rate by 94.8 percent, on average, compared with the conventional LRU policy, which results in 6.9x reduction in cache error rate. The implementation cost and complexity of TA-LRW is as low as First-In, First-Out (FIFO) policy while providing a near-LRU performance, having the advantages of both replacement policies. The significantly reduced error rate is achieved by imposing only 2.3 percent performance overhead compared with the LRU policy. © 2018 IEEE
- Keywords:
- Cache memory ; Read disturbance ; Retention failure ; STT-MRAM ; Write failure ; Buffer storage ; Error correction ; Heat generation ; Magnetic recording ; Magnetic storage ; MRAM devices ; Outages ; Radiation hardening ; Scalability ; Static random access storage ; Cache replacement policy ; Emerging memory technologies ; Heat accumulation ; Increased temperature ; Replacement policy ; Spin transfer torque
- Source: IEEE Transactions on Computers ; Volume 68, Issue 3 , 2019 , Pages 455-470 ; 00189340 (ISSN)
- URL: https://ieeexplore.ieee.org/document/8489928
 
		