Loading...

Improving Reliability and Durability of Phase Change Main Memories

Asadinia, Marjan | 2016

1353 Viewed
  1. Type of Document: M.Sc. Thesis
  2. Language: English
  3. Document No: 48426 (52)
  4. University: Sharif University of Technology, International Campus, Kish Island
  5. Department: Science and Engineering
  6. Advisor(s): Sarbazi Azad, Hamid
  7. Abstract:
  8. Dynamic Random Access Memory (DRAM) has been the leading main memory technology during the last four decades. In deep sub-micron regime, however, scaling DRAM comes with several challenges caused by charge leakage and imprecise charge placement. Phase Change Memory (PCM) technology is known as one of the most promising technologies to replace DRAM. Compared to competitive non-volatile memories, PCM benefits from best attributes of fast random access, negligible leakage energy, superior scalability, high density, and operating in both Single-level Cell (SLC) and Multi-level Cell (MLC) storage levels without imposing large storage overhead. Unfortunately, density advantage of MLC PCM devices comes at the cost of lower write endurance that results in fast wear-out of memory cells. Other preliminary concerns for PCM applicability are related to higher latency and energy consumption as well as low resilience to soft errors because of resistance drift. In this line, recent studies have proposed redirection or correction schemes to alleviate this problem, but all suffer from poor throughput and latency. None of the techniques proposed in the literature to improve the lifetime and reliability of PCM memories consider the impressive characteristic of PCMs to easily shape-shifting from SLC to MLC storage level using some negligible overhead of read/write circuits. We exploit this remarkable ability to propose some new techniques to improve the lifetime of PCMs. In this thesis, firstly, we show that one of the inefficiency sources of current schemes, even when wear-leveling algorithms are used, is the non-uniformity of write endurance across different cells incurred by process variation. That is why when some memory pages have reached their endurance limit in the PCM memory, other pages may be far from their limit. We present On-Demand Page Paired PCM (OD3P), a technique that mitigates the problem of fast failure of some pages by redirecting them onto some other healthy pages. During recovery, the target PCM page is converted to MLC to keep data of the two pages. Compared to the state-of-the-art error correction scheme, OD3P can improve PCM time-to-failure and system performance (IPC) by 12% and 14%, respectively, under multi-threaded and multi-program workloads. We then extend the durability of PCM pages by enabling line pairing within a page (line-level OD3P) and word-level pairing within a line (Intra-line level pairing). Next, we show that beside the process variation, unbalanced write traffic makes the memory cells wear-out even sooner. Therefore, we propose a byte-level shifting scheme, BLESS, that uses a simple shift mechanism for balancing the write traffic and error recovery purposes using the MLC capability of PCM. Simulation results show that our proposed scheme is life-time effective when tested under a wide range of workloads compared to state-of-the-art line-level and page-level schemes. On average, BLESS can improve the lifetime by 14-25% over the state-of-the-art schemes. Our schemes mainly rely on MLC capability of PCMs to recover hard errors but MLC storage level is prone to drifts. To address this problem, we propose Variable Resistance Spectrum MLC PCM (VR-PCM), a simple micro-architectural technique with more efficient drift-aware MLC PCM access operations. Using full-system evaluation of an MLC PCM main memory with conservative resistance drift model, we show that VR-PCM tailored for high-density MLCs can deliver considerable improvements in performance (13.25%), energy (21.2%), and lifetime (1.77x), on average
  9. Keywords:
  10. Phase Change Memory ; Performance ; Energy ; Resistance Drift ; Life Time ; Reliability Improvment ; Single Level Cell (SLC)Memory ; Multilevel Cell (MLC)Memory ; Hard Errors

 Digital Object List

 Bookmark

...see more