Loading...

Effective cache bank placement for GPUs

Sadrosadati, M ; Sharif University of Technology | 2017

698 Viewed
  1. Type of Document: Article
  2. DOI: 10.23919/DATE.2017.7926954
  3. Publisher: Institute of Electrical and Electronics Engineers Inc , 2017
  4. Abstract:
  5. The placement of the Last Level Cache (LLC) banks in the GPU on-chip network can significantly affect the performance of memory-intensive workloads. In this paper, we attempt to offer a placement methodology for the LLC banks to maximize the performance of the on-chip network connecting the LLC banks to the streaming multiprocessors in GPUs. We argue that an efficient placement needs to be derived based on a novel metric that considers the latency hiding capability of the GPUs through thread level parallelism. To this end, we propose a throughput aware metric, called Effective Latency Impact (ELI). Moreover, we define an optimization problem to formulate our placement approach based on the ELI metric mathematically. To solve this optimization problem, we deploy a heuristic solution as this optimization problem is NP-hard. Experimental results show that our placement approach improves the performance by up to 15.7% compared to the state-of-the-art placement. © 2017 IEEE
  6. Keywords:
  7. On-chip networks ; Problem solving ; Heuristic solutions ; Lastlevel caches (LLC) ; Latency hiding ; Program processors ; Optimization problems ; State of the art ; Streaming multiprocessors ; Thread level parallelism ; Optimization
  8. Source: 20th Design, Automation and Test in Europe, DATE 2017, 27 March 2017 through 31 March 2017 ; 2017 , Pages 31-36 ; 9783981537093 (ISBN)
  9. URL: https://ieeexplore.ieee.org/document/7926954