Loading...

TherMa-MiCs: Thermal-Aware scheduling for fault-tolerant mixed-criticality systems

Safari, S ; Sharif University of Technology | 2022

80 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/TPDS.2021.3123544
  3. Publisher: IEEE Computer Society , 2022
  4. Abstract:
  5. Multicore platforms are becoming the dominant trend in designing Mixed-Criticality Systems (MCSs), which integrate applications of different levels of criticality into the same platform. A well-known MCS is the dual-criticality system that is composed of low-criticality and high-criticality tasks. The availability of multiple cores on a single chip provides opportunities to employ fault-Tolerant techniques, such as N-Modular Redundancy (NMR), to ensure the reliability of MCSs. However, applying fault-Tolerant techniques will increase the power consumption on the chip, and thereby on-chip temperatures might increase beyond safe limits. To prevent thermal emergencies, urgent countermeasures, like Dynamic Voltage and Frequency Scaling (DVFS) or Dynamic Power Management (DPM) will be triggered to cool down the chip. Such countermeasures, however, might not only lead to suspending low-criticality tasks, but also it might lead to violating timing constraints of high-criticality tasks. In order to prevent such severe scenarios, it is indispensable to consider a temperature constraint within the scheduling process of fault-Tolerant MCSs. Therefore, this paper presents, for the first time, a thermal-Aware scheduling scheme for fault-Tolerant MCSs, named TherMa-MiCs. In particular, TherMa-MiCs, satisfies the temperature constraint jointly with the timing constraints of the high-criticality tasks, while attempting to maximize the QoS of low-criticality tasks under the predefined constraints. At the same time, a reliability target is satisfied by employing the well-known N-Modular Redundancy (NMR) fault-Tolerant technique. Experimental results show that our proposed scheme meets the temperature and timing constraints, while at the same time, improving the QoS of low-criticality tasks, with an average of 44%. © 1990-2012 IEEE
  6. Keywords:
  7. Multicores ; N Modular Redundancy (NMR) ; QoS ; Temperature ; Criticality (nuclear fission) ; Fault tolerance ; Job analysis ; Redundancy ; Reliability analysis ; Scheduling ; Timing circuits ; Fault tolerant technique ; Fault- tolerant systems ; Fault-tolerant ; Mixed-criticality systems ; Multi-core processing ; Multi-cores ; N modular redundancy ; N-modular redundancies ; Task analysis ; Timing constraints ; Nuclear magnetic resonance
  8. Source: IEEE Transactions on Parallel and Distributed Systems ; Volume 33, Issue 7 , 2022 , Pages 1678-1694 ; 10459219 (ISSN)
  9. URL: https://ieeexplore.ieee.org/abstract/document/9591366