Liu, Naijia
ORCID: 0000-0002-9979-9048
(2025)
Aging-Aware Runtime Management for Many-Core Systems.
PhD thesis, University of York.
Abstract
Transistor scaling has enabled the fabrication of devices implementing many-core architectures, but has introduced challenges such as the need to limit power consumption and increase reliability. These factors can have a significant impact on the degradation of devices through time and, as a consequence, managing aging effects has become an important design requirement. While dynamic power and runtime resource management are generally used for efficient execution and to control workloads to achieve optimal performance while satisfying power constraints, they can also be effective tools to improve the lifetime of the system.
This thesis presents an aging-aware runtime management approach is proposed to address the combined effects of traffic congestion and task execution on reliability degradation in many-core systems. The method relies on a predictive fitness-driven mapping and remapping strategy, where observed energy consumption in task computation and communication activity guide decisions to balance workload distribution and reduce stress on vulnerable components and the total system aging. By redistributing tasks accordingly, our approach ensures that remapping not only maintains application performance but also mitigates aging effects that accumulate during execution.
To address system aging and evaluate proposed management strategy, a dedicated simulation environment \textbf{CASTLE} is developed using a multi-agent model that captures the dynamic interactions of components existed in many-core systems in computation and communication induced aging phenomena. The proposed aging-aware runtime management strategy is evaluated and experimented in \textbf{CASTLE} simulation platform by applying different size of systems with typical applications, to be proved that it can enable more effective and efficient mitigation of aging effects caused by heavy computational load or traffic. Results demonstrate that incorporating predictive temperature modelling into runtime management significantly improves mitigation of aging effects, extending system lifetime without compromising application performance and showing the potential of runtime-aware strategies for ensuring the long-term reliability of future many-core systems.
Metadata
| Supervisors: | Tempesti, Gianluca |
|---|---|
| Related URLs: | |
| Keywords: | Many-core Systems, Lifetime Reliability, Task Mapping, Runtime Management, Aging-Aware, Sustainable Systolic Array, Multi-agent Modelling, Reliability, Aging,Runtime Management, Prediction |
| Awarding institution: | University of York |
| Academic Units: | The University of York > School of Physics, Engineering and Technology (York) |
| Date Deposited: | 26 May 2026 13:13 |
| Last Modified: | 26 May 2026 13:13 |
| Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:38724 |
Download
Examined Thesis (PDF)
Embargoed until: 26 November 2026
Please use the button below to request a copy.
Filename: Liu_204033314_CorrectedThesisClean2.pdf
Export
Statistics
Please use the 'Request a copy' link(s) in the 'Downloads' section above to request this thesis. This will be sent directly to someone who may authorise access.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.