Markovic, Ana ORCID: https://orcid.org/0000-0001-9837-3961
(2025)
Locality-Aware Scheduling of Software Repository Mining Workflows in Heterogeneous Environments.
PhD thesis, University of York.
Abstract
This thesis investigates advancements in job scheduling within heterogeneous computing environments, focusing particularly on the domains of software repository analysis. Two main contributions are presented: the Bidding Scheduler and the self-correcting worker mechanism, denoted as the Feedback-Based Estimator.
Firstly, the Bidding Scheduler is introduced as a novel approach that decentralises the job allocation process. In this model, worker nodes engage in a bidding process to acquire jobs, leveraging localised data and workload conditions to submit bids that reflect expected processing times. The master node collects these bids and allocates jobs in a manner that minimises overall execution time and optimises resource utilisation. This approach addresses the common inefficiencies in traditional master-worker architectures by integrating both data locality and worker heterogeneity into the decision-making process. Experimental work demonstrates the scheduler's ability to adapt to dynamic workloads and varying node capacities, resulting in improved performance metrics.
Secondly, the Feedback-Based Estimator mechanism is developed to enhance the accuracy of job completion time estimates. Acknowledging the limitations in developers' ability to predict precise processing times due to the variability in job attributes and system performance, this mechanism introduces a real-time adjustment process. Based on continuous feedback from executed jobs, the Feedback-Based Estimator dynamically updates the estimation formula to correct any persistent discrepancies between estimated and actual job completion times, thereby reducing the margin of error in time estimates.
Together, these contributions provide a framework for efficient and adaptive job scheduling in heterogeneous computational environments. Future work will focus on investigating applicability of these approaches beyond mining software repositories.
Metadata
Supervisors: | Dimitris, Kolovos and Leandro, Soares Indrusiak |
---|---|
Awarding institution: | University of York |
Academic Units: | The University of York > Computer Science (York) |
Depositing User: | Miss Ana Markovic |
Date Deposited: | 05 Sep 2025 11:11 |
Last Modified: | 05 Sep 2025 11:11 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:37395 |
Download
Examined Thesis (PDF)
Filename: AnaM_Thesis.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.