Elgamal, Abdelrahman Said Youssef Abdelraouf (2025) Resource Allocation in Cellular Optical Wireless Systems Using Reinforcement Learning. PhD thesis, University of Leeds.
Abstract
In modern communication networks, indoor users and their data rate demands are massively increasing. Radio-based systems struggle to meet these demands due to their limited available spectrum. Researchers proposed optical wireless communication (OWC) systems in indoor environments to meet these demands due to their high data rates, reliability and energy efficiency. As the number of users increases, efficient allocation of system resources becomes essential.
The thesis introduces, for the first time a Q-Learning (QL)-based resource allocation algorithm tailored for indoor OWC systems. The proposed method was evaluated on wavelength division multiple access (WDMA)-based visible light communication (VLC) and steerable laser-based OWC systems. The QL approach delivers resource allocation solutions comparable to the optimal solutions achieved by mixed integer linear programming (MILP), while operating without prior environmental knowledge. Nevertheless, the method’s reliance on discrete Q-tables limits its applicability to small-scale environments with fewer users and access points.
To address more complex scenarios, the thesis proposes a deep reinforcement learning (DRL) framework for resource allocation in indoor OWC systems. The DRL approach uses neural networks to replace the Q-table, enabling operation in larger, more dynamic settings. Evaluations show DRL achieves performance close to MILP benchmarks and outperforms simpler heuristics, such as distance-based allocation (DBA). However, its effectiveness depends on careful hyperparameter tuning and model design.
Finally, the thesis integrates artificial neural network (ANN)-based user positioning with DRL to manage resource allocation under user mobility. Using a
V
random waypoint (RWP) mobility model, the hybrid ANN-DRL system significantly improves learning performance in dynamic conditions. Yet, the system assumes a specific room configuration; environmental changes, such as moving furniture or introducing obstacles, may reduce positioning accuracy and require periodic retraining to sustain performance.
Metadata
Supervisors: | Elmirghani, Jaafar and Elgorashi, Taisir |
---|---|
Awarding institution: | University of Leeds |
Academic Units: | The University of Leeds > Faculty of Engineering (Leeds) > School of Electronic & Electrical Engineering (Leeds) |
Depositing User: | Mr Abdelrahman Said Youssef Abdelraouf Elgamal |
Date Deposited: | 01 Jul 2025 11:43 |
Last Modified: | 01 Jul 2025 11:43 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:36919 |
Download
Final eThesis - complete (pdf)
Embargoed until: 1 July 2030
Please use the button below to request a copy.
Filename: Elgamal_ASYA_ElectronicandElectricalEngineering_PhD_2025.pdf

Export
Statistics
Please use the 'Request a copy' link(s) in the 'Downloads' section above to request this thesis. This will be sent directly to someone who may authorise access.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.