Wang, Ziyong (2025) Deep Learning-based Error Level Modelling for Image Manipulation Detection and Localization. PhD thesis, University of Sheffield.
Abstract
The widespread accessibility of multimedia content through social platforms has significantly increased the risk of image manipulation, thereby amplifying the dissemination and influence of fabricated visual information. Addressing this critical challenge necessitates the development of robust, efficient, and explainable methods to verify the authenticity of visual content. This thesis investigates novel AI-based approaches for image manipulation detection, with a particular emphasis on achieving strong generalization across challenging datasets and enhancing model interpretability.
Specifically, three deep learning architectures are proposed to address different tasks within image manipulation detection and localization: 1) WCBnet introduces an adaptive cross-block weighting mechanism at the convolutional block level, allowing the network to dynamically fuse low-level and high-level features based on their relevance to manipulation cues. This hierarchical feature weighting strategy enables WCBnet to achieve fast and accurate fact-checking with minimal additional computational overhead (only a 2.3% increase in trainable parameters), making it particularly suitable for large-scale scenarios involving newly generated manipulated images. 2) DenseWCBnet extends WCBnet by integrating multi-scale receptive fields across multiple convolutional blocks. Through adaptive fusion across different spatial dimensions, DenseWCBnet generates densely weighted feature representations that further enhance manipulation detection accuracy and significantly improve robustness against diverse and challenging manipulation types. 3) WSWCBnet proposes a novel weakly supervised localization framework, combining image-level manipulation heatmaps and semantically irrelevant segmented maps to localize manipulated regions without the need for pixel-level annotations. By leveraging only image-level supervision, WSWCBnet achieves pixel-level localization performance comparable to fully supervised methods, substantially reducing the annotation burden while maintaining high localization precision.
Extensive experiments demonstrate that WCBnet and DenseWCBnet consistently outperform state-of-the-art methods across six widely-used datasets in terms of classification accuracy and F1-score. Furthermore, when evaluated on the particularly challenging DeepfakeArt generative dataset, WCBnet and DenseWCBnet achieve classification accuracies of 94% and 97%, respectively. Additionally, WSWCBnet demonstrates its effectiveness in manipulation localization, achieving comparable performance to fully supervised models despite relying solely on weak supervision. Overall, this thesis advances the field of image forensics by providing robust, explainable, and annotation-efficient deep learning solutions for image manipulation detection and localization, with strong applicability to real-world forensic scenarios.
Metadata
Supervisors: | Abhayaratne, Charith |
---|---|
Keywords: | Deep-learning, image manipulation detection, weakly-supervised localization |
Awarding institution: | University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Electronic and Electrical Engineering (Sheffield) |
Depositing User: | Mr Ziyong Wang |
Date Deposited: | 01 Jul 2025 14:40 |
Last Modified: | 01 Jul 2025 14:40 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:37121 |
Download
Final eThesis - complete (pdf)
Embargoed until: 1 July 2026
This file cannot be downloaded or requested.
Filename: thesis_new.pdf

Export
Statistics
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.