IMAM, NIDDAL ORCID: https://orcid.org/0000-0001-8399-0449 (2021) Adversary-Aware, Machine Learning-based Detection of Spam in Twitter Hashtags. PhD thesis, University of York.
Abstract
Concerns about the vulnerability of Machine Learning (ML) to adversarial examples in cybersecurity systems have been growing in recent years. These systems are operating in adversarial environments, so any solutions need to consider the presence of adversaries and to evolve over time in the face of emerging threats. However, most of existing ML-based models designed for cybersecurity systems, such as Online Social Networks (OSNs)’ spam detection are either adversary-agnostic models or only focus on one aspect of adversarial environments.
The goal of this work is to design adversary-aware ML-based detectors of spam in Twitter consid-ering three key points: the robustness to adversarial examples, adaptability to evolving attacks and interpertability to security analysts. Throughout the thesis, we used health-related spam campaignsin Twitter Arabic hashtags as a case study. The analysis of these campaigns help us to identify three adversarial attacks and develop three adversary-aware ML- and DL-based detectors. The first contribution of this thesis is a taxonomy of potential adversarial attacks scenarios in Twitter. Then, we moved forward to develop an adversary-aware spam detector, which was built on the observation that the targeted campaigns were found to be using unique hijacked accounts to fool the deployed spam detectors. We designed a new feature, which is faster to compute compared to features usedin the literature, and which also improves the accuracy of detecting the identified hijacked accountsby 73%. Additionally, we proposed an approach for designing adversary-aware spam image detectors. The key novelty is that our approach improves the robustness through adversarial training anduses black/ white list with human-in-the-loop (HITL) approach to ensure the detectors can evolveover time. The developed adversary-aware Optical Character Recognition (OCR)-based detector outperforms two SOTA OCRs in recognising Arabic and English text embedded in Twitter spam images. We further propose an OCR post-correction algorithm, which improves the robustness of OCR-based detectors with at least 10% against the generated Adversarial Text Images.
Metadata
Supervisors: | Vasilakis, Vasileios and Dimitris, Kolovos |
---|---|
Keywords: | ML, Adversarial examples, DL, OCR, Spam detection, Twitter, OSNs, Interpertability, Adaptability |
Awarding institution: | University of York |
Academic Units: | The University of York > Computer Science (York) |
Identification Number/EthosID: | uk.bl.ethos.844257 |
Depositing User: | Mr. NIDDAL IMAM |
Date Deposited: | 16 Dec 2021 08:41 |
Last Modified: | 21 Jan 2023 10:53 |
Open Archives Initiative ID (OAI ID): | oai:etheses.whiterose.ac.uk:29774 |
Download
Examined Thesis (PDF)
Filename: Imam_203011412_CorrectedThesisClean.pdf
Licence:
This work is licensed under a Creative Commons Attribution NonCommercial NoDerivatives 4.0 International License
Export
Statistics
You do not need to contact us to get a copy of this thesis. Please use the 'Download' link(s) above to get a copy.
You can contact us about this thesis. If you need to make a general enquiry, please see the Contact us page.