Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/34619
Title: An Evaluation of a Federated Learning Framework for Detecting Hate Speech and Disinformation on Social Media Platforms
Authors: Demou, Michael Angelos 
Keywords: Federated Learning;Non-IID;Disinformation;Harmful Content;Text Classification
Advisor: Sirivianos, Michael
Issue Date: May-2024
Department: Department of Electrical Engineering, Computer Engineering and Informatics
Faculty: Faculty of Engineering and Technology
Abstract: The spread of harmful content and fake news on social media platforms has become a significant issue, highlighting the need for automatic detection and filtering tools. Machine learning, which enables computers to identify patterns, is frequently employed to develop these tools. However, it involves handling sensitive user data, which must be managed carefully to respect privacy. Federated Learning (FL) addresses this by allowing data to remain on the user’s device, rather than being transferred to a central server. In FL, learning occurs locally on individual devices and only model updates are shared with the server, thus minimizing privacy risks. Despite its advantages, FL faces challenges such as statistical heterogeneity, where data is not uniformly distributed across devices. This can result in biased predictions when some classes are over-represented in a client’s dataset or when data amounts vary, affecting model influence. This thesis develops a text classifier within a distributed Federated Learning (FL) setup to identify tweets containing harmful content or fake news, focusing on simulating two custom non-IID methods from another study [12] to create real a life scenario for detecting harmful content, as well as a real-life non-IID scenario for disinformation detection. Testing this model on five Twitter datasets representing various types of user misbehavior and misinformation yielded promising results, with F1 scores ranging from 70% to 82%. We further analyzed the model’s performance with different numbers of clients and compared it to a centralized approach. Additionally, we experimented with various alpha values affecting the custom non-IID methods to assess the model’s performance compared to IID conditions. This comprehensive evaluation confirms the robustness of our FL framework, making it a viable solution for detecting harmful content and disinformation in real-world, non-IID scenarios.
URI: https://hdl.handle.net/20.500.14279/34619
Rights: Attribution-NonCommercial-NoDerivatives 4.0 International
Type: Bachelors Thesis
Affiliation: Cyprus University of Technology 
Appears in Collections:Πτυχιακές Εργασίες/ Bachelor's Degree Theses

Files in This Item:
File Description SizeFormat
BSC_Michael Angelos Demou_2024_abstract.pdfabstract140.66 kBAdobe PDFView/Open
CORE Recommender
Show full item record

Page view(s)

129
Last Week
2
Last month
9
checked on Nov 18, 2025

Download(s)

20
checked on Nov 18, 2025

Google ScholarTM

Check


This item is licensed under a Creative Commons License Creative Commons