An Evaluation of a Federated Learning Framework for Detecting Hate Speech and Disinformation on Social Media Platforms
Date Issued
May 2024
Author(s)
Advisor
Abstract
The spread of harmful content and fake news on social media platforms has become a significant issue,
highlighting the need for automatic detection and filtering tools. Machine learning, which enables computers
to identify patterns, is frequently employed to develop these tools. However, it involves handling
sensitive user data, which must be managed carefully to respect privacy. Federated Learning (FL) addresses
this by allowing data to remain on the user’s device, rather than being transferred to a central
server. In FL, learning occurs locally on individual devices and only model updates are shared with the
server, thus minimizing privacy risks. Despite its advantages, FL faces challenges such as statistical heterogeneity,
where data is not uniformly distributed across devices. This can result in biased predictions
when some classes are over-represented in a client’s dataset or when data amounts vary, affecting model
influence. This thesis develops a text classifier within a distributed Federated Learning (FL) setup to identify
tweets containing harmful content or fake news, focusing on simulating two custom non-IID methods
from another study [12] to create real a life scenario for detecting harmful content, as well as a real-life
non-IID scenario for disinformation detection. Testing this model on five Twitter datasets representing
various types of user misbehavior and misinformation yielded promising results, with F1 scores ranging
from 70% to 82%. We further analyzed the model’s performance with different numbers of clients and
compared it to a centralized approach. Additionally, we experimented with various alpha values affecting
the custom non-IID methods to assess the model’s performance compared to IID conditions. This
comprehensive evaluation confirms the robustness of our FL framework, making it a viable solution for
detecting harmful content and disinformation in real-world, non-IID scenarios.
highlighting the need for automatic detection and filtering tools. Machine learning, which enables computers
to identify patterns, is frequently employed to develop these tools. However, it involves handling
sensitive user data, which must be managed carefully to respect privacy. Federated Learning (FL) addresses
this by allowing data to remain on the user’s device, rather than being transferred to a central
server. In FL, learning occurs locally on individual devices and only model updates are shared with the
server, thus minimizing privacy risks. Despite its advantages, FL faces challenges such as statistical heterogeneity,
where data is not uniformly distributed across devices. This can result in biased predictions
when some classes are over-represented in a client’s dataset or when data amounts vary, affecting model
influence. This thesis develops a text classifier within a distributed Federated Learning (FL) setup to identify
tweets containing harmful content or fake news, focusing on simulating two custom non-IID methods
from another study [12] to create real a life scenario for detecting harmful content, as well as a real-life
non-IID scenario for disinformation detection. Testing this model on five Twitter datasets representing
various types of user misbehavior and misinformation yielded promising results, with F1 scores ranging
from 70% to 82%. We further analyzed the model’s performance with different numbers of clients and
compared it to a centralized approach. Additionally, we experimented with various alpha values affecting
the custom non-IID methods to assess the model’s performance compared to IID conditions. This
comprehensive evaluation confirms the robustness of our FL framework, making it a viable solution for
detecting harmful content and disinformation in real-world, non-IID scenarios.
File(s)![Thumbnail Image]()
Name
BSC_Michael Angelos Demou_2024_abstract.pdf
Size
140.66 KB
Format
Adobe PDF
Checksum (MD5)
5a2e443e6381743df699f5198244be05

