Repository logoCyprus University of Technology
Log In(current)
Ελληνικά
English
  1. Home
  2. Cyprus University of Technology (Research Output)
  3. Πτυχιακές Εργασίες/ Bachelor's Degree Theses
  4. An Evaluation of a Federated Learning Framework for Detecting Hate Speech and Disinformation on Social Media Platforms
  • Details

An Evaluation of a Federated Learning Framework for Detecting Hate Speech and Disinformation on Social Media Platforms

Date Issued
May 2024
Author(s)
Demou, Michael Angelos  
Advisor
Sirivianos, Michael  
Abstract
The spread of harmful content and fake news on social media platforms has become a significant issue,
highlighting the need for automatic detection and filtering tools. Machine learning, which enables computers
to identify patterns, is frequently employed to develop these tools. However, it involves handling
sensitive user data, which must be managed carefully to respect privacy. Federated Learning (FL) addresses
this by allowing data to remain on the user’s device, rather than being transferred to a central
server. In FL, learning occurs locally on individual devices and only model updates are shared with the
server, thus minimizing privacy risks. Despite its advantages, FL faces challenges such as statistical heterogeneity,
where data is not uniformly distributed across devices. This can result in biased predictions
when some classes are over-represented in a client’s dataset or when data amounts vary, affecting model
influence. This thesis develops a text classifier within a distributed Federated Learning (FL) setup to identify
tweets containing harmful content or fake news, focusing on simulating two custom non-IID methods
from another study [12] to create real a life scenario for detecting harmful content, as well as a real-life
non-IID scenario for disinformation detection. Testing this model on five Twitter datasets representing
various types of user misbehavior and misinformation yielded promising results, with F1 scores ranging
from 70% to 82%. We further analyzed the model’s performance with different numbers of clients and
compared it to a centralized approach. Additionally, we experimented with various alpha values affecting
the custom non-IID methods to assess the model’s performance compared to IID conditions. This
comprehensive evaluation confirms the robustness of our FL framework, making it a viable solution for
detecting harmful content and disinformation in real-world, non-IID scenarios.
Subjects

Federated Learning

Non-IID

Disinformation

Harmful Content

Text Classification

File(s)
Thumbnail Image
Name

BSC_Michael Angelos Demou_2024_abstract.pdf

Size

140.66 KB

Format

Adobe PDF

Checksum (MD5)

5a2e443e6381743df699f5198244be05

Explore by
  • Collections
  • Research Outputs
  • Researchers
  • Faculty & Departments
  • Theses
  • Patents
  • Projects
  • Journals
  • Conferences
Useful Links
  • Researcher Portfolio Guide
  • Researcher Profile
  • Create an ORCID ID
  • CUT Open Access Author Fund
  • ETDS Guide
Copyright Policies

Use Sherpa/Romeo to find publisher copyright policies

Go
Go
  • SPARC Author Addendum Engine
  • National Open Access Policy in Cyprus
Deposit your work to Ktisis
  • Self-archiving. Please sign in to Ktisis.
  • Email your work to:
    library.dspace@cut.ac.cy
  • Contact your subject librarian

Member of

OpenAIREre3dataOpenDOARCOREDART
Cyprus University of Technology
Library and
Information
Services

Copyright © 2022 - Library and Information Services Feedback - Built with DSpace-CRIS - 4Science

  • Accessibility settings
  • Privacy policy
  • End User Agreement
COAR NotifyCOAR Notify