Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/25968
Title: Towards privacy-preserving cybersafety tools by using Federated Learning
Authors: Leonidou, Pantelitsa 
Keywords: Cybersafety;Machine Learning algorithms;Federated Learning
Advisor: Sirivianos, Michael
Issue Date: Aug-2021
Department: Department of Electrical Engineering, Computer Engineering and Informatics
Faculty: Faculty of Engineering and Technology
Abstract: Living in the digital era, people can access a huge amount of online content daily. Online services might bene t humanity by easing many everyday life tasks. However, people and especially minor users can en- counter many threats while they are online. Despite various cybersafety tools and applications, the number of minors experiencing online threats is not decreasing. This work focuses on cybersafety tools that use Ma- chine Learning algorithms for automatic detection of inappropriate con- tent. Such tools require the collection of big of data that are often sen- sitive. Additionally, keeping these datasets up-to-date and retraining the models can be challenging. We propose using Federated Learning (FL) training to overcome these challenges. FL allows training a model on dis- tributed data without transferring data to a central unit. We provide a conceptual mapping between the components of a cybersafety framework architecture and the actors in the FL communication protocol to explain how FL can be applied in the context of cybersafety tools. We design and implement a TensorFlow-Federated simulation to explore FL training on a text classi cation model that detects aggressive text. We experimented with a centralized dataset of aggressive tweet posts to assess the perfor- mance of the model trained in the FL approach compared to a model trained in the centralized approach and explore how the number of clients participating in FL a ects the model's performance. Additionally, we ex- perimented with a local model training to assess the client device's CPU utilization, memory consumption, and execution time. The results show that the model's performance when trained in FL settings can approach the model's performance when trained in the traditional approach. Also, the model's performance improves when more clients participate in the FL training. Regarding the performance of a client's device, the results show that the execution time for a local model's training is short and does not over-consume the device resources. The ndings show that cybersafety tools are an applicable use case for FL training.
URI: https://hdl.handle.net/20.500.14279/25968
Rights: Απαγορεύεται η δημοσίευση ή αναπαραγωγή, ηλεκτρονική ή άλλη χωρίς τη γραπτή συγκατάθεση του δημιουργού και κάτοχου των πνευματικών δικαιωμάτων.
Attribution-NonCommercial-NoDerivatives 4.0 International
Type: MSc Thesis
Affiliation: Cyprus University of Technology 
Appears in Collections:Μεταπτυχιακές Εργασίες/ Master's thesis

CORE Recommender
Show full item record

Page view(s)

140
Last Week
1
Last month
6
checked on Dec 22, 2024

Download(s) 50

76
checked on Dec 22, 2024

Google ScholarTM

Check


This item is licensed under a Creative Commons License Creative Commons