Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/29969
DC FieldValueLanguage
dc.contributor.authorLeonidou, Pantelitsa-
dc.contributor.authorKourtellis, Nicolas-
dc.contributor.authorSalamanos, Nikos-
dc.contributor.authorSirivianos, Michael-
dc.date.accessioned2023-07-25T09:39:24Z-
dc.date.available2023-07-25T09:39:24Z-
dc.date.issued2023-04-30-
dc.identifier.citationACM Web Conference - Companion of the World Wide Web Conference, 2023, 30-4 April, pp. 280 - 289en_US
dc.identifier.isbn9781450394161-
dc.identifier.urihttps://hdl.handle.net/20.500.14279/29969-
dc.description.abstractUsers are exposed to a large volume of harmful content that appears daily on various social network platforms. One solution to users' protection is developing online moderation tools using Machine Learning (ML) techniques for automatic detection or content filtering. On the other hand, the processing of user data requires compliance with privacy policies. In this paper, we propose a framework for developing content moderation tools in a privacy-preserving manner where sensitive information stays on the users' device. For this purpose, we apply Differentially Private Federated Learning (DP-FL), where the training of ML models is performed locally on the users' devices, and only the model updates are shared with a central entity. To demonstrate the utility of our approach, we simulate harmful text classification on Twitter data in a distributed FL fashion- but the overall concept can be generalized to other types of misbehavior, data, and platforms. We show that the performance of the proposed FL framework can be close to the centralized approach - for both the DP-FL and non-DP FL. Moreover, it has a high performance even if a small number of clients (each with a small number of tweets) are available for the FL training. When reducing the number of clients (from fifty to ten) or the tweets per client (from 1K to 100), the classifier can still achieve AUC. Furthermore, we extend the evaluation to four other Twitter datasets that capture different types of user misbehavior and still obtain a promising performance (61% - 80% AUC). Finally, we explore the overhead on the users' devices during the FL training phase and show that the local training does not introduce excessive CPU utilization and memory consumption overhead.en_US
dc.language.isoenen_US
dc.rights© Copyright held by the owner/author(s)en_US
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/4.0/-
dc.subjectContent moderationen_US
dc.subjectFederated learningen_US
dc.subjectPrivacyen_US
dc.titlePrivacy-Preserving Online Content Moderation: A Federated Learning Use Caseen_US
dc.typeArticleen_US
dc.collaborationCyprus University of Technologyen_US
dc.collaborationTelefonica Researchen_US
dc.subject.categoryMechanical Engineeringen_US
dc.countryCyprusen_US
dc.countrySpainen_US
dc.subject.fieldEngineering and Technologyen_US
dc.relation.conferenceACM Web Conference 2023 - Companion of the World Wide Web Conferenceen_US
dc.identifier.doi10.1145/3543873.3587604en_US
dc.identifier.scopus2-s2.0-85159630044-
dc.identifier.urlhttps://api.elsevier.com/content/abstract/scopus_id/85159630044-
cut.common.academicyear2022-2023en_US
dc.identifier.spage280en_US
dc.identifier.epage289en_US
item.fulltextNo Fulltext-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.openairecristypehttp://purl.org/coar/resource_type/c_6501-
item.openairetypearticle-
item.languageiso639-1en-
crisitem.author.deptDepartment of Electrical Engineering, Computer Engineering and Informatics-
crisitem.author.deptDepartment of Electrical Engineering, Computer Engineering and Informatics-
crisitem.author.facultyFaculty of Engineering and Technology-
crisitem.author.facultyFaculty of Engineering and Technology-
crisitem.author.orcid0000-0002-5946-0074-
crisitem.author.orcid0000-0002-6500-581X-
crisitem.author.parentorgFaculty of Engineering and Technology-
crisitem.author.parentorgFaculty of Engineering and Technology-
Appears in Collections:Άρθρα/Articles
CORE Recommender
Show simple item record

Page view(s)

159
Last Week
4
Last month
19
checked on May 11, 2024

Google ScholarTM

Check

Altmetric


This item is licensed under a Creative Commons License Creative Commons