Privacy-Preserving Online Content Moderation: A Federated Learning Use Case

Leonidou, Pantelitsa; Kourtellis, Nicolas; Salamanos, Nikos; Sirivianos, Michael

doi:10.1145/3543873.3587604

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/29969

DC Field	Value	Language
dc.contributor.author	Leonidou, Pantelitsa	-
dc.contributor.author	Kourtellis, Nicolas	-
dc.contributor.author	Salamanos, Nikos	-
dc.contributor.author	Sirivianos, Michael	-
dc.date.accessioned	2023-07-25T09:39:24Z	-
dc.date.available	2023-07-25T09:39:24Z	-
dc.date.issued	2023-04-30	-
dc.identifier.citation	ACM Web Conference - Companion of the World Wide Web Conference, 2023, 30 April - 4 May, pp. 280-289	en_US
dc.identifier.isbn	9781450394161	-
dc.identifier.uri	https://hdl.handle.net/20.500.14279/29969	-
dc.description.abstract	Users are exposed to a large volume of harmful content that appears daily on various social network platforms. One way to protect users is to develop online moderation tools that use Machine Learning (ML) techniques for automatic detection or content filtering. However, processing user data requires compliance with privacy policies. In this paper, we propose a framework for developing content moderation tools in a privacy-preserving manner, where sensitive information stays on the users' devices. For this purpose, we apply Differentially Private Federated Learning (DP-FL), where the training of ML models is performed locally on the users' devices and only the model updates are shared with a central entity. To demonstrate the utility of our approach, we simulate harmful text classification on Twitter data in a distributed FL fashion, but the overall concept can be generalized to other types of misbehavior, data, and platforms. We show that the performance of the proposed FL framework can be close to the centralized approach, for both DP-FL and non-DP FL. Moreover, it performs well even when only a small number of clients (each with a small number of tweets) are available for the FL training. When reducing the number of clients (from fifty to ten) or the tweets per client (from 1K to 100), the classifier can still achieve AUC. Furthermore, we extend the evaluation to four other Twitter datasets that capture different types of user misbehavior and still obtain promising performance (61% to 80% AUC). Finally, we explore the overhead on the users' devices during the FL training phase and show that local training does not introduce excessive CPU utilization or memory consumption overhead.	en_US
dc.language.iso	en	en_US
dc.rights	© Copyright held by the owner/author(s)	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/4.0/	-
dc.subject	Content moderation	en_US
dc.subject	Federated learning	en_US
dc.subject	Privacy	en_US
dc.title	Privacy-Preserving Online Content Moderation: A Federated Learning Use Case	en_US
dc.type	Article	en_US
dc.collaboration	Cyprus University of Technology	en_US
dc.collaboration	Telefonica Research	en_US
dc.subject.category	Mechanical Engineering	en_US
dc.country	Cyprus	en_US
dc.country	Spain	en_US
dc.subject.field	Engineering and Technology	en_US
dc.relation.conference	ACM Web Conference 2023 - Companion of the World Wide Web Conference	en_US
dc.identifier.doi	10.1145/3543873.3587604	en_US
dc.identifier.scopus	2-s2.0-85159630044	-
dc.identifier.url	https://api.elsevier.com/content/abstract/scopus_id/85159630044	-
cut.common.academicyear	2022-2023	en_US
dc.identifier.spage	280	en_US
dc.identifier.epage	289	en_US
item.fulltext	No Fulltext	-
item.languageiso639-1	en	-
item.grantfulltext	none	-
item.openairecristype	http://purl.org/coar/resource_type/c_6501	-
item.cerifentitytype	Publications	-
item.openairetype	article	-
crisitem.author.dept	Department of Electrical Engineering, Computer Engineering and Informatics	-
crisitem.author.dept	Department of Electrical Engineering, Computer Engineering and Informatics	-
crisitem.author.faculty	Faculty of Engineering and Technology	-
crisitem.author.faculty	Faculty of Engineering and Technology	-
crisitem.author.orcid	0000-0002-5946-0074	-
crisitem.author.orcid	0000-0002-6500-581X	-
crisitem.author.parentorg	Faculty of Engineering and Technology	-
crisitem.author.parentorg	Faculty of Engineering and Technology	-
Appears in Collections:	Άρθρα/Articles
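
Illustrative sketch (not part of the record): the abstract describes DP-FL as local training on user devices with only model updates sent to a central entity. The record does not say which differential privacy mechanism the paper uses, so the Python below assumes a common one: the server clips each client update to a fixed L2 norm, averages the clipped updates, and adds Gaussian noise to the mean. The names clip_update, dp_federated_average, clip_norm, and noise_multiplier are hypothetical, and local training is left out; updates are passed in as plain lists of floats.

```python
import math
import random


def clip_update(update, clip_norm):
    """Scale an update so that its L2 norm is at most clip_norm."""
    norm = math.sqrt(sum(x * x for x in update))
    if norm <= clip_norm:
        return list(update)
    scale = clip_norm / norm
    return [x * scale for x in update]


def dp_federated_average(updates, clip_norm, noise_multiplier, rng):
    """Average clipped client updates and add Gaussian noise to the mean.

    Assumption: noise standard deviation is noise_multiplier * clip_norm / n,
    a usual choice for a noisy average; the paper may use something else.
    """
    n = len(updates)
    clipped = [clip_update(u, clip_norm) for u in updates]
    dim = len(clipped[0])
    mean = [sum(u[i] for u in clipped) / n for i in range(dim)]
    sigma = noise_multiplier * clip_norm / n
    return [m + rng.gauss(0.0, sigma) for m in mean]


if __name__ == "__main__":
    # Three hypothetical clients, each reporting a two-dimensional update.
    client_updates = [[3.0, 4.0], [0.1, -0.2], [0.0, 0.5]]
    rng = random.Random(0)
    print(dp_federated_average(client_updates, 1.0, 1.1, rng))
```

Setting noise_multiplier to 0 reduces this to plain federated averaging over clipped updates, which corresponds to the non-DP FL baseline the abstract mentions only in a loose sense.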

This item is licensed under a Creative Commons License
