دورية أكاديمية

Toxicity in the Decentralized Web and the Potential for Model Sharing

التفاصيل البيبلوغرافية
العنوان: Toxicity in the Decentralized Web and the Potential for Model Sharing
المؤلفون: Zia, Haris Bin, Raman, Aravindh., Castro, Ignacio, Anaobi, Ishaku Hassan, De Cristofaro, Emiliano, Sastry, Nishanth, Tyson, Gareth
سنة النشر: 2022
المجموعة: ArXiv.org (Cornell University Library)
مصطلحات موضوعية: Computer Science - Computers and Society, Computer Science - Networking and Internet Architecture
الوصف: The "Decentralised Web" (DW) is an evolving concept, which encompasses technologies aimed at providing greater transparency and openness on the web. The DW relies on independent servers (aka instances) that mesh together in a peer-to-peer fashion to deliver a range of services (e.g. micro-blogs, image sharing, video streaming). However, toxic content moderation in this decentralised context is challenging. This is because there is no central entity that can define toxicity, nor a large central pool of data that can be used to build universal classifiers. It is therefore unsurprising that there have been several high-profile cases of the DW being misused to coordinate and disseminate harmful material. Using a dataset of 9.9M posts from 117K users on Pleroma (a popular DW microblogging service), we quantify the presence of toxic content. We find that toxic content is prevalent and spreads rapidly between instances. We show that automating per-instance content moderation is challenging due to the lack of sufficient training data available and the effort required in labelling. We therefore propose and evaluate ModPair, a model sharing system that effectively detects toxic content, gaining an average per-instance macro-F1 score 0.89.
نوع الوثيقة: text
اللغة: unknown
العلاقة: http://arxiv.org/abs/2204.12709Test
الإتاحة: http://arxiv.org/abs/2204.12709Test
رقم الانضمام: edsbas.E6074E75
قاعدة البيانات: BASE