BOUTEF: A Multilingual Corpus for FakeNews in North Africa -- Language as a Weapon

ArXiv CS.CL | 2026-06-02 | NEWS | en | Authors: Kamel Smaili, Yassine Toughrai, Amina Laggoun, David Langlois

Abstract

arXiv:2606.00193v1 (announce type: new). The rapid spread of fake news on social media has become a major challenge, particularly in multilingual and under-resourced contexts such as North Africa. In this paper, we introduce BOUTEF, a large-scale multilingual corpus designed to study the propagation, characteristics, and impact of fake news in Algeria and Tunisia. The corpus integrates three complementary components: fake narratives, genuine narratives, and associated user-generated comments, along with verified debunking information. It covers a wide range of languages and linguistic varieties, including Modern Standard Arabic (MSA), Algerian and Tunisian dialects, Arabizi, French, English, and code-switched language. Building on this resource, we conduct a comprehensive empirical analysis combining quantitative and qualitative approaches. We examine thematic distributions, linguistic and rhetorical strategies, sentiment patterns, and social engagement dynamics.
