Skip to main content
Article
AFND: Arabic fake news dataset for the detection and classification of articles credibility
Data in Brief
  • Ashwaq Khalil, Jordan University of Science and Technology
  • Moath Jarrah, Jordan University of Science and Technology
  • Monther Aldwairi, Jordan University of Science and Technology; Zayed University
  • Manar Jaradat, Hashemite University
Document Type
Article
Publication Date
4-1-2022
Abstract

The news credibility detection task has started to gain more attention recently due to the rapid increase of news on different social media platforms. This article provides a large, labeled, and diverse Arabic Fake News Dataset (AFND) that is collected from public Arabic news websites. This dataset enables the research community to use supervised and unsupervised machine learning algorithms to classify the credibility of Arabic news articles. AFND consists of 606912 public news articles that were scraped from 134 public news websites of 19 different Arab countries over a 6-month period using Python scripts. The Arabic fact-check platform, Misbar, is used manually to classify each public news source into credible, not credible, or undecided. Weak supervision is applied to label news articles with the same label as the public source. AFND is imbalanced in the number of articles in each class. Hence, it is useful for researchers who focus on finding solutions for imbalanced datasets. The dataset is available in JSON format and can be accessed from Mendeley Data repository.

Publisher
Elsevier BV
Keywords
  • Arabic news dataset,
  • Arabic fake news,
  • Article credibility,
  • Weak labeling,
  • Detection,
  • Classification
Scopus ID

85128713162

Creative Commons License
Creative Commons Attribution 4.0 International
Indexed in Scopus
Yes
Open Access
Yes
Open Access Type
Gold: This publication is openly available in an open access journal/series
Citation Information
Ashwaq Khalil, Moath Jarrah, Monther Aldwairi and Manar Jaradat. "AFND: Arabic fake news dataset for the detection and classification of articles credibility" Data in Brief (2022) p. 108141 - 108141 ISSN: <p><a href="https://v2.sherpa.ac.uk/id/publication/issn/2352-3409" target="_blank">2352-3409</a></p>
Available at: http://works.bepress.com/monther-aldwairi/50/