Article
A survey on sentiment analysis in Urdu: A resource-poor language
Egyptian Informatics Journal
  • Asad Khattak, Zayed University
  • Muhammad Zubair Asghar, Gomal University
  • Anam Saeed, Gomal University
  • Ibrahim A. Hameed, Faculty of Information Technology and Electrical Engineering
  • Syed Asif Hassan, King Abdulaziz University
  • Shakeel Ahmad, King Abdulaziz University
Document Type
Article
Publication Date
1-1-2020
Abstract

© 2020. Background/introduction: The dawn of the internet opened the doors to the easy and widespread sharing of information on subject matters such as products, services, events and political opinions. While the volume of studies conducted on sentiment analysis is rapidly expanding, these studies mostly address the English language. The primary goal of this study is to present a state-of-the-art survey that identifies the progress and shortcomings of Urdu sentiment analysis and proposes remedies. Methods: We described the advancements made thus far in this area by categorising the studies along three dimensions, namely: text pre-processing, lexical resources and sentiment classification. The pre-processing operations include word segmentation, text cleaning, spell checking and part-of-speech tagging. An evaluation of sophisticated lexical resources, including corpora and lexicons, was carried out, and investigations were conducted on sentiment analysis constructs such as opinion words, modifiers and negations. Results and conclusions: Performance is reported for each of the reviewed studies. The experimental results and proposals put forward in this paper provide the groundwork for further studies on Urdu sentiment analysis.

Publisher
Elsevier B.V.
Keywords
  • Corpus
  • Datasets
  • Pre-processing
  • Semantic orientation
  • Sentiment lexicon
  • Urdu sentiment analysis
  • Urdu sentiment classification
Scopus ID

85084649873

Creative Commons License
Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International
Indexed in Scopus
Yes
Open Access
Yes
Open Access Type
Gold: This publication is openly available in an open access journal/series
Citation Information
Asad Khattak, Muhammad Zubair Asghar, Anam Saeed, Ibrahim A. Hameed, et al. "A survey on sentiment analysis in Urdu: A resource-poor language" Egyptian Informatics Journal (2020) ISSN: 1110-8665
Available at: http://works.bepress.com/asad-khattak/8/