Skip to main content
Article
Grammatical Error Correction: A Survey of the State of the Art
Computational Linguistics
  • Christopher Bryant, Department of Computer Science and Technology
  • Zheng Yuan, King's College London
  • Muhammad Reza Qorib, National University of Singapore
  • Hannan Cao, National University of Singapore
  • Hwee Tou Ng, National University of Singapore
  • Ted Briscoe, Mohamed Bin Zayed University of Artificial Intelligence
Document Type
Article
Abstract

Grammatical Error Correction (GEC) is the task of automatically detecting and correcting errors in text. The task not only includes the correction of grammatical errors, such as missing prepositions and mismatched subject–verb agreement, but also orthographic and semantic errors, such as misspellings and word choice errors, respectively. The field has seen significant progress in the last decade, motivated in part by a series of five shared tasks, which drove the development of rule-based methods, statistical classifiers, statistical machine translation, and finally neural machine translation systems, which represent the current dominant state of the art. In this survey paper, we condense the field into a single article and first outline some of the linguistic challenges of the task, introduce the most popular datasets that are available to researchers (for both English and other languages), and summarize the various methods and techniques that have been developed with a particular focus on artificial error generation. We next describe the many different approaches to evaluation as well as concerns surrounding metric reliability, especially in relation to subjective human judgments, before concluding with an overview of recent progress and suggestions for future work and remaining challenges. We hope that this survey will serve as a comprehensive resource for researchers who are new to the field or who want to be kept apprised of recent developments.

DOI
10.1162/coli_a_00478
Publication Date
9-1-2023
Keywords
  • Classification (of information),
  • Computational linguistics,
  • Computer aided language translation,
  • Error correction,
  • Neural machine translation,
  • Petroleum reservoir evaluation
Comments

Open Access version available on MIT Press Direct

License: CC BY-NC-ND

Uploaded: 15 February 2024

Citation Information
Christopher Bryant, Zheng Yuan, Muhammad Reza Qorib, Hannan Cao, et al.. "Grammatical Error Correction: A Survey of the State of the Art" Computational Linguistics Vol. 49 Iss. 3 (2023) p. 643 - 701 ISSN: 08912017
Available at: http://works.bepress.com/ted-briscoe/1/