Skip to main content
Presentation
A Process-oriented Dataset of Revisions during Writing
Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)
  • Rianne Conijn, Tilburg University and University of Antwerp
  • Emily Dux Speltz, Iowa State University
  • Menno van Zaanen, North-West University (South Africa)
  • Luuk Van Waes, University of Antwerp
  • Evgeny Chukharev-Hudilainen, Iowa State University
Document Type
Conference Proceeding
Conference
12th Conference on Language Resources and Evaluation (LREC 2020)
Publication Version
Published Version
Publication Date
1-1-2020
Conference Title
12th Conference on Language Resources and Evaluation (LREC 2020)
Conference Date
May 11-16, 2020
Geolocation
(43.296482, 5.36978)
Abstract

Revision plays a major role in writing and the analysis of writing processes. Revisions can be analyzed using a product-oriented approach (focusing on a finished product, the text that has been produced) or a process-oriented approach (focusing on the process that the writer followed to generate this product). Although several language resources exist for the product-oriented approach to revisions, there are hardly any resources available yet for an in-depth analysis of the process of revisions. Therefore, we provide an extensive dataset on revisions made during writing (accessible via hdl.handle.net/10411/VBDYGX). This dataset is based on keystroke data and eye tracking data of 65 students from a variety of backgrounds (undergraduate and graduate English as a first language and English as a second language students) and a variety of tasks (argumentative text and academic abstract). In total, 7,120 revisions were identified in the dataset. For each revision, 18 features have been manually annotated and 31 features have been automatically extracted. As a case study, we show two potential use cases of the dataset. In addition, future uses of the dataset are described.

Comments

This proceeding is published as Conijn, R., E. Dux Speltz, M. van Zaanen, L. Van Waes, and E. Chukharev-Hudilainen. A process-oriented dataset of revisions during writing. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). (2020): 363-368.

Creative Commons License
Creative Commons Attribution-NonCommercial 4.0 International
Copyright Owner
European Language Resources Association (ELRA)
Language
en
File Format
application/pdf
Citation Information
Rianne Conijn, Emily Dux Speltz, Menno van Zaanen, Luuk Van Waes, et al.. "A Process-oriented Dataset of Revisions during Writing" Marseille, FranceProceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020) (2020) p. 363 - 368
Available at: http://works.bepress.com/evgeny-chukharev-hudilainen/16/