A Process-oriented Dataset of Revisions during Writing

Date
2020-01-01
Authors
Conijn, Rianne
Dux Speltz, Emily
van Zaanen, Menno
Van Waes, Luuk
Chukharev-Hudilainen, Evgeny
Authors
Person
Chukharev-Hudilainen, Evgeny
Associate Professor
Department
English
Abstract

Revision plays a major role in writing and in the analysis of writing processes. Revisions can be analyzed using a product-oriented approach (focusing on the finished product, the text that has been produced) or a process-oriented approach (focusing on the process that the writer followed to generate this product). Although several language resources exist for the product-oriented approach to revisions, hardly any resources are available yet for an in-depth analysis of the revision process. Therefore, we provide an extensive dataset on revisions made during writing (accessible via hdl.handle.net/10411/VBDYGX). This dataset is based on keystroke and eye-tracking data from 65 students with a variety of backgrounds (undergraduate and graduate students, with English as a first or second language) who completed a variety of tasks (argumentative text and academic abstract). In total, 7,120 revisions were identified in the dataset. For each revision, 18 features have been manually annotated and 31 features have been automatically extracted. As a case study, we show two potential use cases of the dataset. In addition, future uses of the dataset are described.

Comments

This paper is published as Conijn, R., E. Dux Speltz, M. van Zaanen, L. Van Waes, and E. Chukharev-Hudilainen (2020). A process-oriented dataset of revisions during writing. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020), 363-368.

Copyright
2020