Campus Units

English

Document Type

Conference Proceeding

Conference

12th Conference on Language Resources and Evaluation (LREC 2020)

Publication Version

Published Version

Publication Date

2020

Journal or Book Title

Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)

First Page

363

Last Page

368

Conference Title

12th Conference on Language Resources and Evaluation (LREC 2020)

Conference Date

May 11-16, 2020

City

Marseille, France

Abstract

Revision plays a major role in writing and the analysis of writing processes. Revisions can be analyzed using a product-oriented approach (focusing on a finished product, the text that has been produced) or a process-oriented approach (focusing on the process that the writer followed to generate this product). Although several language resources exist for the product-oriented approach to revisions, there are hardly any resources available yet for an in-depth analysis of the process of revisions. Therefore, we provide an extensive dataset on revisions made during writing (accessible via hdl.handle.net/10411/VBDYGX). This dataset is based on keystroke data and eye tracking data of 65 students from a variety of backgrounds (undergraduate and graduate English as a first language and English as a second language students) and a variety of tasks (argumentative text and academic abstract). In total, 7,120 revisions were identified in the dataset. For each revision, 18 features have been manually annotated and 31 features have been automatically extracted. As a case study, we show two potential use cases of the dataset. In addition, future uses of the dataset are described.

Comments

This proceeding is published as Conijn, R., E. Dux Speltz, M. van Zaanen, L. Van Waes, and E. Chukharev-Hudilainen. A process-oriented dataset of revisions during writing. In Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). (2020): 363-368.

Creative Commons License

Creative Commons Attribution-NonCommercial 4.0 International License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License

Copyright Owner

European Language Resources Association (ELRA)

Language

en

File Format

application/pdf

Share

Article Location

 
COinS