Ideas cascading into keystrokes – Modelling writing hesitations as Bayesian mixture process

Dataset	Source	Keylogger	Task	N	Age	Sample	Country	Language
C2L1	Rønneberg et al. (2022)	EyeWrite	Argumentative	126	12	6th graders	Norway	Norwegian
CATO	Torrance et al. (2016)	EyeWrite	Expository	52	17	Secondary school students (dyslexic and non-dyslexic)	Norway	Norwegian
GE2	Ofstad Oxborough and Torrance (2011)	EyeWrite	Argumentative	45	19	Undergraduate students	UK	English
LIFT	Vandermeulen et al. (2020)	InputLog	Synthesis	658	17	Secondary school students	The Netherlands	Dutch
PLanTra	Rossetti and Van Waes (2022)	InputLog	Text simplification	47	23	Master students	Belgium	English (L2)
SPL2	Torrance, Roeser, and Chukharev (n.d.)	CyWrite	Argumentative	39	21	Undergraduate students	USA	English

Location	Example
Within word	T\(^{\wedge}\)h\(^{\wedge}\)e c\(^{\wedge}\)a\(^{\wedge}\)t m\(^{\wedge}\)e\(^{\wedge}\)o\(^{\wedge}\)w\(^{\wedge}\)e\(^{\wedge}\)d. T\(^{\wedge}\)h\(^{\wedge}\)e\(^{\wedge}\)n i\(^{\wedge}\)t s\(^{\wedge}\)l\(^{\wedge}\)e\(^{\wedge}\)p\(^{\wedge}\)t.
Before word	The \(^{\wedge}\)cat \(^{\wedge}\)meowed. Then \(^{\wedge}\)it \(^{\wedge}\)slept.
Before sentence	The cat meowed. \(^{\wedge}\)Then it slept.
^a Note: Key intervals that terminated in a revision were removed.
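
The three text locations in the table above can be illustrated with a minimal sketch. It assumes a finished text rather than a real keystroke log (so revisions are ignored), treats `.`, `!` and `?` as sentence boundaries, and leaves the very first key of the text unclassified; the function name `classify_locations` is hypothetical.

```python
def classify_locations(text):
    """Label the key interval preceding each character of a finished text.

    Returns one entry per character: "within word", "before word",
    "before sentence", or None (interval not analysed).
    """
    labels = []
    for i, ch in enumerate(text):
        label = None
        if ch.isalpha() and i > 0:
            prev = text[i - 1]
            if prev.isalpha():
                label = "within word"
            elif prev.isspace():
                # Skip back over whitespace to see what ended the previous unit.
                j = i - 1
                while j >= 0 and text[j].isspace():
                    j -= 1
                if j >= 0:
                    label = "before sentence" if text[j] in ".!?" else "before word"
        labels.append(label)
    return labels


example = "The cat meowed. Then it slept."
labels = classify_locations(example)
counts = {name: labels.count(name)
          for name in ("within word", "before word", "before sentence")}
print(counts)
```

The counts reproduce the number of \(^{\wedge}\) markers in each row of the example.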

Models	Description
Serial
M1	Single distribution Gaussian
M2	Single distribution log-Gaussian
M3	Single distribution log-Gaussian with different variance components per text location
Parallel
M4	Two-distributions mixture of log-Gaussians
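
As a rough sketch of how M2 and M4 differ, the log density of one interkey interval can be written as follows. The parameterisation is an assumption made for illustration: a shared fluent mean `beta` on the log scale, a hesitation slowdown `delta`, and a hesitation probability `theta`. All function and parameter names are hypothetical.

```python
import math


def lognormal_logpdf(x, mu, sigma):
    """Log density of a log-Gaussian (M2-style single distribution)."""
    z = (math.log(x) - mu) / sigma
    return -math.log(x) - math.log(sigma) - 0.5 * math.log(2 * math.pi) - 0.5 * z * z


def mixture_logpdf(x, theta, beta, delta, sigma_fluent, sigma_hesitant):
    """Log density of a two-component log-Gaussian mixture (M4-style).

    theta: probability that the interval is a hesitation (0 < theta < 1)
    beta:  mean of fluent intervals on the log scale
    delta: slowdown added to beta for hesitant intervals
    """
    log_hesitant = math.log(theta) + lognormal_logpdf(x, beta + delta, sigma_hesitant)
    log_fluent = math.log1p(-theta) + lognormal_logpdf(x, beta, sigma_fluent)
    # Log-sum-exp keeps the sum of the two components numerically stable.
    top = max(log_hesitant, log_fluent)
    return top + math.log(math.exp(log_hesitant - top) + math.exp(log_fluent - top))


# With no slowdown and equal variances the mixture reduces to a single log-Gaussian.
single = lognormal_logpdf(200.0, 5.0, 0.5)
mixed = mixture_logpdf(200.0, 0.3, 5.0, 0.0, 0.5, 0.5)
print(single, mixed)
```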

Values are the absolute ratio of the difference in predictive performance, measured as \(\widehat{elpd}\) (Vehtari, Gelman, and Gabry 2015, 2017), to its standard error, \(\mid\frac{\Delta\widehat{elpd}}{\text{SE}}\mid\), which corresponds to the \(z\)-score of the change in predictive performance (Sivula et al. 2020).

Data set	Mixture process (M3 vs M4)	Single process (unequal var.; M2 vs M3)	Single process (M1 vs M2)
C2L1	23	13	47
CATO	24	18	43
GE2	27	26	63
LIFT	40	21	46
PLanTra	26	17	64
SPL2	18	25	69
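
The ratio reported in this table can be sketched as below, assuming each model supplies one pointwise \(\widehat{elpd}\) value per observation. The standard error is taken as \(\sqrt{n}\) times the standard deviation of the pointwise differences, which is the usual convention for leave-one-out comparisons; the function name and the input values are made up for illustration.

```python
import math


def elpd_z(pointwise_a, pointwise_b):
    """Absolute ratio of the elpd difference to its standard error."""
    diffs = [a - b for a, b in zip(pointwise_a, pointwise_b)]
    n = len(diffs)
    delta = sum(diffs)                      # difference in summed elpd
    mean = delta / n
    var = sum((d - mean) ** 2 for d in diffs) / (n - 1)
    se = math.sqrt(n * var)                 # standard error of the difference
    return abs(delta / se)


# Illustrative pointwise values only; not taken from any of the six datasets.
model_a = [-1.0, -2.0, -1.5, -0.5]
model_b = [-1.5, -2.2, -2.5, -0.6]
z = elpd_z(model_a, model_b)
print(z)
```

Because the absolute value is taken, the ratio is the same regardless of which model is listed first; the direction of the difference has to be read from the sign of the elpd difference itself.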

Values indicate log BFs in support of the alternative hypothesis over the null hypothesis.
Dataset	Hesitation slowdown (before sentence vs word)	Hesitation slowdown (before vs within word)	Hesitation probability (before sentence vs word)	Hesitation probability (before vs within word)
C2L1	0.11	-2.72	-0.39	46.52
CATO	18.63	-1.6	-0.41	20.8
GE2	33.53	3.29	18.31	26.33
LIFT	-0.69	-0.88	0.47	24.56
PLanTra	13.24	4.02	-1.27	31.88
SPL2	48.28	2.2	5.8	24.28
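
To read these values on the Bayes factor scale, a small helper can exponentiate them. This sketch assumes the log BFs are natural logarithms, which the table does not state; the function name is hypothetical.

```python
import math


def describe_log_bf(log_bf):
    """Return the favoured hypothesis and the Bayes factor in its favour.

    Assumes log_bf is a natural-log Bayes factor for the alternative over the null.
    """
    if log_bf > 0:
        return "alternative", math.exp(log_bf)
    return "null", math.exp(-log_bf)


# Two C2L1 values from the table: hesitation slowdown, before vs within word,
# and hesitation probability, before vs within word.
print(describe_log_bf(-2.72))
print(describe_log_bf(46.52))
```

Positive values favour the alternative hypothesis and negative values favour the null; values close to zero are inconclusive either way.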

Baaijen, Veerle M., David Galbraith, and Kees De Glopper. 2012. “Keystroke Analysis: Reflections on Procedures and Measures.” Written Communication 29 (3): 246–77.

Conijn, Rianne, Jens Roeser, and Menno van Zaanen. 2019. “Understanding the Keystroke Log: The Effect of Writing Task on Keystroke Features.” Reading and Writing 32 (9): 2353–74.

Hayes, John R. 2012. “Evidence from Language Bursts, Revision, and Transcription for Translation and Its Relation to Other Writing Processes.” In Translation of Thought to Written Text While Composing: Advancing Theory, Knowledge, Methods, and Applications, edited by M. Fayol, D. Alamargot, and V. Berninger, 15–25. Psychology Press.

Kaufer, David S., John R. Hayes, and Linda S. Flower. 1986. “Composing Written Sentences.” Research in the Teaching of English, 121–40.

Nottbusch, Guido. 2010. “Grammatical Planning, Execution, and Control in Written Sentence Production.” Reading and Writing 23 (7): 777–801.

Ofstad Oxborough, Gunn Helen, and Mark Torrance. 2011. “Multilevel Analysis of Latency in Writing.” In 21st Annual Meeting of the Society for Text and Discourse. http://textanddiscourse2011.conference.univ-poitiers.fr/PROG_DEFIN.pdf.

Olive, Thierry. 2014. “Toward a Parallel and Cascading Model of the Writing System: A Review of Research on Writing Processes Coordination.” Journal of Writing Research 6 (2): 173–94.

Roeser, Jens, Sven De Maeyer, Mariëlle Leijten, and Luuk Van Waes. 2024. “Modelling Typing Disfluencies as Finite Mixture Process.” Reading and Writing 37 (2): 359–84.

Roeser, Jens, Mark Torrance, and Thom Baguley. 2019. “Advance Planning in Written and Spoken Sentence Production.” Journal of Experimental Psychology: Learning, Memory, and Cognition 45 (11): 1983–2009. https://doi.org/10.1037/xlm0000685.

Rønneberg, Vibeke, Mark Torrance, Per Henning Uppstad, and Christer Johansson. 2022. “The Process-Disruption Hypothesis: How Spelling and Typing Skill Affects Written Composition Process and Product.” Psychological Research 86 (7): 2239–55.

Rossetti, Alessandra, and Luuk Van Waes. 2022. “Text Simplification in Second Language: Process and Product Data.” Zenodo. https://doi.org/10.5281/zenodo.6720290.

Sivula, Tuomas, Måns Magnusson, Asael Alonzo Matamoros, and Aki Vehtari. 2020. “Uncertainty in Bayesian Leave-One-Out Cross-Validation Based Model Comparison.” arXiv Preprint arXiv:2008.10296.

Torrance, Mark, Jens Roeser, and Evgeny Chukharev. n.d. “Lookback Supports Cascaded, Just-in-Time Processing in Second Language Written Composition.”

Torrance, Mark, Vibeke Rønneberg, Christer Johansson, and Per Henning Uppstad. 2016. “Adolescent Weak Decoders Writing in a Shallow Orthography: Process and Product.” Scientific Studies of Reading 20 (5): 375–88.

Van Galen, Gerard P. 1991. “Handwriting: Issues for a Psychomotor Theory.” Human Movement Science 10 (2): 165–91.

Van Waes, Luuk, Mariëlle Leijten, Jens Roeser, Thierry Olive, and Joachim Grabowski. 2021. “Measuring and Assessing Typing Skills in Writing Research.” Journal of Writing Research 13 (1): 107–53. https://doi.org/10.17239/jowr-2021.13.01.04.

Vandermeulen, Nina, Sven De Maeyer, Elke Van Steendam, Marije Lesterhuis, Huub Van den Bergh, and Gert Rijlaarsdam. 2020. “Mapping Synthesis Writing in Various Levels of Dutch Upper-Secondary Education: A National Baseline Study on Text Quality, Writing Process and Students’ Perspectives on Writing.” Pedagogische Studiën: Tijdschrift Voor Onderwijskunde En Opvoedkunde 97 (3): 187–236.

Vehtari, Aki, Andrew Gelman, and Jonah Gabry. 2015. “Pareto Smoothed Importance Sampling.” arXiv Preprint arXiv:1507.02646.

———. 2017. “Practical Bayesian Model Evaluation Using Leave-One-Out Cross-Validation and WAIC.” Statistics and Computing 27 (5): 1413–32.

Slides: rpubs.com/jensroes/sig-paris-2024

How do we produce texts?

Three general planning stages

Higher level info cascades into lower levels

Classic (serial) view of writing

Parallel view of writing (Olive 2014)

Serial view

Parallel view

How can we represent these models statistically?

Serial view: single-process model

Parallel view can be implemented as a two-distribution mixture process (Roeser et al. 2024)

Parallel view: mixture of two processes

How can we evaluate these models?

Six datasets with interkey intervals

Text location classifications

Model overview

Which model showed better performance?

Model comparisons

Model fit to data

How do text locations affect the mixture process?

Mixture distributions

Location estimates by model parameter

Planning text unfolds in parallel to production!

“Maybe mixture models are just always better?”

Simulation

Thank you!

References