Revisiting the Regularity of Student Learning Rate: Sensitivity to Which Observations Are Included

Benjamin W. Domingue; Candace Thille; Cristina Barnard; Guilherme Lichand; Hansol Lee; Lucas Klotz; Yunsung Kim

arxiv: 2605.01690 · v2 · pith:GSLWYP6Znew · submitted 2026-05-03 · 💻 cs.CY

Revisiting the Regularity of Student Learning Rate: Sensitivity to Which Observations Are Included

Hansol Lee , Guilherme Lichand , Cristina Barnard , Lucas Klotz , Candace Thille , Yunsung Kim , Benjamin W. Domingue This is my paper

Pith reviewed 2026-05-21 00:42 UTC · model grok-4.3

classification 💻 cs.CY

keywords student learning ratesmixed-effects modelsobservational practice datasensitivity to observationsinitial knowledge variationAdditive Factors Modeleducational datasets

0 comments

The pith

Estimates of student variation in learning rate from practice data change sharply depending on which observations the model includes, while initial knowledge estimates remain stable.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether estimates of how much individual students differ in starting knowledge and in learning speed, drawn from mixed-effects models on observational practice logs, are stable properties of the students or artifacts of modeling choices. It reanalyzes 27 educational datasets previously used to claim that students vary far more in initial knowledge than in learning rate. Refitting the identical model under two altered rules for selecting which practice trials per student and skill to keep shows that initial-knowledge variation stays roughly constant. Learning-rate variation, however, rises by a median of 118 percent under one rule and by several times under the other. The finding matters because these model-derived numbers are increasingly used to describe real differences among learners.

Core claim

When the individual Additive Factors Model is applied to the same 27 datasets but with different rules for how many observations of each student's practice on a given skill are retained, the estimated variance across students in learning rate increases markedly while the estimated variance in initial knowledge does not. One specification raises the learning-rate variance by a median of 118 percent; the second raises it several-fold. The same model and data therefore produce substantially different pictures of how much students differ in learning speed solely because of the choice of which observations enter the fit.

What carries the argument

The individual Additive Factors Model (iAFM), a mixed-effects regression that fits student-specific intercepts for initial knowledge and student-specific slopes for learning rate to sequences of practice observations.

Load-bearing premise

The model is correctly specified, so that differences in estimates arise mainly from which observations are kept rather than from changes in sample composition or unmodeled data features.

What would settle it

A side-by-side test of which observation-inclusion rule yields better out-of-sample predictions of future student performance on held-out practice trials.

Figures

Figures reproduced from arXiv: 2605.01690 by Benjamin W. Domingue, Candace Thille, Cristina Barnard, Guilherme Lichand, Hansol Lee, Lucas Klotz, Yunsung Kim.

**Figure 1.** Figure 1: Replication of reported IQR of student learning view at source ↗

**Figure 1.** Figure 1: Empirical distribution of student-slope BLUPs [PITH_FULL_IMAGE:figures/full_fig_p005_1.png] view at source ↗

**Figure 2.** Figure 2: Distribution of student–KC pair lengths across all 26 datasets. view at source ↗

**Figure 3.** Figure 3: IQR of student random effects from the full model ( view at source ↗

**Figure 4.** Figure 4: iAFM parameter estimates under full and truncated ( [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 4.** Figure 4: Estimated fixed effects (population-average learn view at source ↗

**Figure 5.** Figure 5: iAFM parameter estimates within the short-pair stratum ( [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗

**Figure 6.** Figure 6: Practice-length distribution heaviness ( view at source ↗

read the original abstract

Mixed-effects models fit to observational practice data are widely used in learning analytics to estimate student-level variation in initial knowledge and learning rate, and the resulting estimates increasingly inform substantive claims about learners. We examine whether such estimates can be read as properties of learners or whether they depend on choices about which observations the model is fit to. As a case study, we revisit the ``astonishing regularity'' reported by Koedinger et al. (2023): that students vary substantially in initial knowledge but much less in learning rate. The finding is based on fits of the individual Additive Factors Model (iAFM) to 27 educational datasets, and rests on a model-derived estimate of student-level learning-rate variation being small in absolute terms. We refit the same model on the same datasets under two specifications, each varying how much of each student's practice on a given skill is used in fitting. The estimate of student-level variation in initial knowledge stays approximately stable across both specifications. The estimate of student-level variation in learning rate does not: it inflates by a median of 118\% under one specification and is several times larger under the other. The same model, fit to the same data, returns substantially different estimates of how much students vary in learning rate depending on which observations are included. When estimates from mixed-effects models on observational practice data are used to support substantive claims about learners, sensitivity to such choices deserves a central place in how those estimates are reported and read.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Refitting the same iAFM on the same 27 datasets under different observation-inclusion rules shows learning-rate variance inflating by 118% median or more while initial-knowledge variance holds steady.

read the letter

The main point is that the same model on the same data gives substantially different estimates of how much students vary in learning rate once you change which observations get kept. Initial knowledge variance stays stable, but learning rate variance does not. That is the concrete result here. They take the Koedinger et al. setup and simply vary the per-student-skill observation count rules, then refit. The demonstration is direct and uses the existing public datasets, so the comparison is clean. No new model or parameter is introduced, which keeps the exercise focused on robustness rather than novelty for its own sake. The quantitative shifts reported are the useful part: a median 118% increase under one rule and several times larger under the other. That kind of check matters when these variance estimates start feeding into claims about learners. A remaining question is whether the inclusion rules also change the effective sample or the number of observations per random-effect group. In mixed models, variance estimates for slopes can shift with within-group sample size because of shrinkage. If one specification systematically retains more data points or a different subset of students, part of the inflation could trace to that rather than to intrinsic sensitivity of the model. The abstract does not flag whether the student set or opportunity distributions were held fixed, so that detail would help readers judge how much of the change is pure model response. This work is for people who fit or rely on mixed-effects models in learning analytics and want to know how stable the variance components are under ordinary data-handling choices. Readers already working with these datasets or similar practice logs will find the comparison directly relevant. It is worth sending for peer review because the central sensitivity result is grounded in explicit refits and raises a practical reporting issue without overclaiming.

Referee Report

2 major / 2 minor

Summary. The paper claims that student-level variation in learning rate estimated from the individual Additive Factors Model (iAFM) on 27 educational datasets is highly sensitive to choices about which observations are included in the fit. Refitting the identical model under two alternative specifications for observation inclusion (varying how much of each student's practice on a given skill is retained) produces a median 118% inflation in the learning-rate variance estimate under one rule and several-times-larger values under the other, while the estimate of student-level variation in initial knowledge remains approximately stable. The authors conclude that such estimates cannot be read as intrinsic properties of learners without explicit attention to sensitivity to data-inclusion decisions.

Significance. If the central sensitivity result holds after addressing sample-composition controls, the work provides a concrete, reproducible demonstration that mixed-effects estimates of learner differences in observational practice data can shift substantially with routine modeling choices. This strengthens the case for routine robustness reporting in learning analytics and supplies a direct counter-example to the 'astonishing regularity' claim in Koedinger et al. (2023). The use of publicly referenced datasets and identical model re-fits is a methodological strength.

major comments (2)

[Methods / Results] Methods section (and any results tables reporting variance components): the manuscript must explicitly state and verify whether the set of students and the distribution of skill-opportunity counts are held fixed across the three inclusion specifications. If restricting observations per student-skill pair causes some students to drop below inclusion thresholds or alters the balance of observations per random-effect group, the reported inflation in learning-rate variance could partly reflect changes in effective sample size and shrinkage rather than pure model sensitivity. The abstract notes stability of initial-knowledge variance but does not report diagnostics confirming fixed student sets or within-student observation counts.
[Results] Results (variance-component tables or figures): provide the per-dataset student counts, mean observations per student-skill pair, and effective sample sizes under each inclusion rule. Without these, it is impossible to rule out that the 118% median inflation (or larger multiples) arises from reduced shrinkage in the fuller-inclusion condition rather than intrinsic sensitivity of the iAFM random-slope variance.

minor comments (2)

[Methods] Clarify the exact two specifications used for 'varying how much of each student's practice' (e.g., first-N vs. all observations, or minimum-count thresholds) with precise pseudocode or equations.
[Results] Add a short paragraph comparing the new variance estimates to the original Koedinger et al. (2023) numbers to make the magnitude of change immediately interpretable.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments, which help clarify important aspects of our sensitivity analysis. We address each major comment below and have revised the manuscript accordingly to improve transparency regarding sample composition.

read point-by-point responses

Referee: [Methods / Results] Methods section (and any results tables reporting variance components): the manuscript must explicitly state and verify whether the set of students and the distribution of skill-opportunity counts are held fixed across the three inclusion specifications. If restricting observations per student-skill pair causes some students to drop below inclusion thresholds or alters the balance of observations per random-effect group, the reported inflation in learning-rate variance could partly reflect changes in effective sample size and shrinkage rather than pure model sensitivity. The abstract notes stability of initial-knowledge variance but does not report diagnostics confirming fixed student sets or within-student observation counts.

Authors: We agree that explicitly confirming the fixed student set and reporting relevant diagnostics is necessary to isolate the effect of observation-inclusion rules from potential changes in effective sample size or shrinkage. In the original analysis, the set of students and skills was held constant across the three inclusion specifications; we varied only the number of observations retained per student-skill pair (e.g., limiting to the first k opportunities) without imposing minimum thresholds that would drop students. We have now added explicit statements to the Methods section verifying that no students were excluded due to the inclusion rules and that the student set remains identical. We have also included a brief verification note that the number of students per dataset is unchanged. While the distribution of observations per student-skill pair necessarily varies by design, the stability of the initial-knowledge variance component across specifications provides supporting evidence that sample-composition shifts are not the primary driver of the reported changes in learning-rate variance. revision: yes
Referee: [Results] Results (variance-component tables or figures): provide the per-dataset student counts, mean observations per student-skill pair, and effective sample sizes under each inclusion rule. Without these, it is impossible to rule out that the 118% median inflation (or larger multiples) arises from reduced shrinkage in the fuller-inclusion condition rather than intrinsic sensitivity of the iAFM random-slope variance.

Authors: We accept this recommendation and have added the requested information to the revised manuscript. A new supplementary table now reports, for each of the 27 datasets, the number of students, the mean observations per student-skill pair, and the total observations (as a proxy for effective sample size) under each of the three inclusion rules. This allows direct assessment of how observation counts change and helps readers evaluate the potential contribution of differential shrinkage. We continue to interpret the differential sensitivity—large changes in learning-rate variance but stability in initial-knowledge variance—as evidence of intrinsic model sensitivity to inclusion decisions rather than a pure artifact of sample size. revision: yes

Circularity Check

0 steps flagged

No significant circularity in empirical sensitivity analysis

full rationale

The paper conducts an empirical re-analysis by refitting the individual Additive Factors Model (iAFM) to the same 27 datasets under two alternative observation-inclusion specifications. The reported result—that student-level variance in learning rate inflates substantially (median 118% or more) while initial-knowledge variance remains stable—is obtained directly from the new parameter estimates on the altered data subsets. No step reduces by construction to a fitted input, self-definition, or load-bearing self-citation; the original Koedinger et al. (2023) finding is treated as an external benchmark that is then tested for robustness. The derivation chain is therefore self-contained and falsifiable against the public datasets.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The paper relies on the standard assumptions of the individual Additive Factors Model as a mixed-effects representation of student practice data; no new free parameters or invented entities are introduced in the sensitivity analysis itself.

axioms (1)

domain assumption The individual Additive Factors Model (iAFM) is an appropriate mixed-effects model for estimating student-level initial knowledge and learning rate from observational practice data.
The analysis builds directly on this model from Koedinger et al. (2023) without re-deriving its functional form or error structure.

pith-pipeline@v0.9.0 · 5818 in / 1264 out tokens · 39006 ms · 2026-05-21T00:42:47.524344+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

42 extracted references · 42 canonical work pages

[1]

Vincent Aleven, Elmar Stahl, Silke Schworm, Frank Fischer, and Raven Wallace

work page
[2]

Help seeking and help design in interactive learning environments.Review of Educational Research73, 3 (2003), 277–320

work page 2003
[3]

Anderson

John R. Anderson. 1982. Acquisition of cognitive skill.Psychological Review89, 4 (1982), 369–406

work page 1982
[4]

Ryan S Baker and Aaron Hawn. 2022. Algorithmic bias in education.International Journal of Artificial Intelligence in Education32, 4 (2022), 1052–1092

work page 2022
[5]

Ryan S. J. d. Baker. 2007. Modeling and understanding students’ off-task behavior in intelligent tutoring systems. InProceedings of the ACM CHI Conference on Human Factors in Computing Systems. 1059–1068. Revisiting the Regularity of Student Learning Rate L@S ’26, June 29–July 3, 2026, Seoul, Republic of Korea

work page 2007
[6]

Ryan S. J. d. Baker, Albert T. Corbett, Kenneth R. Koedinger, Shelley Evenson, Ido Roll, Angela Z. Wagner, Meghan Naim, Jay Raspat, Daniel J. Baker, and Joseph E. Beck. 2006. Adapting to when students game an intelligent tutoring system. InProceedings of the International Conference on Intelligent Tutoring Systems. Springer, 392–401

work page 2006
[7]

Douglas Bates, Martin Mächler, Ben Bolker, and Steve Walker. 2015. Fitting linear mixed-effects models using lme4.Journal of Statistical Software67, 1 (2015), 1–48

work page 2015
[8]

Joseph E Beck and Kai-min Chang. 2007. Identifiability: A fundamental problem of student modeling. InInternational Conference on User Modeling. Springer, 137–146

work page 2007
[9]

Beck and Yue Gong

Joseph E. Beck and Yue Gong. 2013. Wheel-spinning: Students who fail to master a skill. InProceedings of the International Conference on Artificial Intelligence in Education. Springer, 431–440

work page 2013
[10]

Block and Robert B

James H. Block and Robert B. Burns. 1976. Mastery learning.Review of Research in Education4 (1976), 3–49

work page 1976
[11]

Hao Cen, Kenneth Koedinger, and Brian Junker. 2006. Learning Factors Analysis: a general method for cognitive model evaluation and improvement. InProceedings of the International Conference on Intelligent Tutoring Systems. Springer, 164–175

work page 2006
[12]

Koedinger, and Brian Junker

Hao Cen, Kenneth R. Koedinger, and Brian Junker. 2007. Is over practice necessary?—Improving learning efficiency with the Cognitive Tutor through educational data mining.Frontiers in Artificial Intelligence and Applications158 (2007), 511

work page 2007
[13]

Koedinger, Geoffrey J

Min Chi, Kenneth R. Koedinger, Geoffrey J. Gordon, Pamela Jordan, and Kurt VanLehn. 2011. Instructional factors analysis: a cognitive model for multiple instructional interventions. InProceedings of the 4th International Conference on Educational Data Mining (EDM). 61–70

work page 2011
[14]

Peter Diggle and Michael G. Kenward. 1994. Informative drop-out in longitudinal data analysis.Journal of the Royal Statistical Society: Series C (Applied Statistics) 43, 1 (1994), 49–73

work page 1994
[15]

Tomáš Effenberger, Radek Pelánek, and Jaroslav Čechák. 2020. Exploration of the robustness and generalizability of the additive factors model. InProceedings of the Tenth International Conference on Learning Analytics & Knowledge. 472–479

work page 2020
[16]

Evans, Scott D

Nathan J. Evans, Scott D. Brown, Douglas J. K. Mewhort, and Andrew Heathcote

work page
[17]

Refining the law of practice.Psychological Review125, 4 (2018), 592–605

work page 2018
[18]

Fitts and Michael I

Paul M. Fitts and Michael I. Posner. 1967.Human Performance. Brooks/Cole, Belmont, CA

work page 1967
[19]

Theodore W. Frick. 1990. A comparison of three decision models for adapting the length of computer-based mastery tests.Journal of Educational Computing Research6, 4 (1990), 479–513

work page 1990
[20]

April Galyardt and Ilya Goldin. 2015. Move your lamp post: recent data reflects learner knowledge better than older data.Journal of Educational Data Mining7, 2 (2015), 83–108

work page 2015
[21]

Carvalho

Gillian Gold, Conrad Borchers, and Paulo F. Carvalho. 2024. Further evidence for regularity in student learning rates across demographic, academic proficiency, and motivational groups. InCompanion Proceedings of the 14th International Conference on Learning Analytics & Knowledge (LAK24)

work page 2024
[22]

Cyril Goutte, Guillaume Durand, and Serge Léger. 2018. On the learning curve attrition bias in additive factor modeling. InProceedings of the International Conference on Artificial Intelligence in Education. Springer, 109–113

work page 2018
[23]

Andrew Heathcote, Scott Brown, and D. J. K. Mewhort. 2000. The power law repealed: the case for an exponential law of practice.Psychonomic Bulletin & Review7, 2 (2000), 185–207

work page 2000
[24]

Koedinger, and Markus Gross

Tanja Käser, Kenneth R. Koedinger, and Markus Gross. 2014. Different parameters—same prediction: an analysis of learning curves. InProceedings of the 7th International Conference on Educational Data Mining. 52–59

work page 2014
[25]

Kizilcec and Hansol Lee

René F. Kizilcec and Hansol Lee. 2022. Algorithmic fairness in education. InThe Ethics of Artificial Intelligence in Education. Routledge, 174–202

work page 2022
[26]

Koedinger, Ryan S

Kenneth R. Koedinger, Ryan S. J. d. Baker, Kyle Cunningham, Alida Skogsholm, Brett Leber, and John Stamper. 2010. A data repository for the EDM community: the PSLC DataShop. InHandbook of Educational Data Mining, Cristóbal Romero, Sebastián Ventura, Mykola Pechenizkiy, and Ryan S. J. d. Baker (Eds.). CRC Press, 43–56

work page 2010
[27]

Koedinger, Emma Brunskill, Ryan S

Kenneth R. Koedinger, Emma Brunskill, Ryan S. J. d. Baker, Elizabeth A. McLaugh- lin, and John Stamper. 2013. New potentials for data-driven intelligent tutoring system development and optimization.AI Magazine34, 3 (2013), 27–41

work page 2013
[28]

Koedinger, Paulo F

Kenneth R. Koedinger, Paulo F. Carvalho, Ran Liu, and Elizabeth A. McLaughlin

work page
[29]

Overcoming catastrophic forgetting in neural networks

An astonishing regularity in student learning rate.Proceedings of the National Academy of Sciences120, 13 (2023), e2221311120. doi:10.1073/pnas. 2221311120

work page doi:10.1073/pnas 2023
[30]

Roderick J. A. Little. 1993. Pattern-mixture models for multivariate incomplete data.J. Amer. Statist. Assoc.88, 421 (1993), 125–134

work page 1993
[31]

Koedinger

Ran Liu and Kenneth R. Koedinger. 2015. Variations in learning rate: student classification based on systematic residual error patterns across practice oppor- tunities.International Educational Data Mining Society(2015)

work page 2015
[32]

Koedinger

Ran Liu and Kenneth R. Koedinger. 2017. Towards reliable and valid measurement of individualized student parameters. InProceedings of the 10th International Conference on Educational Data Mining. 135–142

work page 2017
[33]

Charles Murray, Steven Ritter, Tristan Nixon, Ryan Schwiebert, Robert G

R. Charles Murray, Steven Ritter, Tristan Nixon, Ryan Schwiebert, Robert G. M. Hausmann, Brendon Towle, Stephen E. Fancsali, and Annalies Vuong. 2013. Revealing the learning in learning curves. InInternational Conference on Artificial Intelligence in Education. Springer, 473–482

work page 2013
[34]

Rosenbloom

Allen Newell and Paul S. Rosenbloom. 1981. Mechanisms of skill acquisition and the law of practice. InCognitive Skills and Their Acquisition, John R. Anderson (Ed.). Psychology Press, 1–55

work page 1981
[35]

Pavlik, Hao Cen, and Kenneth R

Philip I. Pavlik, Hao Cen, and Kenneth R. Koedinger. 2009. Performance factors analysis—a new alternative to knowledge tracing. InProceedings of the 14th International Conference on Artificial Intelligence in Education (AIED). IOS Press, 531–538

work page 2009
[36]

Radek Pelánek. 2018. The details matter: methodological nuances in the evalua- tion of student models.User Modeling and User-Adapted Interaction28, 3 (2018), 207–235

work page 2018
[37]

2012.Joint Models for Longitudinal and Time-to-Event Data: With Applications in R

Dimitris Rizopoulos. 2012.Joint Models for Longitudinal and Time-to-Event Data: With Applications in R. CRC Press

work page 2012
[38]

astonishing regularity in student learning rate

Mary Ann Simpson, Kole A. Norberg, and Stephen E. Fancsali. 2024. Replicating an “astonishing regularity in student learning rate”. InProceedings of the 17th International Conference on Educational Data Mining. 420–425

work page 2024
[39]

Tsiatis and Marie Davidian

Anastasios A. Tsiatis and Marie Davidian. 2004. Joint modeling of longitudinal and time-to-event data: an overview.Statistica Sinica14 (2004), 809–834

work page 2004
[40]

Kurt VanLehn. 2006. The behavior of tutoring systems.International Journal of Artificial Intelligence in Education16, 3 (2006), 227–265

work page 2006
[41]

Yi, and Yangxin Huang

Lang Wu, Wei Liu, Grace Y. Yi, and Yangxin Huang. 2012. Analysis of longitudinal and survival data: joint modeling, inference methods, and issues.Journal of Probability and Statistics2012 (2012), 640153

work page 2012
[42]

Wu and Raymond J

Margaret C. Wu and Raymond J. Carroll. 1988. Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process.Biometrics44, 1 (1988), 175–188

work page 1988

[1] [1]

Vincent Aleven, Elmar Stahl, Silke Schworm, Frank Fischer, and Raven Wallace

work page

[2] [2]

Help seeking and help design in interactive learning environments.Review of Educational Research73, 3 (2003), 277–320

work page 2003

[3] [3]

Anderson

John R. Anderson. 1982. Acquisition of cognitive skill.Psychological Review89, 4 (1982), 369–406

work page 1982

[4] [4]

Ryan S Baker and Aaron Hawn. 2022. Algorithmic bias in education.International Journal of Artificial Intelligence in Education32, 4 (2022), 1052–1092

work page 2022

[5] [5]

Ryan S. J. d. Baker. 2007. Modeling and understanding students’ off-task behavior in intelligent tutoring systems. InProceedings of the ACM CHI Conference on Human Factors in Computing Systems. 1059–1068. Revisiting the Regularity of Student Learning Rate L@S ’26, June 29–July 3, 2026, Seoul, Republic of Korea

work page 2007

[6] [6]

Ryan S. J. d. Baker, Albert T. Corbett, Kenneth R. Koedinger, Shelley Evenson, Ido Roll, Angela Z. Wagner, Meghan Naim, Jay Raspat, Daniel J. Baker, and Joseph E. Beck. 2006. Adapting to when students game an intelligent tutoring system. InProceedings of the International Conference on Intelligent Tutoring Systems. Springer, 392–401

work page 2006

[7] [7]

Douglas Bates, Martin Mächler, Ben Bolker, and Steve Walker. 2015. Fitting linear mixed-effects models using lme4.Journal of Statistical Software67, 1 (2015), 1–48

work page 2015

[8] [8]

Joseph E Beck and Kai-min Chang. 2007. Identifiability: A fundamental problem of student modeling. InInternational Conference on User Modeling. Springer, 137–146

work page 2007

[9] [9]

Beck and Yue Gong

Joseph E. Beck and Yue Gong. 2013. Wheel-spinning: Students who fail to master a skill. InProceedings of the International Conference on Artificial Intelligence in Education. Springer, 431–440

work page 2013

[10] [10]

Block and Robert B

James H. Block and Robert B. Burns. 1976. Mastery learning.Review of Research in Education4 (1976), 3–49

work page 1976

[11] [11]

Hao Cen, Kenneth Koedinger, and Brian Junker. 2006. Learning Factors Analysis: a general method for cognitive model evaluation and improvement. InProceedings of the International Conference on Intelligent Tutoring Systems. Springer, 164–175

work page 2006

[12] [12]

Koedinger, and Brian Junker

Hao Cen, Kenneth R. Koedinger, and Brian Junker. 2007. Is over practice necessary?—Improving learning efficiency with the Cognitive Tutor through educational data mining.Frontiers in Artificial Intelligence and Applications158 (2007), 511

work page 2007

[13] [13]

Koedinger, Geoffrey J

Min Chi, Kenneth R. Koedinger, Geoffrey J. Gordon, Pamela Jordan, and Kurt VanLehn. 2011. Instructional factors analysis: a cognitive model for multiple instructional interventions. InProceedings of the 4th International Conference on Educational Data Mining (EDM). 61–70

work page 2011

[14] [14]

Peter Diggle and Michael G. Kenward. 1994. Informative drop-out in longitudinal data analysis.Journal of the Royal Statistical Society: Series C (Applied Statistics) 43, 1 (1994), 49–73

work page 1994

[15] [15]

Tomáš Effenberger, Radek Pelánek, and Jaroslav Čechák. 2020. Exploration of the robustness and generalizability of the additive factors model. InProceedings of the Tenth International Conference on Learning Analytics & Knowledge. 472–479

work page 2020

[16] [16]

Evans, Scott D

Nathan J. Evans, Scott D. Brown, Douglas J. K. Mewhort, and Andrew Heathcote

work page

[17] [17]

Refining the law of practice.Psychological Review125, 4 (2018), 592–605

work page 2018

[18] [18]

Fitts and Michael I

Paul M. Fitts and Michael I. Posner. 1967.Human Performance. Brooks/Cole, Belmont, CA

work page 1967

[19] [19]

Theodore W. Frick. 1990. A comparison of three decision models for adapting the length of computer-based mastery tests.Journal of Educational Computing Research6, 4 (1990), 479–513

work page 1990

[20] [20]

April Galyardt and Ilya Goldin. 2015. Move your lamp post: recent data reflects learner knowledge better than older data.Journal of Educational Data Mining7, 2 (2015), 83–108

work page 2015

[21] [21]

Carvalho

Gillian Gold, Conrad Borchers, and Paulo F. Carvalho. 2024. Further evidence for regularity in student learning rates across demographic, academic proficiency, and motivational groups. InCompanion Proceedings of the 14th International Conference on Learning Analytics & Knowledge (LAK24)

work page 2024

[22] [22]

Cyril Goutte, Guillaume Durand, and Serge Léger. 2018. On the learning curve attrition bias in additive factor modeling. InProceedings of the International Conference on Artificial Intelligence in Education. Springer, 109–113

work page 2018

[23] [23]

Andrew Heathcote, Scott Brown, and D. J. K. Mewhort. 2000. The power law repealed: the case for an exponential law of practice.Psychonomic Bulletin & Review7, 2 (2000), 185–207

work page 2000

[24] [24]

Koedinger, and Markus Gross

Tanja Käser, Kenneth R. Koedinger, and Markus Gross. 2014. Different parameters—same prediction: an analysis of learning curves. InProceedings of the 7th International Conference on Educational Data Mining. 52–59

work page 2014

[25] [25]

Kizilcec and Hansol Lee

René F. Kizilcec and Hansol Lee. 2022. Algorithmic fairness in education. InThe Ethics of Artificial Intelligence in Education. Routledge, 174–202

work page 2022

[26] [26]

Koedinger, Ryan S

Kenneth R. Koedinger, Ryan S. J. d. Baker, Kyle Cunningham, Alida Skogsholm, Brett Leber, and John Stamper. 2010. A data repository for the EDM community: the PSLC DataShop. InHandbook of Educational Data Mining, Cristóbal Romero, Sebastián Ventura, Mykola Pechenizkiy, and Ryan S. J. d. Baker (Eds.). CRC Press, 43–56

work page 2010

[27] [27]

Koedinger, Emma Brunskill, Ryan S

Kenneth R. Koedinger, Emma Brunskill, Ryan S. J. d. Baker, Elizabeth A. McLaugh- lin, and John Stamper. 2013. New potentials for data-driven intelligent tutoring system development and optimization.AI Magazine34, 3 (2013), 27–41

work page 2013

[28] [28]

Koedinger, Paulo F

Kenneth R. Koedinger, Paulo F. Carvalho, Ran Liu, and Elizabeth A. McLaughlin

work page

[29] [29]

Overcoming catastrophic forgetting in neural networks

An astonishing regularity in student learning rate.Proceedings of the National Academy of Sciences120, 13 (2023), e2221311120. doi:10.1073/pnas. 2221311120

work page doi:10.1073/pnas 2023

[30] [30]

Roderick J. A. Little. 1993. Pattern-mixture models for multivariate incomplete data.J. Amer. Statist. Assoc.88, 421 (1993), 125–134

work page 1993

[31] [31]

Koedinger

Ran Liu and Kenneth R. Koedinger. 2015. Variations in learning rate: student classification based on systematic residual error patterns across practice oppor- tunities.International Educational Data Mining Society(2015)

work page 2015

[32] [32]

Koedinger

Ran Liu and Kenneth R. Koedinger. 2017. Towards reliable and valid measurement of individualized student parameters. InProceedings of the 10th International Conference on Educational Data Mining. 135–142

work page 2017

[33] [33]

Charles Murray, Steven Ritter, Tristan Nixon, Ryan Schwiebert, Robert G

R. Charles Murray, Steven Ritter, Tristan Nixon, Ryan Schwiebert, Robert G. M. Hausmann, Brendon Towle, Stephen E. Fancsali, and Annalies Vuong. 2013. Revealing the learning in learning curves. InInternational Conference on Artificial Intelligence in Education. Springer, 473–482

work page 2013

[34] [34]

Rosenbloom

Allen Newell and Paul S. Rosenbloom. 1981. Mechanisms of skill acquisition and the law of practice. InCognitive Skills and Their Acquisition, John R. Anderson (Ed.). Psychology Press, 1–55

work page 1981

[35] [35]

Pavlik, Hao Cen, and Kenneth R

Philip I. Pavlik, Hao Cen, and Kenneth R. Koedinger. 2009. Performance factors analysis—a new alternative to knowledge tracing. InProceedings of the 14th International Conference on Artificial Intelligence in Education (AIED). IOS Press, 531–538

work page 2009

[36] [36]

Radek Pelánek. 2018. The details matter: methodological nuances in the evalua- tion of student models.User Modeling and User-Adapted Interaction28, 3 (2018), 207–235

work page 2018

[37] [37]

2012.Joint Models for Longitudinal and Time-to-Event Data: With Applications in R

Dimitris Rizopoulos. 2012.Joint Models for Longitudinal and Time-to-Event Data: With Applications in R. CRC Press

work page 2012

[38] [38]

astonishing regularity in student learning rate

Mary Ann Simpson, Kole A. Norberg, and Stephen E. Fancsali. 2024. Replicating an “astonishing regularity in student learning rate”. InProceedings of the 17th International Conference on Educational Data Mining. 420–425

work page 2024

[39] [39]

Tsiatis and Marie Davidian

Anastasios A. Tsiatis and Marie Davidian. 2004. Joint modeling of longitudinal and time-to-event data: an overview.Statistica Sinica14 (2004), 809–834

work page 2004

[40] [40]

Kurt VanLehn. 2006. The behavior of tutoring systems.International Journal of Artificial Intelligence in Education16, 3 (2006), 227–265

work page 2006

[41] [41]

Yi, and Yangxin Huang

Lang Wu, Wei Liu, Grace Y. Yi, and Yangxin Huang. 2012. Analysis of longitudinal and survival data: joint modeling, inference methods, and issues.Journal of Probability and Statistics2012 (2012), 640153

work page 2012

[42] [42]

Wu and Raymond J

Margaret C. Wu and Raymond J. Carroll. 1988. Estimation and comparison of changes in the presence of informative right censoring by modeling the censoring process.Biometrics44, 1 (1988), 175–188

work page 1988