and Bergen, Benjamin K

Chang, Tyler A · 2024 · DOI 10.1162/coli_a_00492

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Child-directed speech facilitates production, not comprehension, in BabyLMs

cs.CL · 2026-05-31 · unverdicted · novelty 6.0

CDS-trained BabyLMs show earlier and more appropriate production in a new frame-completion task while FineWeb-edu models lead on comprehension benchmarks, indicating current tests underestimate CDS benefits.

How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework

cs.CL · 2026-05-22 · unverdicted · novelty 6.0

A new evaluation framework using MMD on Biber features shows LLMs deviate from human linguistic distributions across registers, with closest models varying by register rather than size.

Daily and Weekly Periodicity in Large Language Model Performance and Its Implications for Research

stat.AP · 2026-02-06 · unverdicted · novelty 5.0

GPT-4o exhibits daily and weekly periodic fluctuations in performance on a fixed physics task, accounting for about 20% of observed variance.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Child-directed speech facilitates production, not comprehension, in BabyLMs cs.CL · 2026-05-31 · unverdicted · none · ref 159
CDS-trained BabyLMs show earlier and more appropriate production in a new frame-completion task while FineWeb-edu models lead on comprehension benchmarks, indicating current tests underestimate CDS benefits.
How Human-Like Are Large Language Models? A Register-Aware Linguistic Evaluation Framework cs.CL · 2026-05-22 · unverdicted · none · ref 107
A new evaluation framework using MMD on Biber features shows LLMs deviate from human linguistic distributions across registers, with closest models varying by register rather than size.

and Bergen, Benjamin K

fields

years

verdicts

representative citing papers

citing papers explorer