The "Small World of Words" German Free-Association Norms
Pith reviewed 2026-05-10 03:13 UTC · model grok-4.3
The pith
German free-association norms for 5,877 words predict performance in lexical decision tasks, relatedness judgments, and word ratings.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors collected free-association responses for 5,877 German cue words and demonstrated that these norms robustly predict performance in lexical decision tasks, relatedness judgments, and psycholinguistic word ratings. The SWOW-DE dataset compares favorably with existing German resources and shows both shared and language-specific association patterns in preliminary cross-linguistic comparisons.
What carries the argument
The SWOW-DE free-association norms, which record people's spontaneous word associations to German cues and function as data to model semantic structure and forecast behavioral outcomes in cognitive experiments.
If this is right
- SWOW-DE enables more accurate modeling of semantic processing for German speakers.
- The resource supports cross-linguistic studies by allowing comparisons of association patterns across languages.
- Researchers can use the norms to validate or extend other German psycholinguistic datasets.
- Future work on cultural influences in word associations can build directly on this collection.
Where Pith is reading between the lines
- The norms might help improve German-language AI systems by adding association-based semantic knowledge.
- Applying the same validation approach to other languages could standardize cross-cultural semantic research.
- Investigating how these associations change with age or dialect within German speakers would extend the findings.
Load-bearing premise
The online participant sample and chosen cues produce representative German associations, and the preprocessing steps remove noise without altering the underlying associative structure.
What would settle it
New free-association data gathered from an offline, demographically matched German sample that shows substantially weaker correlations with independent lexical decision latencies or relatedness ratings would falsify the predictive robustness.
read the original abstract
Free-association norms provide essential empirical data for investigating linguistic, semantic, and cultural phenomena in the cognitive sciences. Although large-scale norms exist for languages such as English, Dutch, Spanish, and Mandarin Chinese, no comparable resource has been available for German. To address this gap, we present free-association norms for 5,877 German cue words as part of the German version of the multilingual Small World of Words (SWOW) project. We describe the data collection procedures, participant characteristics, and our comprehensive preprocessing pipeline before introducing the resulting SWOW-DE data set. Using data from three established psycholinguistic paradigms, we show that SWOW-DE norms robustly predict performance in lexical decision tasks, relatedness judgments, and psycholinguistic word ratings. Furthermore, we demonstrate that SWOW-DE responses compare favorably with existing German resources and provide a preliminary cross-linguistic comparison revealing both shared and language-specific association patterns, highlighting promising directions for future research. Overall, SWOW-DE represents the largest collection of German free associations to date and offers a unique resource for linguistic, psychological, and cross-cultural research.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces SWOW-DE, a large-scale free-association norm dataset for 5,877 German cue words collected via the multilingual Small World of Words project. It details participant recruitment, data collection procedures, and a preprocessing pipeline, then validates the norms by showing they predict performance in lexical decision tasks, relatedness judgments, and psycholinguistic word ratings from three established paradigms. The norms are shown to compare favorably with existing German resources, with a preliminary cross-linguistic comparison highlighting shared and language-specific patterns.
Significance. If the predictive validations hold after appropriate controls, SWOW-DE would fill a critical gap as the largest German free-association resource, supporting research on semantic networks, lexical processing, and cross-cultural psycholinguistics. The multi-paradigm validation and cross-linguistic angle add utility beyond simple norm collection.
major comments (2)
- [Abstract and validation sections] Abstract and validation sections: the claim that SWOW-DE norms 'robustly predict' performance in lexical decision tasks, relatedness judgments, and word ratings does not address controls for lexical confounds. Lexical decision RTs are dominated by frequency, length, and neighborhood density; without reporting incremental validity (e.g., regressions adding association measures after these covariates) or partial correlations, the unique contribution of the free-association structure cannot be established and the 'robust' prediction claim is undermined.
- [Methods and Results] Methods and Results: no sample sizes, exclusion criteria, statistical details, or error bars are provided for the participant sample or the three validation analyses. This absence prevents assessment of whether the online sample and cue selection yield representative norms or whether the reported predictions are statistically reliable.
minor comments (2)
- [Abstract] Abstract: the phrase 'comprehensive preprocessing pipeline' is used without listing key steps (e.g., response cleaning, cue selection criteria); a one-sentence summary would improve readability.
- [Cross-linguistic comparison] Cross-linguistic comparison: the preliminary analysis is mentioned but lacks quantitative metrics or example pairs of shared vs. language-specific associations; a small table would clarify the patterns.
Simulated Author's Rebuttal
We thank the referee for their constructive and detailed feedback, which has helped us identify areas where the manuscript can be strengthened. We address each major comment below and describe the revisions we will implement.
read point-by-point responses
-
Referee: [Abstract and validation sections] Abstract and validation sections: the claim that SWOW-DE norms 'robustly predict' performance in lexical decision tasks, relatedness judgments, and word ratings does not address controls for lexical confounds. Lexical decision RTs are dominated by frequency, length, and neighborhood density; without reporting incremental validity (e.g., regressions adding association measures after these covariates) or partial correlations, the unique contribution of the free-association structure cannot be established and the 'robust' prediction claim is undermined.
Authors: We agree that the current validation analyses do not sufficiently control for lexical confounds such as word frequency, length, and neighborhood density, which limits the strength of our 'robustly predict' claim. In the revised manuscript, we will add multiple regression analyses for each validation paradigm that first enter the standard lexical covariates and then assess the incremental contribution of the SWOW-DE association measures (e.g., via change in R² and partial correlations). We will update the abstract, results, and discussion sections to report these new findings and qualify the prediction claims accordingly. This revision directly addresses the concern and will provide clearer evidence of the unique value of the free-association norms. revision: yes
-
Referee: [Methods and Results] Methods and Results: no sample sizes, exclusion criteria, statistical details, or error bars are provided for the participant sample or the three validation analyses. This absence prevents assessment of whether the online sample and cue selection yield representative norms or whether the reported predictions are statistically reliable.
Authors: We acknowledge that the omission of these details reduces transparency and makes it difficult to evaluate the reliability of the norms and validations. In the revised manuscript, we will expand the Methods section to report the full participant sample size, recruitment details, demographic characteristics, and all exclusion criteria applied. For each of the three validation analyses, we will add the relevant sample sizes (words and observations), exact statistical procedures, test statistics, p-values, effect sizes, and error bars on all figures. These additions will be presented in both text and tables to allow readers to assess statistical reliability and representativeness. revision: yes
Circularity Check
No significant circularity: empirical data collection with external validation
full rationale
The paper collects new free-association norms from participants, applies preprocessing, and validates the resulting dataset by correlating it with performance measures from three independent psycholinguistic paradigms (lexical decision tasks, relatedness judgments, word ratings). These validation targets are drawn from separate established resources and are not derived from or fitted to the SWOW-DE data itself. No equations, parameter fitting, self-definitional loops, or load-bearing self-citations appear in the derivation chain. The central claim reduces to an empirical correlation between independently collected datasets, which is self-contained and non-circular by the stated criteria.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Online participants provide honest and representative free associations for German cue words.
- domain assumption Standard preprocessing steps preserve the semantic signal without introducing systematic bias.
Reference graph
Works this paper leans on
-
[1]
https://doi.org/10.1037/a0038693 Abramski, K., Improta, R., Rossetti, G., & Stella, M. (2025). The “LLM World of Words” English free associ- ation norms generated by large language models. Scientific Data,12(1), Article 803. https://doi.org/ 10.1038/s41597-025-05156-9 Aeschbach, S., Mata, R., & Wulff, D. U. (2025a). Mapping mental representations with fre...
-
[2]
https://doi.org/10.3758/s13428-018-1115-7 De Deyne, S., Perfors, A., & Navarro, D. J. (2017). Predict- ing human similarity judgments with distributional models: The value of word associations.Proceed- ings of the Twenty-Sixth International Joint Confer- ence on Artificial Intelligence, 4806–4810. https:// doi.org/10.24963/ijcai.2017/671 Dubossarsky, H., ...
-
[3]
(2018, March).Learning word vectors for 157 languages
https://doi.org/10.3758/BF03211553 16AESCHBACH Grave, E., Bojanowski, P., Gupta, P., Joulin, A., & Mikolov, T. (2018, March).Learning word vectors for 157 languages. arXiv. https://doi.org/10.48550/arXiv. 1802.06893 Gurevych, I. (2005). Using the structure of a concep- tual network in computing semantic relatedness. In R. Dale, K.-F. Wong, J. Su, & O. Y ....
-
[4]
Computer Graphics Forum 31, 2pt2 (2012), 519–528
https://doi.org/10.1111/j.1467- 9280.2009. 02365.x hunspell. (2022).Hunspell. (Version 1.7.2) [Source code]. Github. https://github.com/hunspell/hunspell Hussain, Z., Binz, M., Mata, R., & Wulff, D. U. (2024). A tutorial on open-source large language models for behavioral science.Behavior Research Methods, 56(8), 8214–8237. https://doi.org/10.3758/s13428-...
-
[5]
https://doi.org/10.1037/0033-2909.87.2.245 Steyvers, M., & Tenenbaum, J. B. (2005). The large-scale structure of semantic networks: Statistical analy- ses and a model of semantic growth.Cognitive Sci- ence,29(1), 41–78. https :/ /doi . org/10 . 1207/ s15516709cog2901_3 The Stanford Natural Language Processing Group. (n.d.). Stanford log-linear part-of-spe...
-
[6]
Wenn eine Response a) richtig geschrieben ist auf Deutsch, b) richtig geschrieben ist in einer anderen Sprache und kein typisches, deutsches falsch geschriebenes Wort ist, oder c) ein korrekt geschriebener Eigenname ist: Die originale Response verwenden. Beispiele: Cue: Mensch; Response: Homo sapiens; Korrigiert: Homo sapiens Cue: Theorie; Response: The B...
-
[7]
Wenn eine Response falsch geschrieben ist und (in Betracht des Cues und der anderen Assoziationen) die richtige Schreibweise zugerordnet werden kann: Die korrekt geschriebene Response verwenden. Beispiele: Cue: Argument; Response: Diskussoin; Korrigiert: Diskussion Cue: Marketing; Response: Werbungh; Korrigiert: Werbung
-
[8]
Wenn eine Response Wortkonstrukte enthält die auf mehrere Responses in einem Antwortfeld hinweisen: Die erste Response verwenden. Beispiele: Cue: hören; Response: sehen, fühlen, riechen; Korrigiert: sehen Cue: Schleifpapier; Response: feinkörnig/grobkörnig; Korrigiert: feinkörnig
-
[9]
Beispiele: Cue: lokal; Response: Lokal/Bar; Korrigiert: Bar
Wenn eine Assoziation Wortkonstrukte enthält, die den Cue wiederholen und eine zusätzliche, eigenständige Komponente enthalten: Die korrekt geschriebene eigenständige Komponente verwenden. Beispiele: Cue: lokal; Response: Lokal/Bar; Korrigiert: Bar
-
[10]
Wenn eine Response ein unvollständiges Wort ist aber in Kombination mit dem Cue Sinn macht: Die sinnvolle Kombination verwenden. Beispiele: Cue: Gebiet; Response: Hohheits; Korrigiert: Hoheitsgebiet Cue: notwendig; Response: keit; Korrigiert: Notwendigkeit English translation of the system prompt: You help me correct potentially misspelled responses in a ...
-
[11]
If a response a) is correctly spelled in German, b) is correctly spelled in another language and is not a typical, misspelled German word, or c) is a correctly spelled proper name: Use the original response. Examples: Cue: Mensch; Response: Homo sapiens; Corrected: Homo sapiens Cue: Theorie; Response: The Big Bang Theory; Corrected: The Big Bang Theory
-
[12]
If a response is misspelled and (considering the cue and the other associations) the correct spelling can be assigned: Use the correctly spelled response. SWOW-DE21 Examples: Cue: Argument; Response: Diskussoin; Corrected: Diskussion Cue: Marketing; Response: Werbungh; Corrected: Werbung
-
[13]
If a response contains word constructs that indicate multiple responses in one answer field: Use the first response. Examples: Cue: hören; Response: sehen, fühlen, riechen; Corrected: sehen Cue: Schleifpapier; Response: feinkörnig/grobkörnig; Corrected: feinkörnig
-
[14]
Examples: Cue: lokal; Response: Lokal/Bar; Corrected: Bar
If an association contains word constructs that repeat the cue and contain an additional, independent component: Use the correctly spelled independent component. Examples: Cue: lokal; Response: Lokal/Bar; Corrected: Bar
-
[15]
If a response is an incomplete word but makes sense in combination with the cue: Use the meaningful combination. Examples: Cue: Gebiet; Response: Hohheits; Corrected: Hoheitsgebiet Cue: notwendig; Response: keit; Corrected: Notwendigkeit User Prompt For each incorrectly spelled response, the following prompt template was used to obtain a correction. Respo...
work page 2020
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.