Nefnir: A high accuracy lemmatizer for Icelandic
Pith reviewed 2026-05-24 14:54 UTC · model grok-4.3
The pith
Nefnir lemmatizes Icelandic text via suffix substitution rules from a morphological database, reaching 99.55% accuracy on correctly tagged input.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Nefnir uses suffix substitution rules, derived from a large morphological database, to lemmatize tagged text. Evaluation shows that for correctly tagged text, Nefnir obtains an accuracy of 99.55%, and for text tagged with a PoS tagger, the accuracy obtained is 96.88%.
What carries the argument
Suffix substitution rules derived from a morphological database that replace word endings to produce lemmas based on observed patterns.
Load-bearing premise
The morphological database is comprehensive enough that the suffix substitution rules derived from it will generalize accurately to new Icelandic text outside the database itself.
What would settle it
Running Nefnir on a new Icelandic corpus or set of words absent from the morphological database and measuring whether accuracy falls below 90%.
read the original abstract
Lemmatization, finding the basic morphological form of a word in a corpus, is an important step in many natural language processing tasks when working with morphologically rich languages. We describe and evaluate Nefnir, a new open source lemmatizer for Icelandic. Nefnir uses suffix substitution rules, derived from a large morphological database, to lemmatize tagged text. Evaluation shows that for correctly tagged text, Nefnir obtains an accuracy of 99.55%, and for text tagged with a PoS tagger, the accuracy obtained is 96.88%.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents Nefnir, an open-source lemmatizer for Icelandic that derives suffix substitution rules from a large morphological database and applies them to lemmatize tagged text. It reports accuracies of 99.55% when input tags are correct and 96.88% when input comes from an automatic PoS tagger.
Significance. If the reported accuracies reflect genuine generalization to word forms outside the source database, the work would provide a practical, high-accuracy tool for a morphologically rich language together with reproducible open-source code and a simple rule-based method. The empirical framing against an external database is a strength.
major comments (2)
- [Evaluation] Evaluation section: the manuscript provides no information on test-set size, the proportion of word forms absent from the morphological database, or any error analysis. Without evidence that a non-trivial fraction of the test material consists of forms unseen in the database, the accuracies of 99.55% and 96.88% cannot be interpreted as measuring generalization via the derived suffix rules rather than database coverage.
- [Method] Method section: the description of how suffix-substitution rules are extracted, filtered, and selected from the morphological database is insufficiently detailed to allow reproduction or assessment of whether the rule set is overfit to the database.
Simulated Author's Rebuttal
We thank the referee for the constructive comments. We address each major point below and indicate the revisions we will make.
read point-by-point responses
-
Referee: [Evaluation] Evaluation section: the manuscript provides no information on test-set size, the proportion of word forms absent from the morphological database, or any error analysis. Without evidence that a non-trivial fraction of the test material consists of forms unseen in the database, the accuracies of 99.55% and 96.88% cannot be interpreted as measuring generalization via the derived suffix rules rather than database coverage.
Authors: We agree that these details are necessary for proper interpretation. In the revised manuscript we will report the exact size of the test set, the proportion of word forms absent from the morphological database, and include a short error analysis. This will allow readers to assess the degree of generalization achieved by the suffix rules. revision: yes
-
Referee: [Method] Method section: the description of how suffix-substitution rules are extracted, filtered, and selected from the morphological database is insufficiently detailed to allow reproduction or assessment of whether the rule set is overfit to the database.
Authors: We acknowledge that the current description is not detailed enough for full reproducibility. We will expand the method section with a step-by-step account of rule extraction, the filtering criteria applied, and the selection procedure used, including any parameters or thresholds. This will also help evaluate potential overfitting. revision: yes
Circularity Check
No significant circularity detected
full rationale
The paper describes an empirical system that extracts suffix substitution rules from an external morphological database and measures accuracy on held-out tagged text (99.55% gold tags, 96.88% automatic tags). No equations, fitted parameters, or self-citations reduce the reported accuracies to the training inputs by construction; the evaluation is a direct comparison against an independent database on unseen material. The derivation chain is therefore self-contained and non-circular.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Suffix substitution rules derived from a morphological database can accurately map inflected forms to lemmas for Icelandic.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.