DELICATE: Diachronic Entity LInking using Classes And Temporal Evidence

Cristian Santini; Emanuele Frontoni; Mehwish Alam; Paolo Sernani; Sebastian Barzaghi

arxiv: 2511.10404 · v2 · pith:H2TQDJVCnew · submitted 2025-11-13 · 💻 cs.CL

DELICATE: Diachronic Entity LInking using Classes And Temporal Evidence

Cristian Santini , Sebastian Barzaghi , Paolo Sernani , Emanuele Frontoni , Mehwish Alam This is my paper

Pith reviewed 2026-05-25 08:03 UTC · model grok-4.3

classification 💻 cs.CL

keywords entity linkinghistorical textsItalianneuro-symbolictemporal evidenceentity typesWikidatainterpretability

0 comments

The pith

DELICATE uses temporal and type information from Wikidata to link entities in historical Italian more accurately than larger models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces DELICATE, a neuro-symbolic approach to entity linking that augments a BERT encoder with Wikidata-derived checks for temporal plausibility and entity type consistency. The goal is to handle the difficulties of long-tail entities and domain-specific texts in historical humanities better than standard neural methods. The authors also create the ENEIDE corpus from 19th and 20th century Italian literary and political texts. Evaluation shows DELICATE surpassing other models, including those with billions of parameters, while offering greater explainability through its features.

Core claim

The central discovery is that filtering candidate entities using temporal evidence and class consistency from Wikidata, in combination with contextual embeddings, allows for more accurate and interpretable entity linking in diachronic Italian texts than purely neural alternatives.

What carries the argument

Neuro-symbolic selection mechanism that applies temporal plausibility and entity type consistency constraints from Wikidata to candidates generated by a BERT-based encoder.

If this is right

DELICATE achieves higher performance than competing EL models on historical Italian data.
The system yields more explainable results via analysis of confidence scores and feature sensitivity.
It handles long-tail entities effectively in texts from the 19th to 20th centuries.
The ENEIDE corpus provides a new resource for training and evaluating EL models in this domain.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This filtering strategy might generalize to entity linking tasks in other languages with structured knowledge bases containing temporal data.
Combining symbolic constraints with neural encoders could address similar challenges in other specialized domains like legal or scientific historical documents.
Further work could test whether updating Wikidata with more historical details would enhance the method's coverage for rare entities.

Load-bearing premise

Wikidata supplies sufficiently accurate and complete temporal and type information to correctly filter candidate entities for long-tail historical mentions in Italian texts.

What would settle it

Running DELICATE on a collection of historical Italian texts for which independent verification shows Wikidata's temporal or type data to be inaccurate or missing for many entities, and observing that performance drops below that of baseline models.

read the original abstract

In spite of the remarkable advancements in the field of Natural Language Processing, the task of Entity Linking (EL) remains challenging in the field of humanities due to complex document typologies, lack of domain-specific datasets and models, and long-tail entities, i.e., entities under-represented in Knowledge Bases (KBs). The goal of this paper is to address these issues with two main contributions. The first contribution is DELICATE, a novel neuro-symbolic method for EL on historical Italian which combines a BERT-based encoder with contextual information from Wikidata to select appropriate KB entities using temporal plausibility and entity type consistency. The second contribution is ENEIDE, a multi-domain EL corpus in historical Italian semi-automatically extracted from two annotated editions spanning from the 19th to the 20th century and including literary and political texts. Results show how DELICATE outperforms other EL models in historical Italian even if compared with larger architectures with billions of parameters. Moreover, further analyses reveal how DELICATE confidence scores and features sensitivity provide results which are more explainable and interpretable than purely neural methods.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Abstract claims DELICATE beats large models on historical Italian EL but supplies zero numbers, baselines, or stats to check the claim.

read the letter

The paper's main contributions are DELICATE, a neuro-symbolic entity linker that runs a BERT encoder then filters candidates with Wikidata temporal plausibility and type consistency, plus the ENEIDE corpus of 19th-20th century Italian literary and political texts. Both are presented as new. That combination targets a documented gap in non-English historical entity linking, where long-tail mentions and domain shift are common problems. The corpus itself could be useful to digital humanities groups working on Italian material if the annotations hold up under inspection. The abstract also notes that the method produces more interpretable scores than pure neural baselines, which is a reasonable secondary goal. The central problem is that none of the performance claims are backed by data in the abstract. It states outperformance over other models, including billion-parameter ones, yet gives no accuracy figures, no baseline descriptions, no dataset sizes, and no error analysis. Without those, the headline result cannot be evaluated. The stress-test concern about Wikidata coverage for long-tail historical Italian entities is also live: if many gold entities lack dates or classes, the filtering step either drops correct answers or keeps bad ones, and the neuro-symbolic edge reduces to whatever the BERT component already provides. The paper appears aimed at researchers in historical NLP or digital humanities who need Italian-specific resources. It does not look ready for peer review in its current form because the key empirical claims rest on information that is simply not shown. If the full manuscript contains the missing tables and analysis, it could be worth sending out; otherwise it needs substantial revision first.

Referee Report

2 major / 1 minor

Summary. The paper introduces DELICATE, a neuro-symbolic entity linking approach for historical Italian that augments a BERT encoder with Wikidata-derived temporal plausibility and entity-type consistency filters, and presents the ENEIDE corpus extracted from 19th–20th century literary and political texts. It claims that DELICATE outperforms existing EL models—including architectures with billions of parameters—and yields more interpretable results via confidence scores and feature sensitivity.

Significance. If the performance and interpretability claims are substantiated with rigorous evaluation, the work would advance diachronic EL for humanities texts by addressing long-tail entities through lightweight symbolic constraints rather than scale alone; the ENEIDE resource could also support further research in low-resource historical domains.

major comments (2)

[Abstract, §4] Abstract and §4 (results): the central claim that DELICATE outperforms larger models rests on quantitative evidence that is not supplied in the abstract and whose details (metrics, baselines, dataset statistics, error analysis) must be verified in the full results section; without these, the headline result cannot be evaluated.
[§3] §3 (method): the neuro-symbolic advantage is predicated on Wikidata supplying accurate and high-coverage temporal dates and type information for long-tail historical Italian entities in ENEIDE; no coverage statistics, ablation on filter accuracy, or failure-case analysis for sparse Wikidata records is provided, which directly bears on whether the filtering step improves or degrades the underlying BERT encoder.

minor comments (1)

[Abstract] The abstract states that further analyses reveal explainability advantages but does not specify the exact features or sensitivity metrics used.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify how to strengthen the presentation of our results and the justification for the neuro-symbolic components. We address each major comment below.

read point-by-point responses

Referee: [Abstract, §4] Abstract and §4 (results): the central claim that DELICATE outperforms larger models rests on quantitative evidence that is not supplied in the abstract and whose details (metrics, baselines, dataset statistics, error analysis) must be verified in the full results section; without these, the headline result cannot be evaluated.

Authors: The full manuscript already supplies the requested details in Section 4, including precision/recall/F1 scores, comparisons against baselines (including models with billions of parameters), ENEIDE dataset statistics, and error analysis. To make the central claim immediately verifiable from the abstract, we will revise the abstract to include a concise summary of the key quantitative results. revision: yes
Referee: [§3] §3 (method): the neuro-symbolic advantage is predicated on Wikidata supplying accurate and high-coverage temporal dates and type information for long-tail historical Italian entities in ENEIDE; no coverage statistics, ablation on filter accuracy, or failure-case analysis for sparse Wikidata records is provided, which directly bears on whether the filtering step improves or degrades the underlying BERT encoder.

Authors: We agree that explicit coverage statistics and targeted ablations would strengthen the justification for the symbolic filters. In the revised version we will add (i) Wikidata coverage statistics for the entities appearing in ENEIDE, (ii) an ablation that isolates the contribution of the temporal and type-consistency filters, and (iii) a short failure-case analysis highlighting instances where sparse Wikidata records limit the filters. These additions will clarify when the neuro-symbolic step improves versus degrades the BERT encoder. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper presents DELICATE as a neuro-symbolic EL method that augments a BERT encoder with external Wikidata temporal plausibility and entity-type consistency checks. No equations, parameter-fitting steps, or predictions are described. No self-citations appear in the provided text, and the performance claims rest on empirical comparison against other models on the independently constructed ENEIDE corpus rather than any reduction of outputs to inputs by construction. The approach is therefore self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the domain assumption that Wikidata temporal and type data are reliable for historical Italian entities; no free parameters or invented entities are mentioned in the abstract.

axioms (1)

domain assumption Wikidata contains accurate temporal information and entity types usable for filtering candidate links in historical texts
Invoked to select appropriate KB entities using temporal plausibility and entity type consistency.

pith-pipeline@v0.9.0 · 5736 in / 1139 out tokens · 31560 ms · 2026-05-25T08:03:09.503673+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

31 extracted references · 31 canonical work pages · 1 internal anchor

[1]

J Comput Cult Herit https://doi.org/10.1145/3725534, URL https: //doi.org/10.1145/3725534

Barzaghi S, Palmero Aprosio A, Paolucci F, et al (2025) The semantic digi- tal edition of aldo moro’s writings: A workflow supporting data sharing and replicability. J Comput Cult Herit https://doi.org/10.1145/3725534, URL https: //doi.org/10.1145/3725534

work page doi:10.1145/3725534 2025
[2]

In: The Semantic Web–ISWC 2019: 18th Interna- tional Semantic Web Conference, Auckland, New Zealand, October 26–30, 2019, Proceedings, Part II 18, Springer, pp 36–52

Carriero VA, Gangemi A, Mancinelli ML, et al (2019) Arco: The italian cultural heritage knowledge graph. In: The Semantic Web–ISWC 2019: 18th Interna- tional Semantic Web Conference, Auckland, New Zealand, October 26–30, 2019, Proceedings, Part II 18, Springer, pp 36–52

work page 2019
[3]

Journal of Open Humanities Data 3:2–2

Colavizza G, Romanello M (2017) Annotated References in the Historiography on Venice: 19th–21st centuries. Journal of Open Humanities Data 3:2–2. https:// 23 doi.org/10.5334/johd.9, URL https://account.openhumanitiesdata.metajnl.com/ index.php/up-j-johd/article/view/johd.9

work page doi:10.5334/johd.9 2017
[4]

International Journal of Information Science and Technology 9(1):42–53

Cristofaro S, Del Grosso AM, Mazzagufo L, et al (2025) Implementing collab- orative digital scholarly editions: Insights from bellini digital correspondence. International Journal of Information Science and Technology 9(1):42–53

work page 2025
[5]

Transactions of the Association for Computational Linguistics 10:274–290

De Cao N, Wu L, Popat K, et al (2022) Multilingual autoregressive entity linking. Transactions of the Association for Computational Linguistics 10:274–290

work page 2022
[6]

Devlin J, Chang MW, Lee K, et al (2019) BERT: Pre-training of Deep Bidirec- tional Transformers for Language Understanding. In: Burstein J, Doran C, Solorio T (eds) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association fo...

work page doi:10.18653/v1/n19-1423 2019
[7]

In: International Conference of the Cross-Language Evaluation Forum for European Languages, Springer, pp 423–446

Ehrmann M, Romanello M, Najem-Meyer S, et al (2022) Overview of hipe-2022: named entity recognition and linking in multilingual historical documents. In: International Conference of the Cross-Language Evaluation Forum for European Languages, Springer, pp 423–446

work page 2022
[8]

ACM Comput Surv 56(2)

Ehrmann M, Hamdi A, Pontes EL, et al (2023) Named entity recognition and classification in historical documents: A survey. ACM Comput Surv 56(2). https: //doi.org/10.1145/3604931, URL https://doi.org/10.1145/3604931

work page doi:10.1145/3604931 2023
[9]

Artificial Intelligence Review 58(5):140

Graciotti A, Lazzari N, Presutti V, et al (2025) Musical heritage historical entity linking. Artificial Intelligence Review 58(5):140

work page 2025
[10]

In: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria

Graciotti A, Piano L, Lazzari N, et al (2025) Ke-mhisto: Towards a multilingual historical knowledge extraction benchmark for addressing the long-tail problem. In: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria

work page 2025
[11]

The Llama 3 Herd of Models

Grattafiori A, et al (2024) The llama 3 herd of models. URL https://arxiv.org/ abs/2407.21783, arXiv:2407.21783

work page internal anchor Pith review Pith/arXiv arXiv 2024
[12]

IEEE Transactions on Big Data 7(3):535–547

Johnson J, Douze M, J´ egou H (2019) Billion-scale similarity search with gpus. IEEE Transactions on Big Data 7(3):535–547

work page 2019
[13]

In: Korhonen A, Titov I (eds) Proceedings of the 22nd Conference on Computational Natural Language Learning

Kolitsas N, Ganea OE, Hofmann T (2018) End-to-end neural entity linking. In: Korhonen A, Titov I (eds) Proceedings of the 22nd Conference on Computational Natural Language Learning. Association for Computational Linguistics, Brus- sels, Belgium, pp 519–529, https://doi.org/10.18653/v1/K18-1050, URL https: //aclanthology.org/K18-1050 24

work page doi:10.18653/v1/k18-1050 2018
[14]

In: CLEF (Working Notes), p 33, URL https://ceur-ws.org/Vol-2696/paper 163.pdf?ref=https://githubhelp.com

Labusch K, Neudecker C (2020) Named Entity Disambiguation and Linking Historic Newspaper OCR with BERT. In: CLEF (Working Notes), p 33, URL https://ceur-ws.org/Vol-2696/paper 163.pdf?ref=https://githubhelp.com

work page 2020
[15]

In: Bouamor H, Pino J, Bali K (eds) Findings of the Association for Computational Lin- guistics: EMNLP 2023

Limkonchotiwat P, Cheng W, Christodoulopoulos C, et al (2023) mReFinED: An efficient end-to-end multilingual entity linking system. In: Bouamor H, Pino J, Bali K (eds) Findings of the Association for Computational Lin- guistics: EMNLP 2023. Association for Computational Linguistics, Singapore, pp 15080–15089, https://doi.org/10.18653/v1/2023.findings-emnl...

work page doi:10.18653/v1/2023.findings-emnlp.1007 2023
[16]

International journal on digital libraries pp 1–28

Linhares Pontes E, Cabrera-Diego LA, Moreno JG, et al (2022) Melhissa: a multilingual entity linking architecture for historical press articles. International journal on digital libraries pp 1–28

work page 2022
[17]

In: Calzolari N, B´ echet F, Blache P, et al (eds) Proceedings of the Thirteenth Language Resources and Evaluation Conference

Paccosi T, Palmero Aprosio A (2022) KIND: an Italian multi-domain dataset for named entity recognition. In: Calzolari N, B´ echet F, Blache P, et al (eds) Proceedings of the Thirteenth Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, pp 501–507, URL https://aclanthology.org/2022.lrec-1.52

work page 2022
[18]

In: International Conference of the Italian Association for Artificial Intelligence, Springer, pp 187–201

Pozzi R, Rubini R, Bernasconi C, et al (2023) Named entity recognition and linking for entity extraction from italian civil judgements. In: International Conference of the Italian Association for Artificial Intelligence, Springer, pp 187–201

work page 2023
[19]

Journal of Open Humanities Data 10(1)

Romanello M, Najem-Meyer S (2024) A Named Entity-Annotated Cor- pus of 19th Century Classical Commentaries. Journal of Open Humanities Data 10(1). https://doi.org/10.5334/johd.150, URL https://openhumanitiesdata. metajnl.com/articles/10.5334/johd.150

work page doi:10.5334/johd.150 2024
[20]

URL https://m.m88hub.co/dlib/bitstream/ HUST/23724/1/OER000002860.pdf#page=38, publisher: Open Book Publishers Cambridge

Sahle P (2016) What is a scholarly digital edition? Digital scholarly edit- ing: Theories and practices 1:19–39. URL https://m.m88hub.co/dlib/bitstream/ HUST/23724/1/OER000002860.pdf#page=38, publisher: Open Book Publishers Cambridge

work page 2016
[21]

Frontiers in Computer Science 6:1472512

Santini C (2024) Combining language models for knowledge extraction from italian tei editions. Frontiers in Computer Science 6:1472512

work page 2024
[22]

CEUR Workshop Proceedings 3234

Santini C, Tan MA, Bruns O, et al (2022) Knowledge Extraction for Art His- tory: the Case of Vasari’s The Lives of The Artists (1568). CEUR Workshop Proceedings 3234. URL https://ceur-ws.org/Vol-3234/paper7.pdf

work page 2022
[23]

In: CEUR Workshop Proceedings, p 11, URL https://ceur-ws.org/Vol-3967/X-TAIL-2024 paper 1.pdf 25

Santini C, Melosi L, Frontoni E (2024) Named Entity Recognition in Histor- ical Italian: The Case of Giacomo Leopardi’s Zibaldone. In: CEUR Workshop Proceedings, p 11, URL https://ceur-ws.org/Vol-3967/X-TAIL-2024 paper 1.pdf 25

work page 2024
[24]

Semantic Web 13(3):527–570

Sevgili ¨O, Shelmanov A, Arkhipov M, et al (2022) Neural entity linking: A survey of models based on deep learning. Semantic Web 13(3):527–570

work page 2022
[25]

IJCoL Italian Journal of Computational Linguistics 2(2-2):89–99

Sprugnoli R, Tonelli S, Moretti G, et al (2016) Fifty years of european history through the lens of computational linguistics: the de gasperi project. IJCoL Italian Journal of Computational Linguistics 2(2-2):89–99

work page 2016
[26]

magaz´ en 4(3):13

Stoyanova S (2023) Working with the Digital Edition of Giacomo Leopardi’s Zibaldone. magaz´ en 4(3):13

work page 2023
[27]

In: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 131–140

Suchanek FM, Alam M, Bonald T, et al (2024) Yago 4.5: A large and clean knowledge base with a rich taxonomy. In: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 131–140

work page 2024
[28]

Communications of the ACM 57(10):78–85

Vrandeˇ ci´ c D, Kr¨ otzsch M (2014) Wikidata: a free collaborative knowledgebase. Communications of the ACM 57(10):78–85

work page 2014
[29]

In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 6397–6407

Wu L, Petroni F, Josifoski M, et al (2020) Scalable zero-shot entity linking with dense entity retrieval. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 6397–6407

work page 2020
[30]

Yasodhara A, Asgarian A, Huang D, et al (2021) On the trustworthiness of tree en- semble explainability methods. In: Machine Learning and Knowledge Extraction: 5th IFIP TC 5, TC 12, WG 8.4, WG 8.9, WG 12.9 International Cross-Domain Conference, CD-MAKE 2021, Virtual Event, August 17–20, 2021, Proceedings 5, Springer, pp 293–308

work page 2021
[31]

arXiv preprint arXiv:231108526 26

Zaratiana U, Tomeh N, Holat P, et al (2023) Gliner: Generalist model for named entity recognition using bidirectional transformer. arXiv preprint arXiv:231108526 26

work page 2023

[1] [1]

J Comput Cult Herit https://doi.org/10.1145/3725534, URL https: //doi.org/10.1145/3725534

Barzaghi S, Palmero Aprosio A, Paolucci F, et al (2025) The semantic digi- tal edition of aldo moro’s writings: A workflow supporting data sharing and replicability. J Comput Cult Herit https://doi.org/10.1145/3725534, URL https: //doi.org/10.1145/3725534

work page doi:10.1145/3725534 2025

[2] [2]

In: The Semantic Web–ISWC 2019: 18th Interna- tional Semantic Web Conference, Auckland, New Zealand, October 26–30, 2019, Proceedings, Part II 18, Springer, pp 36–52

Carriero VA, Gangemi A, Mancinelli ML, et al (2019) Arco: The italian cultural heritage knowledge graph. In: The Semantic Web–ISWC 2019: 18th Interna- tional Semantic Web Conference, Auckland, New Zealand, October 26–30, 2019, Proceedings, Part II 18, Springer, pp 36–52

work page 2019

[3] [3]

Journal of Open Humanities Data 3:2–2

Colavizza G, Romanello M (2017) Annotated References in the Historiography on Venice: 19th–21st centuries. Journal of Open Humanities Data 3:2–2. https:// 23 doi.org/10.5334/johd.9, URL https://account.openhumanitiesdata.metajnl.com/ index.php/up-j-johd/article/view/johd.9

work page doi:10.5334/johd.9 2017

[4] [4]

International Journal of Information Science and Technology 9(1):42–53

Cristofaro S, Del Grosso AM, Mazzagufo L, et al (2025) Implementing collab- orative digital scholarly editions: Insights from bellini digital correspondence. International Journal of Information Science and Technology 9(1):42–53

work page 2025

[5] [5]

Transactions of the Association for Computational Linguistics 10:274–290

De Cao N, Wu L, Popat K, et al (2022) Multilingual autoregressive entity linking. Transactions of the Association for Computational Linguistics 10:274–290

work page 2022

[6] [6]

Devlin J, Chang MW, Lee K, et al (2019) BERT: Pre-training of Deep Bidirec- tional Transformers for Language Understanding. In: Burstein J, Doran C, Solorio T (eds) Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association fo...

work page doi:10.18653/v1/n19-1423 2019

[7] [7]

In: International Conference of the Cross-Language Evaluation Forum for European Languages, Springer, pp 423–446

Ehrmann M, Romanello M, Najem-Meyer S, et al (2022) Overview of hipe-2022: named entity recognition and linking in multilingual historical documents. In: International Conference of the Cross-Language Evaluation Forum for European Languages, Springer, pp 423–446

work page 2022

[8] [8]

ACM Comput Surv 56(2)

Ehrmann M, Hamdi A, Pontes EL, et al (2023) Named entity recognition and classification in historical documents: A survey. ACM Comput Surv 56(2). https: //doi.org/10.1145/3604931, URL https://doi.org/10.1145/3604931

work page doi:10.1145/3604931 2023

[9] [9]

Artificial Intelligence Review 58(5):140

Graciotti A, Lazzari N, Presutti V, et al (2025) Musical heritage historical entity linking. Artificial Intelligence Review 58(5):140

work page 2025

[10] [10]

In: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria

Graciotti A, Piano L, Lazzari N, et al (2025) Ke-mhisto: Towards a multilingual historical knowledge extraction benchmark for addressing the long-tail problem. In: Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria

work page 2025

[11] [11]

The Llama 3 Herd of Models

Grattafiori A, et al (2024) The llama 3 herd of models. URL https://arxiv.org/ abs/2407.21783, arXiv:2407.21783

work page internal anchor Pith review Pith/arXiv arXiv 2024

[12] [12]

IEEE Transactions on Big Data 7(3):535–547

Johnson J, Douze M, J´ egou H (2019) Billion-scale similarity search with gpus. IEEE Transactions on Big Data 7(3):535–547

work page 2019

[13] [13]

In: Korhonen A, Titov I (eds) Proceedings of the 22nd Conference on Computational Natural Language Learning

Kolitsas N, Ganea OE, Hofmann T (2018) End-to-end neural entity linking. In: Korhonen A, Titov I (eds) Proceedings of the 22nd Conference on Computational Natural Language Learning. Association for Computational Linguistics, Brus- sels, Belgium, pp 519–529, https://doi.org/10.18653/v1/K18-1050, URL https: //aclanthology.org/K18-1050 24

work page doi:10.18653/v1/k18-1050 2018

[14] [14]

In: CLEF (Working Notes), p 33, URL https://ceur-ws.org/Vol-2696/paper 163.pdf?ref=https://githubhelp.com

Labusch K, Neudecker C (2020) Named Entity Disambiguation and Linking Historic Newspaper OCR with BERT. In: CLEF (Working Notes), p 33, URL https://ceur-ws.org/Vol-2696/paper 163.pdf?ref=https://githubhelp.com

work page 2020

[15] [15]

In: Bouamor H, Pino J, Bali K (eds) Findings of the Association for Computational Lin- guistics: EMNLP 2023

Limkonchotiwat P, Cheng W, Christodoulopoulos C, et al (2023) mReFinED: An efficient end-to-end multilingual entity linking system. In: Bouamor H, Pino J, Bali K (eds) Findings of the Association for Computational Lin- guistics: EMNLP 2023. Association for Computational Linguistics, Singapore, pp 15080–15089, https://doi.org/10.18653/v1/2023.findings-emnl...

work page doi:10.18653/v1/2023.findings-emnlp.1007 2023

[16] [16]

International journal on digital libraries pp 1–28

Linhares Pontes E, Cabrera-Diego LA, Moreno JG, et al (2022) Melhissa: a multilingual entity linking architecture for historical press articles. International journal on digital libraries pp 1–28

work page 2022

[17] [17]

In: Calzolari N, B´ echet F, Blache P, et al (eds) Proceedings of the Thirteenth Language Resources and Evaluation Conference

Paccosi T, Palmero Aprosio A (2022) KIND: an Italian multi-domain dataset for named entity recognition. In: Calzolari N, B´ echet F, Blache P, et al (eds) Proceedings of the Thirteenth Language Resources and Evaluation Conference. European Language Resources Association, Marseille, France, pp 501–507, URL https://aclanthology.org/2022.lrec-1.52

work page 2022

[18] [18]

In: International Conference of the Italian Association for Artificial Intelligence, Springer, pp 187–201

Pozzi R, Rubini R, Bernasconi C, et al (2023) Named entity recognition and linking for entity extraction from italian civil judgements. In: International Conference of the Italian Association for Artificial Intelligence, Springer, pp 187–201

work page 2023

[19] [19]

Journal of Open Humanities Data 10(1)

Romanello M, Najem-Meyer S (2024) A Named Entity-Annotated Cor- pus of 19th Century Classical Commentaries. Journal of Open Humanities Data 10(1). https://doi.org/10.5334/johd.150, URL https://openhumanitiesdata. metajnl.com/articles/10.5334/johd.150

work page doi:10.5334/johd.150 2024

[20] [20]

URL https://m.m88hub.co/dlib/bitstream/ HUST/23724/1/OER000002860.pdf#page=38, publisher: Open Book Publishers Cambridge

Sahle P (2016) What is a scholarly digital edition? Digital scholarly edit- ing: Theories and practices 1:19–39. URL https://m.m88hub.co/dlib/bitstream/ HUST/23724/1/OER000002860.pdf#page=38, publisher: Open Book Publishers Cambridge

work page 2016

[21] [21]

Frontiers in Computer Science 6:1472512

Santini C (2024) Combining language models for knowledge extraction from italian tei editions. Frontiers in Computer Science 6:1472512

work page 2024

[22] [22]

CEUR Workshop Proceedings 3234

Santini C, Tan MA, Bruns O, et al (2022) Knowledge Extraction for Art His- tory: the Case of Vasari’s The Lives of The Artists (1568). CEUR Workshop Proceedings 3234. URL https://ceur-ws.org/Vol-3234/paper7.pdf

work page 2022

[23] [23]

In: CEUR Workshop Proceedings, p 11, URL https://ceur-ws.org/Vol-3967/X-TAIL-2024 paper 1.pdf 25

Santini C, Melosi L, Frontoni E (2024) Named Entity Recognition in Histor- ical Italian: The Case of Giacomo Leopardi’s Zibaldone. In: CEUR Workshop Proceedings, p 11, URL https://ceur-ws.org/Vol-3967/X-TAIL-2024 paper 1.pdf 25

work page 2024

[24] [24]

Semantic Web 13(3):527–570

Sevgili ¨O, Shelmanov A, Arkhipov M, et al (2022) Neural entity linking: A survey of models based on deep learning. Semantic Web 13(3):527–570

work page 2022

[25] [25]

IJCoL Italian Journal of Computational Linguistics 2(2-2):89–99

Sprugnoli R, Tonelli S, Moretti G, et al (2016) Fifty years of european history through the lens of computational linguistics: the de gasperi project. IJCoL Italian Journal of Computational Linguistics 2(2-2):89–99

work page 2016

[26] [26]

magaz´ en 4(3):13

Stoyanova S (2023) Working with the Digital Edition of Giacomo Leopardi’s Zibaldone. magaz´ en 4(3):13

work page 2023

[27] [27]

In: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 131–140

Suchanek FM, Alam M, Bonald T, et al (2024) Yago 4.5: A large and clean knowledge base with a rich taxonomy. In: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pp 131–140

work page 2024

[28] [28]

Communications of the ACM 57(10):78–85

Vrandeˇ ci´ c D, Kr¨ otzsch M (2014) Wikidata: a free collaborative knowledgebase. Communications of the ACM 57(10):78–85

work page 2014

[29] [29]

In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 6397–6407

Wu L, Petroni F, Josifoski M, et al (2020) Scalable zero-shot entity linking with dense entity retrieval. In: Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP), pp 6397–6407

work page 2020

[30] [30]

Yasodhara A, Asgarian A, Huang D, et al (2021) On the trustworthiness of tree en- semble explainability methods. In: Machine Learning and Knowledge Extraction: 5th IFIP TC 5, TC 12, WG 8.4, WG 8.9, WG 12.9 International Cross-Domain Conference, CD-MAKE 2021, Virtual Event, August 17–20, 2021, Proceedings 5, Springer, pp 293–308

work page 2021

[31] [31]

arXiv preprint arXiv:231108526 26

Zaratiana U, Tomeh N, Holat P, et al (2023) Gliner: Generalist model for named entity recognition using bidirectional transformer. arXiv preprint arXiv:231108526 26

work page 2023