If the Sources Could Talk: Evaluating Large Language Models for Research Assistance in History

Christian Weilbach; Giselle Gonzalez Garcia

arxiv: 2310.10808 · v1 · pith:E67N3NUHnew · submitted 2023-10-16 · 💻 cs.IR · cs.AI

If the Sources Could Talk: Evaluating Large Language Models for Research Assistance in History

Giselle Gonzalez Garcia , Christian Weilbach This is my paper

classification 💻 cs.IR cs.AI

keywords llmssourcesconversationaldataresearchersdemonstrateevaluatelarge

0 comments

read the original abstract

The recent advent of powerful Large-Language Models (LLM) provides a new conversational form of inquiry into historical memory (or, training data, in this case). We show that by augmenting such LLMs with vector embeddings from highly specialized academic sources, a conversational methodology can be made accessible to historians and other researchers in the Humanities. Concretely, we evaluate and demonstrate how LLMs have the ability of assisting researchers while they examine a customized corpora of different types of documents, including, but not exclusive to: (1). primary sources, (2). secondary sources written by experts, and (3). the combination of these two. Compared to established search interfaces for digital catalogues, such as metadata and full-text search, we evaluate the richer conversational style of LLMs on the performance of two main types of tasks: (1). question-answering, and (2). extraction and organization of data. We demonstrate that LLMs semantic retrieval and reasoning abilities on problem-specific tasks can be applied to large textual archives that have not been part of the its training data. Therefore, LLMs can be augmented with sources relevant to specific research projects, and can be queried privately by researchers.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Hey Chat, Can You Teach Me? Structuring Socratic Dialogue for Human Learning in the Wild
cs.CL 2026-06 unverdicted novelty 7.0

A PPO policy for deciding topic order and duration on a prerequisite knowledge graph, paired with an LLM for Socratic dialogue, improves student mastery rates and reduces turns compared to baselines and scaled models ...
From Text to Discovery: How Large Language Models Are Accelerating and Complicating Research Across Scientific and Humanistic Disciplines
cs.DL 2026-06 unverdicted novelty 3.0

LLMs accelerate research workflows from idea generation to writing but introduce challenges like hallucination, bias, opacity, and ten systemic risks requiring new governance frameworks.