Interpreto: An Explainability Library for Transformers

Antonin Poch\'e; C\'eline Hudelot; Charlotte Claye; Corentin Friedrich; Fanny Jourdan; Fran\c{c}ois Hoofd; Fr\'ed\'eric Boisnard; Gabriele Sarti; Nicholas Asher; Raphael Bernas

arxiv: 2512.09730 · v3 · pith:T2D2KTQWnew · submitted 2025-12-10 · 💻 cs.CL · cs.LG

Interpreto: An Explainability Library for Transformers

Antonin Poch\'e , Thomas Mullor , Gabriele Sarti , Fr\'ed\'eric Boisnard , Corentin Friedrich , Charlotte Claye , Fran\c{c}ois Hoofd , Raphael Bernas

show 3 more authors

Nicholas Asher C\'eline Hudelot Fanny Jourdan

This is my paper

classification 💻 cs.CL cs.LG

keywords githubinterpretolibraryconcept-basedfor-sight-aihttpsmethodsactivation

0 comments

read the original abstract

Interpreto is an open-source Python library for interpreting HuggingFace language models, from early BERT variants to LLMs. It provides two complementary families of methods: attribution methods and concept-based explanations. The library bridges recent research and practical tooling by exposing explanation workflows through a unified API for both classification and text generation. A key differentiator is its end-to-end concept-based pipeline (from activation extraction to concept learning, interpretation, and scoring), which goes beyond feature-level attributions and is uncommon in existing libraries. See GitHub: https://github.com/FOR-sight-ai/interpreto and the demo website: https://for-sight-ai.github.io/interpreto-demo/.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

BrainSurgery: Reproducible and Reliable Declarative Weight Manipulations for Model Editing and Upcycling
cs.LG 2026-06 unverdicted novelty 6.0

Introduces BrainSurgery, a declarative YAML-based tool for reproducible tensor surgery on deep learning checkpoints with built-in validation assertions, shown via examples and case studies in model upcycling and LoRA ...