Are Robust LLM Fingerprints Adversarially Robust?

Anshul Nasery, Edoardo Contente, Alkin Kaz, Pramod Viswanath, Sewoong Oh · 2025 · arXiv 2509.26598

2 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

KBF: Knowledge Boundary as Fingerprint for Language Model and Black-Box API Auditing

cs.CR · 2026-05-28 · unverdicted · novelty 7.0

KBF uses stable numerical recall near the knowledge boundary to fingerprint and audit black-box LLM APIs, successfully detecting all tested substitutions and some real-world inconsistencies across production endpoints.

Referential Security as a New Paradigm for AI Evaluations

cs.CR · 2026-05-25 · unverdicted · novelty 5.0

Proposes referential security as a paradigm for AI evaluations that reframes model identity as verifiable to support reproducible audits and regulatory decisions despite system changes.

citing papers explorer

KBF: Knowledge Boundary as Fingerprint for Language Model and Black-Box API Auditing · cs.CR · 2026-05-28 · unverdicted · none · ref 9
Referential Security as a New Paradigm for AI Evaluations · cs.CR · 2026-05-25 · unverdicted · none · ref 20
