multiple times

Exact quantities are not required - "multiple times" matches "three times"

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Introspection Adapters: Training LLMs to Report Their Learned Behaviors

cs.AI · 2026-04-18 · unverdicted · novelty 6.0

Introspection adapters are LoRA adapters trained jointly across fine-tunes with implanted behaviors to make LLMs verbalize their learned behaviors, generalizing to detect hidden behaviors on AuditBench and encrypted attacks.

citing papers explorer

Showing 1 of 1 citing paper.

Introspection Adapters: Training LLMs to Report Their Learned Behaviors cs.AI · 2026-04-18 · unverdicted · none · ref 12
Introspection adapters are LoRA adapters trained jointly across fine-tunes with implanted behaviors to make LLMs verbalize their learned behaviors, generalizing to detect hidden behaviors on AuditBench and encrypted attacks.

multiple times

fields

years

verdicts

representative citing papers

citing papers explorer