Title resolution pending

Heng Jin, Chaoyu Zhang, Shanghao Shi, Wenjing Lou · 2024 · arXiv 2405.02466

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Intrinsic Fingerprint of LLMs: Continue Training is NOT All You Need to Steal A Model!

cs.CR · 2025-07-02 · unverdicted · novelty 6.0

Standard deviation distributions of attention matrices in LLMs remain distinctive and stable after continued training, enabling fingerprinting to trace model lineage and detect potential plagiarism such as in Pangu Pro MoE.

Peering Behind the Shield: Guardrail Identification in Large Language Models

cs.CR · 2025-02-03 · unverdicted · novelty 6.0

AP-Test identifies deployed guardrails in LLMs via adversarial prompt testing and a match score metric, reporting perfect accuracy on four open-source guardrails.

citing papers explorer

Showing 2 of 2 citing papers.

Intrinsic Fingerprint of LLMs: Continue Training is NOT All You Need to Steal A Model! cs.CR · 2025-07-02 · unverdicted · none · ref 12
Standard deviation distributions of attention matrices in LLMs remain distinctive and stable after continued training, enabling fingerprinting to trace model lineage and detect potential plagiarism such as in Pangu Pro MoE.
Peering Behind the Shield: Guardrail Identification in Large Language Models cs.CR · 2025-02-03 · unverdicted · none · ref 22
AP-Test identifies deployed guardrails in LLMs via adversarial prompt testing and a match score metric, reporting perfect accuracy on four open-source guardrails.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer