Idiosyncrasies in large language models

URLhttps://arxiv · 2025 · arXiv 2502.12150

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

Machine individuality: Separating genuine idiosyncrasy from response bias in large language models

cs.AI · 2026-04-18 · unverdicted · novelty 7.0

Crossed random-effects models on LLM word ratings show 16.9% variance from genuine stimulus-specific individuality, exceeding null models and forming coherent per-model fingerprints.

Asking Back: Interaction-Layer Antidistillation Watermarks

cs.CR · 2026-05-15 · unverdicted · novelty 6.0

Interaction-layer antidistillation watermarks use system-prompt-induced behavioral markers like explicit follow-up questions that transfer to distilled student models at 45-89% relative fidelity and can be audited via black-box LLM-as-judge queries.

ACF: A Collaborative Framework for Agent Covert Communication under Cognitive Asymmetry

cs.AI · 2026-04-09 · unverdicted · novelty 6.0

ACF structurally decouples covert communication from semantic reasoning in agent networks using a shared steganographic configuration to maintain performance under cognitive asymmetry.

RedNote-Vibe: A Dataset for Capturing Temporal Dynamics of AI-Generated Text in Lifestyle Social Media

cs.CL · 2025-09-26 · unverdicted · novelty 6.0

RedNote-Vibe supplies a longitudinal dataset of AI versus human lifestyle posts from 2020 to mid-2025 plus the PLAD detection framework that applies cognitive psychology signatures for improved AI-text identification.

citing papers explorer

Showing 4 of 4 citing papers.

Machine individuality: Separating genuine idiosyncrasy from response bias in large language models cs.AI · 2026-04-18 · unverdicted · none · ref 5
Crossed random-effects models on LLM word ratings show 16.9% variance from genuine stimulus-specific individuality, exceeding null models and forming coherent per-model fingerprints.
Asking Back: Interaction-Layer Antidistillation Watermarks cs.CR · 2026-05-15 · unverdicted · none · ref 34
Interaction-layer antidistillation watermarks use system-prompt-induced behavioral markers like explicit follow-up questions that transfer to distilled student models at 45-89% relative fidelity and can be audited via black-box LLM-as-judge queries.
ACF: A Collaborative Framework for Agent Covert Communication under Cognitive Asymmetry cs.AI · 2026-04-09 · unverdicted · none · ref 36
ACF structurally decouples covert communication from semantic reasoning in agent networks using a shared steganographic configuration to maintain performance under cognitive asymmetry.
RedNote-Vibe: A Dataset for Capturing Temporal Dynamics of AI-Generated Text in Lifestyle Social Media cs.CL · 2025-09-26 · unverdicted · none · ref 18
RedNote-Vibe supplies a longitudinal dataset of AI versus human lifestyle posts from 2020 to mid-2025 plus the PLAD detection framework that applies cognitive psychology signatures for improved AI-text identification.

Idiosyncrasies in large language models

fields

years

verdicts

representative citing papers

citing papers explorer