hub Canonical reference

Spinning Language Models: Risks of Propaganda-As-A-Service and Countermeasures , url=

Hammond Pearce, Baleegh Ahmad, Benjamin Tan, Brendan Dolan-Gavitt, Ramesh Karri · 2022 · arXiv 6214.2022

Canonical reference. 94% of citing Pith papers cite this work as background.

38 Pith papers citing it

Background 94% of classified citations

read on arXiv browse 38 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 17

citation-polarity summary

background 16 support 1

representative citing papers

Cerisier: A Program Logic for Attestation in a Capability Machine

cs.PL · 2026-04-15 · unverdicted · novelty 8.0

Cerisier is the first mechanized program logic for modular reasoning about trusted, untrusted, and attested code in capability machines, with a universal contract for untrusted code and demonstrations on secure computation and mutual attestation.

PoisonForge: Task-Level Targeted Poisoning Benchmark for Instruction-Tuned LLMs

cs.CR · 2026-05-22 · accept · novelty 7.0

PoisonForge benchmark shows that 1% poisoned examples achieve over 70% attack success rate on targeted tasks across 11 of 12 tested LLMs with under 0.5% leakage to non-target tasks.

Single-Sample Black-Box Membership Inference Attack against Vision-Language Models via Cross-modal Semantic Alignment

cs.CV · 2026-05-17 · unverdicted · novelty 7.0

A cross-modal alignment attack achieves AUC 0.821 for single-sample black-box membership inference on VLMs such as LLaVA-1.5 by quantifying image-generated caption similarity.

Reconstruction of Personally Identifiable Information from Supervised Finetuned Models

cs.CR · 2026-05-12 · unverdicted · novelty 7.0

PII can be reconstructed from SFT models via prefix attacks, with the new COVA algorithm improving success rates and leakage varying by attacker knowledge and PII type.

Zombies in Alternate Realities: The Afterlife of Domain Names in DNS Integrations

cs.CR · 2026-05-07 · unverdicted · novelty 7.0

Zombie domain linkages persist after ownership changes in DNS integrations at rates of 3% in Web PKI, 24% in ENS, and 15% in Maven Central, with validate-once designs accumulating long-term risks while per-use validation prevents them.

Styx: Collaborative and Private Data Processing With TEE-Enforced Sticky Policy

cs.CR · 2026-04-05 · unverdicted · novelty 7.0

Styx integrates sticky policies with TEEs to enforce data-specific rules throughout the full lifecycle in multi-party collaborative computing.

Grassroots Bonds as a Foundation for Market Liquidity

cs.DC · 2026-03-14 · unverdicted · novelty 7.0

Grassroots bonds add maturity dates to local cryptocurrencies to enable lending and other instruments via enforceable digital social contracts.

SynBench: A Benchmark for Differentially Private Text Generation

cs.AI · 2025-09-18 · conditional · novelty 7.0

SynBench benchmarks DP text generators across nine datasets and uses a new MIA to show that public pre-training on portions of private data overestimates synthetic text quality and breaks DP privacy bounds.

Fast Byzantine Total Order Broadcast

cs.DC · 2024-12-18 · unverdicted · novelty 7.0

Flutter achieves 2Δ + ε good-case latency for Byzantine Total Order Broadcast via a new binary consensus called Blink, under partial synchrony with 5f+1 servers.

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

cs.AI · 2024-06-14 · conditional · novelty 7.0

LLMs trained on simple specification gaming generalize to zero-shot reward tampering including rewriting their own reward function.

GRASP -- Graph-Based Anomaly Detection Through Self-Supervised Classification

cs.CR · 2026-05-08 · unverdicted · novelty 6.0 · 2 refs

GRASP detects anomalies in system provenance graphs via self-supervised executable prediction from two-hop neighborhoods, outperforming prior PIDS on DARPA datasets by identifying all documented attacks where behaviors are learnable plus additional unlabeled suspicious activity.

EASE: Federated Multimodal Unlearning via Entanglement-Aware Anchor Closure

cs.NI · 2026-05-01 · unverdicted · novelty 6.0

EASE closes three residual anchors in federated multimodal unlearning using bilateral displacement, cosine-sine decomposition, and forget lock, achieving near-retrain performance on forget and retain data.

CuLifter: Lifting GPU Binaries to Typed IR

cs.AR · 2026-04-30 · unverdicted · novelty 6.0

CuLifter recovers types from untyped GPU register files via constraint propagation to lift 99.98% of 24,437 functions across 919 cubins to valid LLVM IR.

When AI reviews science: Can we trust the referee?

cs.AI · 2026-04-26 · unverdicted · novelty 6.0

AI peer review systems are vulnerable to prompt injections, prestige biases, assertion strength effects, and contextual poisoning, as demonstrated by a new attack taxonomy and causal experiments on real conference submissions.

VRSafe: A Secure Virtual Keyboard to Mitigate Keystroke Inference in Virtual Reality

cs.CR · 2026-04-22 · unverdicted · novelty 6.0

VRSafe adds false positive keystrokes to VR typing data to reduce keystroke inference attack accuracy and includes an efficient malicious login detector.

BONSAI: A Mixed-Initiative Workspace for Human-AI Co-Development of Visual Analytics Applications

cs.HC · 2026-04-21 · unverdicted · novelty 6.0

BONSAI introduces a four-layer architecture and four-phase workflow for human-AI co-development of visual analytics applications, shown in case studies to enable efficient novel tool creation and reconstruction from paper descriptions.

KindHML: formal verification of smart contracts based on Hennessy-Milner logic

cs.CR · 2026-04-15 · unverdicted · novelty 6.0

An encoding of Solidity contracts and first-order Hennessy-Milner logic into Lustre enables Kind 2 model checking of complex temporal properties in smart contracts.

GPIR: Enabling Practical Private Information Retrieval with GPUs

cs.CR · 2026-04-06 · unverdicted · novelty 6.0

GPIR achieves up to 297 times higher throughput than prior GPU PIR systems by fusing operations in stages and using pipelined transposed layouts to cut DRAM traffic during batched lattice-based queries.

Tracking Capabilities for Safer Agents

cs.AI · 2026-03-01 · unverdicted · novelty 6.0

AI agents can generate code in a capability-safe Scala dialect that statically prevents information leakage and malicious side effects while preserving task performance.

Capacitive Touchscreens at Risk: Recovering Handwritten Trajectory on Smartphone via Electromagnetic Emanations

cs.CR · 2025-12-12 · unverdicted · novelty 6.0

TESLA recovers 2D handwriting trajectories from touchscreen EM emanations on COTS smartphones, achieving 77% character recognition accuracy and 0.74 Jaccard index under realistic conditions.

Automated Side-Channel Analysis of Cryptographic Protocol Implementations

cs.CR · 2025-11-14 · unverdicted · novelty 6.0

The authors built an automated toolchain that extracts symbolic models from real binaries of cryptographic protocols and analyzes them for constant-time and speculative side-channel leaks, demonstrated on WhatsApp and e-passport implementations.

Ablating Safety: Mechanisms for Removing Alignment in Language Models for Security Applications

cs.CR · 2026-05-17 · unverdicted · novelty 5.0

Empirical comparison of alignment ablation methods on a 60-prompt security evaluation suite shows task-only LoRA achieves 0.87 mean security score with 0.13 unsafe compliance.

Understanding Student Experiences with TLS Client Authentication

cs.CR · 2026-04-15 · unverdicted · novelty 5.0

A longitudinal study of 46 CS students finds that configuring and using mTLS client certificates is difficult even for technical users, with only 9% understanding the security implications.

Evaluating Differential Privacy Against Membership Inference in Federated Learning: Insights from the NIST Genomics Red Team Challenge

cs.CR · 2026-04-14 · unverdicted · novelty 5.0

Stacking seven black-box estimators into a meta-classifier reveals persistent membership leakage in differentially private federated learning models at epsilon=200 on NIST genomics data, outperforming single-signal baselines.

citing papers explorer

Showing 5 of 5 citing papers after filters.

REALISTA: Realistic Latent Adversarial Attacks that Elicit LLM Hallucinations cs.CL · 2026-05-12 · unreviewed · ref 95
Unlearning with Asymmetric Sources: Improved Unlearning-Utility Trade-off with Public Data cs.LG · 2026-05-11 · unreviewed · ref 230
AI Slop and the Software Commons cs.SE · 2026-04-17 · unreviewed · ref 9
Finding Memory Leaks in C/C++ Programs via Neuro-Symbolic Augmented Static Analysis cs.SE · 2026-03-28 · unreviewed · ref 45
Tuning for TraceTarnish: Techniques, Trends, and Testing Tangible Traits cs.CR · 2025-12-03 · unreviewed · ref 5

Spinning Language Models: Risks of Propaganda-As-A-Service and Countermeasures , url=

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer