Wait, wait, wait

URLhttps://arxiv · 2002 · arXiv 2512.12895

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

representative citing papers

How Compliant Are GitHub Actions Workflows? A Checklist-Based Study with LLM-Assisted Auditing

cs.SE · 2026-05-03 · accept · novelty 6.0

GitHub Actions workflows achieve only 28% overall compliance with best practices, with LLMs enabling an 81% reduction in verification effort via hybrid adjudication but still requiring expert oversight for security judgments.

Escaping Mode Collapse in LLM Generation via Geometric Regulation

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

Reinforced Mode Regulation (RMR) uses low-rank damping on the value cache to prevent geometric collapse and mode collapse in autoregressive LLM generation, supporting stable output down to 0.8 nats/step entropy.

Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability

cs.AI · 2026-04-08 · unverdicted · novelty 6.0

Reasoning SFT generalizes cross-domain conditionally on sufficient optimization, high-quality long-CoT data, and strong base models, while degrading safety.

Agents of Chaos

cs.AI · 2026-02-23 · unverdicted · novelty 6.0

An exploratory red-teaming study documents eleven cases of security, privacy, and governance failures in autonomous language-model agents with tool access and persistent memory.

citing papers explorer

Showing 4 of 4 citing papers.

How Compliant Are GitHub Actions Workflows? A Checklist-Based Study with LLM-Assisted Auditing cs.SE · 2026-05-03 · accept · none · ref 34
GitHub Actions workflows achieve only 28% overall compliance with best practices, with LLMs enabling an 81% reduction in verification effort via hybrid adjudication but still requiring expert oversight for security judgments.
Escaping Mode Collapse in LLM Generation via Geometric Regulation cs.CL · 2026-05-01 · unverdicted · none · ref 7
Reinforced Mode Regulation (RMR) uses low-rank damping on the value cache to prevent geometric collapse and mode collapse in autoregressive LLM generation, supporting stable output down to 0.8 nats/step entropy.
Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability cs.AI · 2026-04-08 · unverdicted · none · ref 4
Reasoning SFT generalizes cross-domain conditionally on sufficient optimization, high-quality long-CoT data, and strong base models, while degrading safety.
Agents of Chaos cs.AI · 2026-02-23 · unverdicted · none · ref 8
An exploratory red-teaming study documents eleven cases of security, privacy, and governance failures in autonomous language-model agents with tool access and persistent memory.

Wait, wait, wait

fields

years

verdicts

representative citing papers

citing papers explorer