ISBN 979-8-89176-251-0

Association for Computational Linguistics · 2025 · DOI 10.18653/v1/2025.acl-long

14 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 2

citation-polarity summary

background 1 unclear 1

representative citing papers

Entropy-informed Decoding: Adaptive Information-Driven Branching

cs.LG · 2026-05-10 · unverdicted · novelty 7.0

EDEN adaptively sets branching factor proportional to next-token entropy, achieving better accuracy per expansion than fixed beam search while providing a proof that monotone entropy-based branching outperforms any fixed budget allocation.

DialectLLM: A Dialect-Aware Dialog[ue] Generation Framework Beyond Standard American English

cs.CL · 2026-01-30 · unverdicted · novelty 7.0

DialectLLM generates parallel multi-dialect dialog data and a 50k-dialog benchmark showing frontier LLMs achieve under 70% accuracy on dialect tasks while the generated data can improve post-training.

Towards Direct Evaluation of Harness Optimizers via Priority Ranking

cs.AI · 2026-05-21 · unverdicted · novelty 6.0

Priority ranking offers a low-cost direct evaluation for harness optimizers that correlates with their real multi-step optimization performance, supported by the Shor dataset of 182 scenarios.

ConfLayers: Adaptive Confidence-based Layer Skipping for Self-Speculative Decoding

cs.LG · 2026-04-16 · unverdicted · novelty 6.0

ConfLayers dynamically skips LLM layers based on confidence scores to create adaptive draft models for self-speculative decoding, reporting up to 1.4x speedup over standard generation.

Do Reasoning LLMs Refuse What They Infer in Long Contexts?

cs.CL · 2026-02-09 · unverdicted · novelty 6.0

Long-context LLMs refuse explicit harmful requests but often comply when the same harmful goals must be inferred from distributed fragments in long contexts.

WorldCup Sampling for Multi-bit LLM Watermarking

cs.CL · 2026-02-02 · unverdicted · novelty 6.0

WorldCup is a new multi-bit LLM watermarking framework that models token sampling as a communication channel and uses hierarchical competition with entropy-aware modulation for robust message embedding and recovery.

Enhancing Table Reasoning with Deterministic Table-State Rewards

cs.AI · 2026-01-30 · unverdicted · novelty 6.0

RE-TAB uses a deterministic LCS-based table-state reward for stepwise guidance and test-time scaling, raising LLM table-reasoning accuracy by 26.7 pp on average across six backbones and three benchmarks.

SenSE: Semantic-Aware High-Fidelity Universal Speech Enhancement

eess.AS · 2025-09-29 · unverdicted · novelty 6.0

SenSE adds language-model semantic guidance to flow-matching generative speech enhancement via a dual-path masked conditioning strategy and reports SOTA results on distorted speech.

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

cs.AI · 2025-09-02 · accept · novelty 6.0

Survey that defines agentic RL for LLMs via POMDPs, introduces a taxonomy of planning/tool-use/memory/reasoning capabilities and domains, and compiles open environments from over 500 papers.

LIFT: A Novel Framework for Enhancing Long-Context Understanding of LLMs via Long Input Fine-Tuning

cs.CL · 2025-02-20 · unverdicted · novelty 6.0

LIFT fine-tunes short-context LLMs on long inputs with synthetic tasks to absorb information into parameters, enabling answers without the input present at inference.

Shared Lexical Task Representations Explain Behavioral Variability In LLMs

cs.CL · 2026-04-23 · unverdicted · novelty 5.0

LLMs share task-specific attention heads across prompting styles, with activation strength explaining performance differences and failures arising from competing representations.

Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future

cs.CL · 2026-04-30 · unverdicted · novelty 4.0

A survey synthesizing LLM methods for peer review generation, post-review tasks like rebuttals and meta-reviews, evaluation approaches, datasets, and future directions in AI-assisted academic publishing.

ANCHOR: Abductive Network Construction with Hierarchical Orchestration for Reliable Probability Inference in Large Language Models

cs.CL · 2026-05-11 · 2 refs

Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook

cs.CL · 2026-03-16

citing papers explorer

Showing 14 of 14 citing papers.

Entropy-informed Decoding: Adaptive Information-Driven Branching · cs.LG · 2026-05-10 · unverdicted · none · ref 6
DialectLLM: A Dialect-Aware Dialog[ue] Generation Framework Beyond Standard American English · cs.CL · 2026-01-30 · unverdicted · none · ref 10
Towards Direct Evaluation of Harness Optimizers via Priority Ranking · cs.AI · 2026-05-21 · unverdicted · none · ref 4
ConfLayers: Adaptive Confidence-based Layer Skipping for Self-Speculative Decoding · cs.LG · 2026-04-16 · unverdicted · none · ref 1
Do Reasoning LLMs Refuse What They Infer in Long Contexts? · cs.CL · 2026-02-09 · unverdicted · none · ref 7
WorldCup Sampling for Multi-bit LLM Watermarking · cs.CL · 2026-02-02 · unverdicted · none · ref 1
Enhancing Table Reasoning with Deterministic Table-State Rewards · cs.AI · 2026-01-30 · unverdicted · none · ref 13
SenSE: Semantic-Aware High-Fidelity Universal Speech Enhancement · eess.AS · 2025-09-29 · unverdicted · none · ref 14
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey · cs.AI · 2025-09-02 · accept · none · ref 71
LIFT: A Novel Framework for Enhancing Long-Context Understanding of LLMs via Long Input Fine-Tuning · cs.CL · 2025-02-20 · unverdicted · none · ref 2
Shared Lexical Task Representations Explain Behavioral Variability In LLMs · cs.CL · 2026-04-23 · unverdicted · none · ref 4
Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future · cs.CL · 2026-04-30 · unverdicted · none · ref 3
ANCHOR: Abductive Network Construction with Hierarchical Orchestration for Reliable Probability Inference in Large Language Models · cs.CL · 2026-05-11 · unreviewed · ref 7 · 2 links
Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook · cs.CL · 2026-03-16 · unreviewed · ref 12
