ISBN 979-8-89176-251-0

Association for Computational Linguistics · 2025 · DOI 10.18653/v1/2025.acl-long

25 Pith papers cite this work. Polarity classification is still being indexed.

citation-role summary

background: 2

citation-polarity summary

background: 1 · unclear: 1

representative citing papers

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing

cs.CL · 2026-06-03 · unverdicted · novelty 7.0

PERSUASIONTRACE introduces a Bayesian-network simulated target for multi-turn persuasion that matches human belief dynamics (81 vs 80) better than LLM baselines (64) and enables process-level evaluation.

Re-Ranking Through an Attribution Lens for Citation Quality in Legal QA

cs.CL · 2026-06-02 · unverdicted · novelty 7.0

Re-ranking retrieval candidates via a cross-encoder trained on continuous perturbation-based attribution scores improves citation faithfulness and gold-answer alignment in legal QA over semantic similarity.

Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox

cs.SD · 2026-05-26 · unverdicted · novelty 7.0

Audio LLMs fail to use paralinguistic audio information and default to transcript content; a new adversarial benchmark plus PCLM and DPO training raise accuracy on VoxParadox from 17.4% to 65.2%.

Entropy-informed Decoding: Adaptive Information-Driven Branching

cs.LG · 2026-05-10 · unverdicted · novelty 7.0

EDEN adaptively sets branching factor proportional to next-token entropy, achieving better accuracy per expansion than fixed beam search while providing a proof that monotone entropy-based branching outperforms any fixed budget allocation.

DialectLLM: A Dialect-Aware Dialog[ue] Generation Framework Beyond Standard American English

cs.CL · 2026-01-30 · unverdicted · novelty 7.0

DialectLLM generates parallel multi-dialect dialog data and a 50k-dialog benchmark showing frontier LLMs achieve under 70% accuracy on dialect tasks while the generated data can improve post-training.

Soft Token Alignment for Cross-Lingual Reasoning

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

SOLAR aligns soft-token probability mixtures across languages in embedding space during SFT and raises multilingual reasoning accuracy by up to 17.7 points over the base model.

Local Causal Attribution of Chain-of-Thought Reasoning

cs.LG · 2026-06-20 · unverdicted · novelty 6.0

AttriCoT is a black-box algorithm that attributes causal importance to units in a specific CoT trace via a structural causal model estimated with linear forward passes.

AdaMEM: Test-Time Adaptive Memory for Language Agents

cs.AI · 2026-06-04 · unverdicted · novelty 6.0

AdaMEM proposes hybrid long-term and short-term memory for test-time adaptation in language agents, reporting relative gains of up to 13% on ALFWorld and 11% on WebShop over static baselines.

D-Judge: Disrupting Multi-Turn Jailbreaks using Semantics-Preserving Output Rewriting

cs.CR · 2026-05-31 · unverdicted · novelty 6.0

D-Judge applies semantics-preserving output rewriting, trained via SFT and DPO on paired responses that differ in judge scores, to disrupt multi-turn jailbreak refinement loops and reduce attack success on HarmBench.

JuICE: A Benchmark for Evaluating LLM-Judge in Identifying Cultural Errors

cs.CL · 2026-05-26 · unverdicted · novelty 6.0

JuICE is a new multilingual benchmark dataset showing top LLM judges reach only F1 0.52 on span-level cultural error detection and miss errors locals readily spot.

Towards Direct Evaluation of Harness Optimizers via Priority Ranking

cs.AI · 2026-05-21 · unverdicted · novelty 6.0

Priority ranking offers a low-cost direct evaluation for harness optimizers that correlates with their real multi-step optimization performance, supported by the Shor dataset of 182 scenarios.

From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

Proposes forward replay of target hidden states from the first editing layer instead of backward spreading, claiming equivalent complexity but higher accuracy for LLM parameter editing.

ConfLayers: Adaptive Confidence-based Layer Skipping for Self-Speculative Decoding

cs.LG · 2026-04-16 · unverdicted · novelty 6.0

ConfLayers dynamically skips LLM layers based on confidence scores to create adaptive draft models for self-speculative decoding, reporting up to 1.4x speedup over standard generation.

Do Reasoning LLMs Refuse What They Infer in Long Contexts?

cs.CL · 2026-02-09 · unverdicted · novelty 6.0

Long-context LLMs refuse explicit harmful requests but often comply when the same harmful goals must be inferred from distributed fragments in long contexts.

WorldCup Sampling for Multi-bit LLM Watermarking

cs.CL · 2026-02-02 · unverdicted · novelty 6.0

WorldCup is a new multi-bit LLM watermarking framework that models token sampling as a communication channel and uses hierarchical competition with entropy-aware modulation for robust message embedding and recovery.

Enhancing Table Reasoning with Deterministic Table-State Rewards

cs.AI · 2026-01-30 · unverdicted · novelty 6.0

RE-TAB uses a deterministic LCS-based table-state reward for stepwise guidance and test-time scaling, raising LLM table-reasoning accuracy by 26.7 pp on average across six backbones and three benchmarks.

SenSE: Semantic-Aware High-Fidelity Universal Speech Enhancement

eess.AS · 2025-09-29 · unverdicted · novelty 6.0

SenSE adds language-model semantic guidance to flow-matching generative speech enhancement via a dual-path masked conditioning strategy and reports SOTA results on distorted speech.

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

cs.AI · 2025-09-02 · accept · novelty 6.0

Survey that defines agentic RL for LLMs via POMDPs, introduces a taxonomy of planning/tool-use/memory/reasoning capabilities and domains, and compiles open environments from over 500 papers.

Quantifying the Salience of Geo-Cultural Values for Pluralistic Safety Alignment

cs.CY · 2026-05-29 · unverdicted · novelty 5.0

Cultural zones explain variance in safety ratings beyond demographics across six datasets, with roughly 10% of items identified as culturally sensitive.

ANCHOR: Abductive Network Construction with Hierarchical Orchestration for Reliable Probability Inference in Large Language Models

cs.CL · 2026-05-11 · unverdicted · novelty 4.0 · 3 refs

ANCHOR uses hierarchical factor construction and causal Bayesian networks to reduce unknown predictions and improve reliability of LLM-based probability inference over prior Naive Bayes approaches.

Learning in the Fisher Subspace: A Guided Initialization for LoRA Fine-Tuning

cs.LG · 2026-05-01 · unverdicted · novelty 4.0

Fisher information from the target data distribution supplies a task-dependent criterion for selecting LoRA directions that outperforms weight-magnitude heuristics.

Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future

cs.CL · 2026-04-30 · unverdicted · novelty 4.0

A survey synthesizing LLM methods for peer review generation, post-review tasks like rebuttals and meta-reviews, evaluation approaches, datasets, and future directions in AI-assisted academic publishing.

Shared Lexical Task Representations Explain Behavioral Variability In LLMs

cs.CL · 2026-04-23

Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook

cs.CL · 2026-03-16

citing papers explorer

Showing 22 of 22 citing papers after filters.

A Model of Multi-turn Human Persuadability Using Probabilistic Belief Tracing · cs.CL · 2026-06-03 · unverdicted · none · ref 39
Re-Ranking Through an Attribution Lens for Citation Quality in Legal QA · cs.CL · 2026-06-02 · unverdicted · none · ref 7
Do Audio LLMs Listen or Read? Analyzing and Mitigating Paralinguistic Failures with VoxParadox · cs.SD · 2026-05-26 · unverdicted · none · ref 7
Entropy-informed Decoding: Adaptive Information-Driven Branching · cs.LG · 2026-05-10 · unverdicted · none · ref 6
DialectLLM: A Dialect-Aware Dialog[ue] Generation Framework Beyond Standard American English · cs.CL · 2026-01-30 · unverdicted · none · ref 10
Soft Token Alignment for Cross-Lingual Reasoning · cs.CL · 2026-06-25 · unverdicted · none · ref 30
Local Causal Attribution of Chain-of-Thought Reasoning · cs.LG · 2026-06-20 · unverdicted · none · ref 14
AdaMEM: Test-Time Adaptive Memory for Language Agents · cs.AI · 2026-06-04 · unverdicted · none · ref 2
D-Judge: Disrupting Multi-Turn Jailbreaks using Semantics-Preserving Output Rewriting · cs.CR · 2026-05-31 · unverdicted · none · ref 4
JuICE: A Benchmark for Evaluating LLM-Judge in Identifying Cultural Errors · cs.CL · 2026-05-26 · unverdicted · none · ref 43
Towards Direct Evaluation of Harness Optimizers via Priority Ranking · cs.AI · 2026-05-21 · unverdicted · none · ref 4
From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing · cs.CL · 2026-05-01 · unverdicted · none · ref 6
ConfLayers: Adaptive Confidence-based Layer Skipping for Self-Speculative Decoding · cs.LG · 2026-04-16 · unverdicted · none · ref 1
Do Reasoning LLMs Refuse What They Infer in Long Contexts? · cs.CL · 2026-02-09 · unverdicted · none · ref 7
WorldCup Sampling for Multi-bit LLM Watermarking · cs.CL · 2026-02-02 · unverdicted · none · ref 1
Enhancing Table Reasoning with Deterministic Table-State Rewards · cs.AI · 2026-01-30 · unverdicted · none · ref 13
Quantifying the Salience of Geo-Cultural Values for Pluralistic Safety Alignment · cs.CY · 2026-05-29 · unverdicted · none · ref 6
ANCHOR: Abductive Network Construction with Hierarchical Orchestration for Reliable Probability Inference in Large Language Models · cs.CL · 2026-05-11 · unverdicted · none · ref 7 · 3 links
Learning in the Fisher Subspace: A Guided Initialization for LoRA Fine-Tuning · cs.LG · 2026-05-01 · unverdicted · none · ref 7
Can AI Be a Good Peer Reviewer? A Survey of Peer Review Process, Evaluation, and the Future · cs.CL · 2026-04-30 · unverdicted · none · ref 3
Shared Lexical Task Representations Explain Behavioral Variability In LLMs · cs.CL · 2026-04-23 · unreviewed · ref 4
Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook · cs.CL · 2026-03-16 · unreviewed · ref 12