ISBN 979-8-89176-189-6

URL https://openreview · 2024 · DOI 10.18653/v1/2025.naacl-long

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

open at publisher browse 8 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack

cs.CR · 2026-05-01 · unverdicted · novelty 8.0

STARE uses step-wise RL to attack multimodal models, achieving 68% higher attack success rate while revealing that adversarial optimization concentrates conceptual toxicity early and detail toxicity late in the generation trajectory.

Entropy-informed Decoding: Adaptive Information-Driven Branching

cs.LG · 2026-05-10 · unverdicted · novelty 7.0

EDEN adaptively sets branching factor proportional to next-token entropy, achieving better accuracy per expansion than fixed beam search while providing a proof that monotone entropy-based branching outperforms any fixed budget allocation.

EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions

cs.AI · 2026-05-04 · unverdicted · novelty 7.0

EngiAgent deploys a fully connected multi-agent coordinator to achieve higher feasibility rates when using LLMs to solve open-ended engineering problems under physical and data constraints.

FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients

cs.CL · 2026-04-29 · unverdicted · novelty 7.0

FlowBot automatically induces LLM workflows through bilevel optimization with textual gradients, achieving competitive performance against human-crafted baselines.

CREATE: Testing LLMs for Associative Creativity

cs.CL · 2026-03-10 · unverdicted · novelty 7.0

CREATE is a benchmark that scores LLMs on their ability to produce many specific and diverse associative paths between concepts drawn from parametric knowledge.

CL-bench Life: Can Language Models Learn from Real-Life Context?

cs.CL · 2026-04-29 · unverdicted · novelty 6.0

CL-bench Life shows frontier language models achieve only 13.8% average success on real-life context tasks, with the best model at 19.3%.

BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate

cs.CL · 2026-04-28 · unverdicted · novelty 6.0

BARRED uses dimension decomposition and asymmetric multi-agent debate to generate high-fidelity synthetic data that lets small fine-tuned models outperform proprietary LLMs and existing guardrail models on custom policies.

Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook

cs.CL · 2026-03-16 · unverdicted · novelty 6.0

DOVE constructs a value codebook via rate-distortion variational optimization from 10K documents and measures LLM-human cultural alignment through unbalanced optimal transport, showing 31.56% correlation with downstream tasks and reliability at 500 samples per culture.

citing papers explorer

Showing 8 of 8 citing papers.

STARE: Step-wise Temporal Alignment and Red-teaming Engine for Multi-modal Toxicity Attack cs.CR · 2026-05-01 · unverdicted · none · ref 9
STARE uses step-wise RL to attack multimodal models, achieving 68% higher attack success rate while revealing that adversarial optimization concentrates conceptual toxicity early and detail toxicity late in the generation trajectory.
Entropy-informed Decoding: Adaptive Information-Driven Branching cs.LG · 2026-05-10 · unverdicted · none · ref 11
EDEN adaptively sets branching factor proportional to next-token entropy, achieving better accuracy per expansion than fixed beam search while providing a proof that monotone entropy-based branching outperforms any fixed budget allocation.
EngiAgent: Fully Connected Coordination of LLM Agents for Solving Open-ended Engineering Problems with Feasible Solutions cs.AI · 2026-05-04 · unverdicted · none · ref 2
EngiAgent deploys a fully connected multi-agent coordinator to achieve higher feasibility rates when using LLMs to solve open-ended engineering problems under physical and data constraints.
FlowBot: Inducing LLM Workflows with Bilevel Optimization and Textual Gradients cs.CL · 2026-04-29 · unverdicted · none · ref 2
FlowBot automatically induces LLM workflows through bilevel optimization with textual gradients, achieving competitive performance against human-crafted baselines.
CREATE: Testing LLMs for Associative Creativity cs.CL · 2026-03-10 · unverdicted · none · ref 1
CREATE is a benchmark that scores LLMs on their ability to produce many specific and diverse associative paths between concepts drawn from parametric knowledge.
CL-bench Life: Can Language Models Learn from Real-Life Context? cs.CL · 2026-04-29 · unverdicted · none · ref 31
CL-bench Life shows frontier language models achieve only 13.8% average success on real-life context tasks, with the best model at 19.3%.
BARRED: Synthetic Training of Custom Policy Guardrails via Asymmetric Debate cs.CL · 2026-04-28 · unverdicted · none · ref 1
BARRED uses dimension decomposition and asymmetric multi-agent debate to generate high-fidelity synthetic data that lets small fine-tuned models outperform proprietary LLMs and existing guardrail models on custom policies.
Distributional Open-Ended Evaluation of LLM Cultural Value Alignment Based on Value Codebook cs.CL · 2026-03-16 · unverdicted · none · ref 21
DOVE constructs a value codebook via rate-distortion variational optimization from 10K documents and measures LLM-human cultural alignment through unbalanced optimal transport, showing 31.56% correlation with downstream tasks and reliability at 500 samples per culture.

ISBN 979-8-89176-189-6

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer