Title resolution pending

Yang, Shu, Zhu, Shenzhe, Wu, Zeyu, Wang, Keyu, Yao, Junchi, Wu, Junchao · 2025 · DOI 10.18653/v1/2025.findings-acl.226

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Harness-MU: A Safe, Governed, and Effective Harness for Multi-User LLM Agents

cs.CR · 2026-06-20 · unverdicted · novelty 7.0

Harness-MU is a zero-tuning infrastructure that decouples safety orchestration from language generation in multi-user LLM agents, achieving full privacy preservation on Muses-Bench while improving utility and instruction-following over baselines.

Agentic Relationship Harm: Benchmarking and Gating Relational Manipulation in AI Agents

cs.HC · 2026-06-02 · unverdicted · novelty 6.0

Presents a new benchmark and role-sensitive policy gate for agentic relationship harm that outperforms generic safety prompting with zero harmful compliance in tests.

citing papers explorer

Showing 2 of 2 citing papers.

Harness-MU: A Safe, Governed, and Effective Harness for Multi-User LLM Agents cs.CR · 2026-06-20 · unverdicted · none · ref 24
Harness-MU is a zero-tuning infrastructure that decouples safety orchestration from language generation in multi-user LLM agents, achieving full privacy preservation on Muses-Bench while improving utility and instruction-following over baselines.
Agentic Relationship Harm: Benchmarking and Gating Relational Manipulation in AI Agents cs.HC · 2026-06-02 · unverdicted · none · ref 27
Presents a new benchmark and role-sensitive policy gate for agentic relationship harm that outperforms generic safety prompting with zero harmful compliance in tests.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer