Haicosystem: An ecosystem for sandboxing safety risks in human-ai interactions

Xuhui Zhou, Hyunwoo Kim, Faeze Brahman, Liwei Jiang, Hao Zhu, Ximing Lu, Frank F · 2025 · arXiv 2409.16427

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Reinforcing Human Behavior Simulation via Verbal Feedback

cs.LG · 2026-05-19 · unverdicted · novelty 6.0

DITTO uses RL with verbal feedback to train LLMs for human behavior simulation, reporting 36% average gains over base models and outperforming GPT-5.4 on 6 of 10 SOUL benchmark tasks.

Security Risks in Tool-Enabled AI Agents: A Systematic Analysis of Privileged Execution Environments

cs.CR · 2026-05-10 · unverdicted · novelty 5.0

The paper introduces a taxonomy of security risks in cloud-hosted tool-enabled AI agents arising mainly from over-privileged tools and authority leakage, supported by scenarios, mitigations, and a small experiment.

citing papers explorer

Showing 2 of 2 citing papers.

Reinforcing Human Behavior Simulation via Verbal Feedback cs.LG · 2026-05-19 · unverdicted · none · ref 61
DITTO uses RL with verbal feedback to train LLMs for human behavior simulation, reporting 36% average gains over base models and outperforming GPT-5.4 on 6 of 10 SOUL benchmark tasks.
Security Risks in Tool-Enabled AI Agents: A Systematic Analysis of Privileged Execution Environments cs.CR · 2026-05-10 · unverdicted · none · ref 21
The paper introduces a taxonomy of security risks in cloud-hosted tool-enabled AI agents arising mainly from over-privileged tools and authority leakage, supported by scenarios, mitigations, and a small experiment.

Haicosystem: An ecosystem for sandboxing safety risks in human-ai interactions

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer