Currently, the reasoning is based on a simple chain of thought without any validation of the reasoning steps

The reasoning capabilities of GuardAgent can be further enhanced · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning

cs.LG · 2024-06-13 · unverdicted · novelty 6.0

GuardAgent safeguards LLM agents by generating task plans from safety requests and mapping them to executable guardrail code, achieving over 98% accuracy on a healthcare access-control benchmark and 83% on a web safety benchmark.

citing papers explorer

Showing 1 of 1 citing paper.

GuardAgent: Safeguard LLM Agents by a Guard Agent via Knowledge-Enabled Reasoning cs.LG · 2024-06-13 · unverdicted · none · ref 13
GuardAgent safeguards LLM agents by generating task plans from safety requests and mapping them to executable guardrail code, achieving over 98% accuracy on a healthcare access-control benchmark and 83% on a web safety benchmark.

Currently, the reasoning is based on a simple chain of thought without any validation of the reasoning steps

fields

years

verdicts

representative citing papers

citing papers explorer