hub Mixed citations

Prompt Injection Attacks and Defenses in LLM-Integrated Applications

Mixed citation behavior. Most common role is background (60%).

14 Pith papers citing it
Background 60% of classified citations

citation-role summary

background 4 · baseline 1

representative citing papers

Whispers in the Machine: Confidentiality in Agentic Systems

cs.CR · 2024-02-10 · unverdicted · novelty 6.0

Systematic testing of ten LLM agents across 20 tool scenarios and 14 attacks finds universal vulnerability to prompt injection enabling data exfiltration, with tooling amplifying leakage.

TrustLLM: Trustworthiness in Large Language Models

cs.CL · 2024-01-10 · unverdicted · novelty 5.0

TrustLLM defines eight trustworthiness principles, creates a six-dimension benchmark, and evaluates 16 LLMs, showing that proprietary models generally lead, that some open-source models come close, and that over-calibration can hurt utility.
