Fundamental limitations of alignment in large language models

Yotam Wolf, Noam Wies, Yoav Levine, Amnon Shashua · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Prompt Injection attack against LLM-integrated Applications

cs.CR · 2023-06-08 · accept · novelty 6.0

HouYi enables prompt injection attacks that grant arbitrary LLM control and steal application prompts in 31 out of 36 tested real-world LLM-integrated applications.

citing papers explorer

Showing 1 of 1 citing paper.

Prompt Injection attack against LLM-integrated Applications cs.CR · 2023-06-08 · accept · none · ref 64
HouYi enables prompt injection attacks that grant arbitrary LLM control and steal application prompts in 31 out of 36 tested real-world LLM-integrated applications.

Fundamental limitations of alignment in large language models

fields

years

verdicts

representative citing papers

citing papers explorer