pith. sign in

Exploiting GPT-3 prompts with malicious inputs that order the model to ignore its previous directions

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

fields

cs.CL 1 cs.CR 1

years

2024 1 2022 1

verdicts

UNVERDICTED 2

roles

background 2

polarities

background 2

representative citing papers

citing papers explorer

Showing 2 of 2 citing papers.