LBPE : Long-token-first Tokenization to Improve Large Language Models

Lian, H · 2024 · arXiv 2404.18553

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Breaking Safety at the Token Boundary: How BPE Tokenization Creates Exploitable Gaps in LLM Alignment

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

BPE tokenization creates exploitable gaps in LLM safety by fragmenting safety words, enabling attacks that flip refusal on 80-100% of HarmBench prompts across five models, with DPO failing to close the gap stably and SFT causing over-refusal.

citing papers explorer

Showing 1 of 1 citing paper.

Breaking Safety at the Token Boundary: How BPE Tokenization Creates Exploitable Gaps in LLM Alignment cs.CL · 2026-05-01 · unverdicted · none · ref 21
BPE tokenization creates exploitable gaps in LLM safety by fragmenting safety words, enabling attacks that flip refusal on 80-100% of HarmBench prompts across five models, with DPO failing to close the gap stably and SFT causing over-refusal.

LBPE : Long-token-first Tokenization to Improve Large Language Models

fields

years

verdicts

representative citing papers

citing papers explorer