arXiv preprint arXiv:2312.03718

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Measuring & Mitigating Over-Alignment for LLMs in Multilingual Criminal Law Courts

cs.CL · 2026-06-22 · unverdicted · novelty 6.0

Creates TF-RefusalBench to quantify over-alignment in LLMs on criminal-law tasks across four languages and shows abliteration mitigates refusals with little performance loss.

A Survey on Knowledge Distillation of Large Language Models

cs.CL · 2024-02-20 · accept · novelty 3.0

A comprehensive survey of knowledge distillation for LLMs structured around algorithms, skill enhancement, and vertical applications, highlighting data augmentation as a key enabler.

citing papers explorer

Showing 2 of 2 citing papers.

Measuring & Mitigating Over-Alignment for LLMs in Multilingual Criminal Law Courts · cs.CL · 2026-06-22 · unverdicted · none · ref 28
A Survey on Knowledge Distillation of Large Language Models · cs.CL · 2024-02-20 · accept · none · ref 175
