Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

· 2025 · arXiv 2505.18658

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Robust Biomedical Publication Type and Study Design Classification with Knowledge-Guided Perturbations

cs.CL · 2026-05-12 · unverdicted · novelty 5.0

Controlled semantic perturbations and selective robustness training with entity masking and adversarial objectives mitigate the typical robustness-accuracy trade-off in publication type and study design classification.

Sentra-Guard: A Real-Time Multilingual Defense Against Adversarial LLM Prompts

cs.CR · 2025-10-26 · unverdicted · novelty 4.0

Sentra-Guard reports 99.96% detection of adversarial LLM prompts with AUC 1.00 and ASR of 0.004% using a hybrid SBERT-FAISS and transformer classifier architecture with multilingual translation and human feedback.

citing papers explorer

Showing 2 of 2 citing papers.

Robust Biomedical Publication Type and Study Design Classification with Knowledge-Guided Perturbations cs.CL · 2026-05-12 · unverdicted · none · ref 38
Controlled semantic perturbations and selective robustness training with entity masking and adversarial objectives mitigate the typical robustness-accuracy trade-off in publication type and study design classification.
Sentra-Guard: A Real-Time Multilingual Defense Against Adversarial LLM Prompts cs.CR · 2025-10-26 · unverdicted · none · ref 9
Sentra-Guard reports 99.96% detection of adversarial LLM prompts with AUC 1.00 and ASR of 0.004% using a hybrid SBERT-FAISS and transformer classifier architecture with multilingual translation and human feedback.

Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer