Towards evaluating the robustness of neural networks

Nicholas Carlini, David Wagner · 2017

2 Pith papers cite this work. Polarity classification is still being indexed.

representative citing papers

Revisiting Model Inversion Evaluation: From Misleading Standards to Reliable Privacy Assessment

cs.LG · 2025-05-06 · conditional · novelty 7.0

Standard model inversion evaluation counts many adversarial false positives as successes; MLLM-based evaluation reveals consistently high false-positive rates across 27 attack setups.

ROAST: Risk-aware Outlier-exposure for Adversarial Selective Training of Anomaly Detectors Against Evasion Attacks

cs.CR · 2026-03-27 · unverdicted · novelty 6.0

ROAST selectively trains anomaly detectors on less vulnerable patient data with targeted outlier exposure, boosting recall by 16.2% in black-box settings and reducing training time by 88.3%.

citing papers explorer

Revisiting Model Inversion Evaluation: From Misleading Standards to Reliable Privacy Assessment · cs.LG · 2025-05-06 · conditional · none · ref 2
ROAST: Risk-aware Outlier-exposure for Adversarial Selective Training of Anomaly Detectors Against Evasion Attacks · cs.CR · 2026-03-27 · unverdicted · none · ref 4
