Neural trojans · 2017

2 Pith papers cite this work. Polarity classification is still in progress.

representative citing papers

Imitation Game for Adversarial Disillusion with Chain-of-Thought Reasoning in Generative AI

cs.AI · 2025-01-31 · unverdicted · novelty 5.0

A new defense framework called the disillusion paradigm uses an imitation game and chain-of-thought reasoning in generative agents to neutralize deductive and inductive adversarial illusions in white-box and black-box scenarios.

Hypnopaedia-Aware Machine Unlearning via Psychometrics of Artificial Mental Imagery

cs.CR · 2024-09-29 · unverdicted · novelty 3.0

Proposes a self-aware unlearning method inspired by hypnopaedia that uses model inversion and hypothesis testing to detect and detach backdoor triggers from machine learning models.

citing papers explorer

Showing 2 of 2 citing papers.

Imitation Game for Adversarial Disillusion with Chain-of-Thought Reasoning in Generative AI

cs.AI · 2025-01-31 · unverdicted · none · ref 18

Hypnopaedia-Aware Machine Unlearning via Psychometrics of Artificial Mental Imagery

cs.CR · 2024-09-29 · unverdicted · none · ref 51