Designing and interpreting probes with control tasks, 2019

John Hewitt, Percy Liang · 2019

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs

cs.AI · 2026-05-13 · unverdicted · novelty 7.0

Omnimodal LLMs encode premise-perception mismatches in hidden states yet almost never reject false textual claims, exposing a representation-action gap that is modality-asymmetric and prompt-resistant.

From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step

cs.CL · 2024-05-23 · conditional · novelty 6.0

Gradual fine-tuning that removes explicit CoT steps lets GPT-2 Small reach 99% accuracy on 9x9 multiplication and Mistral 7B exceed 50% on GSM8K with no intermediate outputs.

citing papers explorer

Showing 2 of 2 citing papers.

Senses Wide Shut: A Representation-Action Gap in Omnimodal LLMs · cs.AI · 2026-05-13 · unverdicted · none · ref 44
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by Step · cs.CL · 2024-05-23 · conditional · none · ref 8
