and Geiger, Atticus and Nanda, Neel

Tigges, Curt, Hollinsworth, Oskar J · 2024 · DOI 10.18653/v1/2024.blackboxnlp-1.5

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

The Grounding Gap: How LLMs Anchor the Meaning of Abstract Concepts Differently from Humans

cs.CL · 2026-05-09 · unverdicted · novelty 6.0

LLMs show a grounding gap with humans on abstract concepts, with property-generation correlations at most r=0.37 versus human-to-human r>0.9, though larger models align better on explicit rating tasks and internal SAE features capture some grounding dimensions.

On Emotion-Sensitive Decision Making of Small Language Model Agents

cs.AI · 2026-04-08 · unverdicted · novelty 6.0

Emotional perturbations induced via activation steering systematically alter strategic choices made by small language model agents in cooperative and competitive game templates, yet the resulting behaviors remain unstable and only partially aligned with human patterns.

citing papers explorer

Showing 2 of 2 citing papers.

The Grounding Gap: How LLMs Anchor the Meaning of Abstract Concepts Differently from Humans cs.CL · 2026-05-09 · unverdicted · none · ref 26
LLMs show a grounding gap with humans on abstract concepts, with property-generation correlations at most r=0.37 versus human-to-human r>0.9, though larger models align better on explicit rating tasks and internal SAE features capture some grounding dimensions.
On Emotion-Sensitive Decision Making of Small Language Model Agents cs.AI · 2026-04-08 · unverdicted · none · ref 3
Emotional perturbations induced via activation steering systematically alter strategic choices made by small language model agents in cooperative and competitive game templates, yet the resulting behaviors remain unstable and only partially aligned with human patterns.

and Geiger, Atticus and Nanda, Neel

fields

years

verdicts

representative citing papers

citing papers explorer