pith. sign in

Title resolution pending

15 Pith papers cite this work. Polarity classification is still indexing.

15 Pith papers citing it

citation-role summary

background 2 baseline 1

citation-polarity summary

years

2026 14 2025 1

clear filters

representative citing papers

Multi-component Causal Tracing in Large Language Models

cs.LG · 2026-06-02 · unverdicted · novelty 6.0

A unified multi-component causal tracing method that uses soft interventions and a metric transformation to efficiently select critical LLM components for a target performance metric.

Contribution Weights: A Geometrical Analysis of Self-Attention Transformers

cs.LG · 2026-05-29 · unverdicted · novelty 6.0 · 2 refs

Contribution Weights combine attention, value magnitude, and directional alignment to measure token influence more faithfully than attention alone, and show attention sinks actively suppress information via a convex sink-rate to output-norm relationship.

Instructions Shape Production of Language, not Processing

cs.CL · 2026-05-11 · unverdicted · novelty 6.0 · 2 refs

Instructions trigger a production-centered mechanism in language models, with task-specific information stable in input tokens but varying strongly in output tokens and correlating with behavior.

citing papers explorer

Showing 15 of 15 citing papers.