OLMo: Accelerating the science of language models

16 Pith papers cite this work, alongside 57 external citations. Polarity classification is still indexing.

16 Pith papers citing it
57 external citations · Crossref

citation-role summary

background 1

citation-polarity summary

years

2026: 14 · 2025: 2

roles

background 1

polarities

background 1

representative citing papers

Disentangling MLP Neuron Weights in Vocabulary Space

cs.CL · 2026-04-07 · unverdicted · novelty 8.0

ROTATE disentangles MLP neurons into faithful vocabulary channels by optimizing weight rotations to maximize vocabulary-space kurtosis, outperforming activation-based baselines for neuron descriptions.

BOOKMARKS: Efficient Active Storyline Memory for Role-playing

cs.CL · 2026-05-13 · unverdicted · novelty 7.0

BOOKMARKS introduces searchable bookmarks as reusable answers to storyline questions, enabling active initialization and passive synchronization for more consistent role-playing agent memory than recurrent summarization.

ToxiREX: A Dataset on Toxic REasoning in ConteXt

cs.CL · 2026-06-26 · unverdicted · novelty 6.0

ToxiREX is a new dataset of 128k Reddit comments in six languages with hierarchical annotations for implicit toxicity in conversational context based on an existing reasoning schema.

Variable-Width Transformers

cs.CL · 2026-06-16 · conditional · novelty 6.0

×-shaped variable-width transformers outperform parameter-matched uniform baselines on language modeling loss with 22% fewer FLOPs and 15% smaller KV cache.

Unifying Local Communications and Local Updates for LLM Pretraining

cs.LG · 2026-06-09 · unverdicted · novelty 6.0

GASLoC generalizes communication acceleration to the outer optimizer, enabling gossip-based decentralized LLM pretraining that supports adaptive optimizers and local steps. It outperforms prior decentralized methods on standard tasks while matching DiLoCo in multi-step regimes.

citing papers explorer

Showing 16 of 16 citing papers.