Title resolution pending

HellaSwag: Can a Machine Really Finish Your Sentence? , author=

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

browse 4 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Delta Attention Residuals

cs.LG · 2026-05-13 · unverdicted · novelty 7.0

Delta Attention Residuals attend over per-sublayer deltas instead of cumulative hidden states, producing higher-contrast attention weights and 1.7-8.2% validation perplexity gains over standard and attention residuals across 220M-7.6B models.

Exploring Language-Agnosticity in Function Vectors: A Case Study in Machine Translation

cs.CL · 2026-04-21 · unverdicted · novelty 7.0

Translation function vectors extracted from English to one target language improve correct token ranking for translations to multiple other unseen target languages in decoder-only multilingual LLMs.

SocialIQA: Commonsense Reasoning about Social Interactions

cs.CL · 2019-04-22 · unverdicted · novelty 7.0

SocialIQA is the first large-scale benchmark with 38k crowdsourced questions testing commonsense about social interactions, where pretrained language models trail humans by over 20% but transfer to improve performance on Winograd Schemas and COPA.

River-LLM: Large Language Model Seamless Exit Based on KV Share

cs.CL · 2026-04-20

citing papers explorer

Showing 4 of 4 citing papers.

Delta Attention Residuals cs.LG · 2026-05-13 · unverdicted · none · ref 11
Delta Attention Residuals attend over per-sublayer deltas instead of cumulative hidden states, producing higher-contrast attention weights and 1.7-8.2% validation perplexity gains over standard and attention residuals across 220M-7.6B models.
Exploring Language-Agnosticity in Function Vectors: A Case Study in Machine Translation cs.CL · 2026-04-21 · unverdicted · none · ref 27
Translation function vectors extracted from English to one target language improve correct token ranking for translations to multiple other unseen target languages in decoder-only multilingual LLMs.
SocialIQA: Commonsense Reasoning about Social Interactions cs.CL · 2019-04-22 · unverdicted · none · ref 98
SocialIQA is the first large-scale benchmark with 38k crowdsourced questions testing commonsense about social interactions, where pretrained language models trail humans by over 20% but transfer to improve performance on Winograd Schemas and COPA.
River-LLM: Large Language Model Seamless Exit Based on KV Share cs.CL · 2026-04-20 · unreviewed · ref 7

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer