pith. sign in

some" task with SSA for model trained from scratch Figure 6: Heatmap showing the evolution of errors for the task

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.CL 1

years

2025 1

verdicts

UNVERDICTED 1

representative citing papers

SSA: Improving Performance With a Better Scoring Function

cs.CL · 2025-08-20 · unverdicted · novelty 5.0

Replacing Softmax with Scaled Signed Averaging in transformer attention improves generalization under distribution shifts for in-context learning and boosts results on NLP benchmarks.

citing papers explorer

Showing 1 of 1 citing paper.

  • SSA: Improving Performance With a Better Scoring Function cs.CL · 2025-08-20 · unverdicted · none · ref 19

    Replacing Softmax with Scaled Signed Averaging in transformer attention improves generalization under distribution shifts for in-context learning and boosts results on NLP benchmarks.