Title resolution pending

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Transformer Feed-Forward Layers Are Key-Value Memories

cs.CL · 2020-12-29 · conditional · novelty 8.0

Transformer feed-forward layers act as key-value memories storing textual patterns and their associated output distributions.

Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators

cs.LG · 2024-04-06 · conditional · novelty 6.0

Length-controlled AlpacaEval applies regression adjustment to remove length bias from LLM auto-evaluations, raising Spearman correlation with Chatbot Arena from 0.94 to 0.98.

Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization

cs.LG · 2019-11-20 · conditional · novelty 6.0

Increased regularization is required for group DRO to achieve good worst-group generalization in overparameterized neural networks.

citing papers explorer

Showing 3 of 3 citing papers.

Transformer Feed-Forward Layers Are Key-Value Memories cs.CL · 2020-12-29 · conditional · none · ref 47
Transformer feed-forward layers act as key-value memories storing textual patterns and their associated output distributions.
Length-Controlled AlpacaEval: A Simple Way to Debias Automatic Evaluators cs.LG · 2024-04-06 · conditional · none · ref 164
Length-controlled AlpacaEval applies regression adjustment to remove length bias from LLM auto-evaluations, raising Spearman correlation with Chatbot Arena from 0.94 to 0.98.
Distributionally Robust Neural Networks for Group Shifts: On the Importance of Regularization for Worst-Case Generalization cs.LG · 2019-11-20 · conditional · none · ref 286
Increased regularization is required for group DRO to achieve good worst-group generalization in overparameterized neural networks.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer