Show Your Work: Improved Reporting of Experimental Results

Dodge, Jesse, Gururangan, Suchin, Card, Dallas, Schwartz, Roy, Smith, Noah A · 2019 · DOI 10.18653/v1/d19-1224

4 Pith papers cite this work. Polarity classification is still being indexed.

citation-role summary

background 1

citation-polarity summary

support 1

representative citing papers

Hidden in the Multiplicative Interaction: Uncovering Fragility in Multimodal Contrastive Learning

cs.LG · 2026-04-07 · unverdicted · novelty 7.0

Multimodal contrastive learning using multilinear products is fragile to single bad modalities, and a gated version improves top-1 retrieval accuracy on synthetic and real trimodal data.

More Context, Larger Models, or Moral Knowledge? A Systematic Study of Schwartz Value Detection in Political Texts

cs.CL · 2026-05-21 · unverdicted · novelty 6.0 · 2 refs

Moral knowledge retrieval improves Schwartz value detection more consistently than added context or larger models across tested conditions and model families.

Improving Reproducibility in Evaluation through Multi-Level Annotator Modeling

cs.LG · 2026-05-13 · unverdicted · novelty 5.0

Multi-level bootstrapping models annotator variance using large rater-ID datasets to find optimal tradeoffs between number of items N and ratings per item K for statistically significant AI evaluations.

Human-aligned AI Model Cards with Weighted Hierarchy Architecture

cs.SE · 2025-10-08 · unverdicted · novelty 4.0

Introduces CRAI-MCF, an eight-module framework distilling 217 parameters from 240 projects into a quantitative sufficiency criterion for cross-model LLM comparison grounded in Value Sensitive Design.

citing papers explorer

Showing 4 of 4 citing papers.

Hidden in the Multiplicative Interaction: Uncovering Fragility in Multimodal Contrastive Learning

cs.LG · 2026-04-07 · unverdicted · none · ref 15

Multimodal contrastive learning using multilinear products is fragile to single bad modalities, and a gated version improves top-1 retrieval accuracy on synthetic and real trimodal data.

More Context, Larger Models, or Moral Knowledge? A Systematic Study of Schwartz Value Detection in Political Texts

cs.CL · 2026-05-21 · unverdicted · none · ref 49 · 2 links

Moral knowledge retrieval improves Schwartz value detection more consistently than added context or larger models across tested conditions and model families.

Improving Reproducibility in Evaluation through Multi-Level Annotator Modeling

cs.LG · 2026-05-13 · unverdicted · none · ref 10

Multi-level bootstrapping models annotator variance using large rater-ID datasets to find optimal tradeoffs between number of items N and ratings per item K for statistically significant AI evaluations.

Human-aligned AI Model Cards with Weighted Hierarchy Architecture

cs.SE · 2025-10-08 · unverdicted · none · ref 14

Introduces CRAI-MCF, an eight-module framework distilling 217 parameters from 240 projects into a quantitative sufficiency criterion for cross-model LLM comparison grounded in Value Sensitive Design.
