Appendix - Recommended Statistical Significance Tests for NLP Tasks

Roi Reichart; Rotem Dror

arxiv: 1809.01448 · v1 · pith:U3NPTDEEnew · submitted 2018-09-05 · 💻 cs.CL

keywords: statistical, significance, appendix, tasks, testing, tests, when, algorithm

Statistical significance testing plays an important role when drawing conclusions from experimental results in NLP papers. In particular, it is a valuable tool for establishing the superiority of one algorithm over another. This appendix complements the guide to statistical significance testing in NLP presented by Dror et al. (2018) by proposing valid statistical tests for the common tasks and evaluation measures in the field.
