pith. sign in

arxiv: 1712.03686 · v2 · pith:QULGGJTLnew · submitted 2017-12-11 · 📊 stat.AP · cs.CV· cs.GR

A practical guide and software for analysing pairwise comparison experiments

classification 📊 stat.AP cs.CVcs.GR
keywords scalecomparisondatajudgmentsmethodspairwiseanalysisintroducing
0
0 comments X
read the original abstract

Most popular strategies to capture subjective judgments from humans involve the construction of a unidimensional relative measurement scale, representing order preferences or judgments about a set of objects or conditions. This information is generally captured by means of direct scoring, either in the form of a Likert or cardinal scale, or by comparative judgments in pairs or sets. In this sense, the use of pairwise comparisons is becoming increasingly popular because of the simplicity of this experimental procedure. However, this strategy requires non-trivial data analysis to aggregate the comparison ranks into a quality scale and analyse the results, in order to take full advantage of the collected data. This paper explains the process of translating pairwise comparison data into a measurement scale, discusses the benefits and limitations of such scaling methods and introduces a publicly available software in Matlab. We improve on existing scaling methods by introducing outlier analysis, providing methods for computing confidence intervals and statistical testing and introducing a prior, which reduces estimation error when the number of observers is low. Most of our examples focus on image quality assessment.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Dynamic resolution switching for live streaming

    eess.IV 2026-05 unverdicted novelty 5.0

    Presents DRS framework for live streaming with a lightweight bitstream VQM trained on pairwise comparison data to dynamically select optimal resolutions per segment, reporting ~9% BD-rate reduction.