A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20(1):37–46, 1960

Jacob Cohen · 1960

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

browse 3 citing papers

representative citing papers

VISTAQA: Benchmarking Joint Visual Question Answering and Pixel-Level Evidence

cs.CV · 2026-05-20 · unverdicted · novelty 7.0

VISTAQA is a new benchmark for joint visual question answering correctness and pixel-level grounding, evaluated with the GROVE metric that uses per-sample geometric mean to require both dimensions to succeed.

Interpretable Discriminative Text Representations via Agreement and Label Disentanglement

cs.CL · 2026-05-20 · unverdicted · novelty 6.0

LFD discovers predictive text features via LLM contrastive proposals, cross-LLM Cohen's kappa screening, and residual held-out gain selection, matching baseline accuracy while achieving higher human agreement and lower label leakage on ten tasks.

The CTI Echo Chamber: Fragmentation, Overlap, and Vendor Specificity in Twenty Years of Cyber Threat Reporting

cs.CR · 2026-02-19 · unverdicted · novelty 5.0

Large-scale LLM analysis of 16k CTI reports over 20 years shows a fragmented vendor ecosystem with low overlap and reporting biases.

citing papers explorer

Showing 3 of 3 citing papers.

VISTAQA: Benchmarking Joint Visual Question Answering and Pixel-Level Evidence cs.CV · 2026-05-20 · unverdicted · none · ref 7
VISTAQA is a new benchmark for joint visual question answering correctness and pixel-level grounding, evaluated with the GROVE metric that uses per-sample geometric mean to require both dimensions to succeed.
Interpretable Discriminative Text Representations via Agreement and Label Disentanglement cs.CL · 2026-05-20 · unverdicted · none · ref 5
LFD discovers predictive text features via LLM contrastive proposals, cross-LLM Cohen's kappa screening, and residual held-out gain selection, matching baseline accuracy while achieving higher human agreement and lower label leakage on ten tasks.
The CTI Echo Chamber: Fragmentation, Overlap, and Vendor Specificity in Twenty Years of Cyber Threat Reporting cs.CR · 2026-02-19 · unverdicted · none · ref 14
Large-scale LLM analysis of 16k CTI reports over 20 years shows a fragmented vendor ecosystem with low overlap and reporting biases.

A coefficient of agreement for nominal scales.Educational and Psychological Measurement, 20(1):37–46, 1960

fields

years

verdicts

representative citing papers

citing papers explorer