Weighted kappa: Nominal scale agreement with provision for scaled disagreement or partial credit

Jacob Cohen · 1968 · DOI 10.1037/h0026256

3 Pith papers cite this work. Polarity classification is still being indexed.

citation-role summary

method 1

citation-polarity summary

use method 1

representative citing papers

ProactBench: Beyond What The User Asked For

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

ProactBench measures LLM conversational proactivity in three phases using 198 multi-agent dialogues and finds recovery behavior hard to predict from existing benchmarks.

MIRA: An LLM-Assisted Benchmark for Multi-Category Integrated Retrieval

cs.IR · 2026-05-11 · unverdicted · novelty 6.0

MIRA is a new benchmark for multi-category integrated retrieval built from real queries on a social science platform, with LLM assistance for topic descriptions and relevance labeling across four item categories.

ArguAgent: AI-Supported Real-Time Grouping for Productive Argumentation in STEM Classrooms

cs.AI · 2026-04-25 · unverdicted · novelty 4.0

ArguAgent scores arguments via AI, clusters stances, and forms groups with stance variety but argumentation quality within one level, validated at expert alpha 0.817 and 95.4% success in simulations.

citing papers explorer

Showing 3 of 3 citing papers.

ProactBench: Beyond What The User Asked For

cs.LG · 2026-05-09 · unverdicted · none · ref 102

MIRA: An LLM-Assisted Benchmark for Multi-Category Integrated Retrieval

cs.IR · 2026-05-11 · unverdicted · none · ref 14

ArguAgent: AI-Supported Real-Time Grouping for Productive Argumentation in STEM Classrooms

cs.AI · 2026-04-25 · unverdicted · none · ref 5
