arXiv preprint arXiv:2305.13014 (2023)

Stefano De Paoli · 2023 · arXiv 2305.13014

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Can GPT-4o Evaluate Usability Like Human Experts? A Comparative Study on Issue Identification in Heuristic Evaluation

cs.HC · 2025-06-19 · unverdicted · novelty 6.0

GPT-4o identified only 21.2% of the usability issues found by human experts in heuristic evaluation. It also discovered 27 additional issues, had difficulty with certain heuristics, and generated false positives.

A Computational Method for Measuring "Open Codes" in Qualitative Analysis

cs.CL · 2024-11-19 · unverdicted · novelty 6.0

A method merges codebooks using an LLM and evaluates human and AI inductive coding with four new metrics on an online conversation dataset.

citing papers explorer

Showing 2 of 2 citing papers.

Can GPT-4o Evaluate Usability Like Human Experts? A Comparative Study on Issue Identification in Heuristic Evaluation
cs.HC · 2025-06-19 · unverdicted · none · ref 8
A Computational Method for Measuring "Open Codes" in Qualitative Analysis
cs.CL · 2024-11-19 · unverdicted · none · ref 17
