URL: https://arxiv.org/abs/2407.14467

Check-Eval: A checklist-based approach for evaluating text quality · 2024 · arXiv 2407.14467

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Adaptive Cost-Efficient Evaluation for Reliable Patent Claim Validation

cs.CL · 2026-04-05 · conditional · novelty 7.0

ACE achieves 94.95% F1 on patent claim validation by routing high-entropy cases to an LLM with CoPT reasoning, cutting costs 78% versus full LLM use, with the threshold transferring to real USPTO rejections.
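The routing idea in the summary above can be illustrated with a minimal sketch. It assumes a cheap classifier that outputs a probability distribution over labels, and it escalates to an LLM only when the Shannon entropy of that distribution exceeds a threshold. The names `shannon_entropy`, `route`, and `escalate` are hypothetical and are not taken from the ACE paper; the actual entropy measure, threshold, and CoPT prompting are not specified in this listing.

```python
import math
from typing import Callable, Sequence, Tuple


def shannon_entropy(probs: Sequence[float]) -> float:
    """Shannon entropy, in bits, of a discrete probability distribution."""
    return -sum(p * math.log2(p) for p in probs if p > 0)


def route(
    probs: Sequence[float],
    threshold: float,
    escalate: Callable[[], int],
) -> Tuple[int, bool]:
    """Return (label, escalated).

    `probs` is the cheap classifier's distribution over labels.
    `escalate` is a hypothetical stand-in for the expensive LLM call;
    it is invoked only when the prediction is uncertain.
    """
    if shannon_entropy(probs) > threshold:
        return escalate(), True
    # Confident case: keep the cheap classifier's most probable label.
    best = max(range(len(probs)), key=lambda i: probs[i])
    return best, False


if __name__ == "__main__":
    fake_llm = lambda: 1  # placeholder for an LLM-backed judge
    print(route([0.99, 0.01], threshold=0.5, escalate=fake_llm))
    print(route([0.50, 0.50], threshold=0.5, escalate=fake_llm))
```

Under these assumptions, the cost saving comes from how rarely `escalate` is called, so the threshold trades accuracy against LLM usage.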

RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents

cs.CL · 2026-04-13 · unverdicted · novelty 6.0

RPA-Check is a new multi-stage framework using dimension definition, boolean checklist augmentation, semantic filtering, and LLM-as-judge verification to assess role-playing agents, with tests on a legal training game showing smaller instruction-tuned models can be more consistent than larger ones.
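As a rough illustration of the boolean-checklist idea mentioned above, the sketch below scores a response as the fraction of yes/no checklist items that a judge marks as satisfied. This is an assumption-laden toy, not the RPA-Check pipeline: `checklist_score` and `keyword_judge` are hypothetical names, and the keyword match stands in for an LLM-as-judge call.

```python
from typing import Callable, Sequence


def checklist_score(
    response: str,
    checklist: Sequence[str],
    judge: Callable[[str, str], bool],
) -> float:
    """Fraction of checklist items that `judge` marks as satisfied."""
    if not checklist:
        raise ValueError("checklist must not be empty")
    passed = sum(1 for item in checklist if judge(response, item))
    return passed / len(checklist)


def keyword_judge(response: str, item: str) -> bool:
    """Toy stand-in for an LLM judge: case-insensitive substring match."""
    return item.lower() in response.lower()


if __name__ == "__main__":
    items = ["contract", "parties", "deadline"]
    text = "The contract names both parties."
    print(checklist_score(text, items, keyword_judge))
```

Because each item yields a boolean verdict, repeated runs of the same judge can be compared item by item, which is one way consistency across models could be measured.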

citing papers explorer

Showing 2 of 2 citing papers.

Adaptive Cost-Efficient Evaluation for Reliable Patent Claim Validation · cs.CL · 2026-04-05 · conditional · none · ref 1
RPA-Check: A Multi-Stage Automated Framework for Evaluating Dynamic LLM-based Role-Playing Agents · cs.CL · 2026-04-13 · unverdicted · none · ref 15
