• Essay: Circular reasoning with no progressive analysis (e.g., merely listing pros and cons without further comparative sum- maries); merits a 1–2 point deduc- tion

Content Limitations in Argumentative or Fiction Writing • Fiction: Lack of detailed portrayal of characters or psychological depth, relying solely on surface-level narrative

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

HoWToBench: Holistic Evaluation for LLM's Capability in Human-level Writing using Tree of Writing

cs.CL · 2026-04-21 · unverdicted · novelty 6.0

Tree-of-Writing achieves 0.93 Pearson correlation with human judgments by using a tree-structured workflow to aggregate sub-feature scores, outperforming standard LLM-as-a-judge and overlap metrics on the new HowToBench.

citing papers explorer

Showing 1 of 1 citing paper after filters.

HoWToBench: Holistic Evaluation for LLM's Capability in Human-level Writing using Tree of Writing cs.CL · 2026-04-21 · unverdicted · none · ref 127
Tree-of-Writing achieves 0.93 Pearson correlation with human judgments by using a tree-structured workflow to aggregate sub-feature scores, outperforming standard LLM-as-a-judge and overlap metrics on the new HowToBench.

• Essay: Circular reasoning with no progressive analysis (e.g., merely listing pros and cons without further comparative sum- maries); merits a 1–2 point deduc- tion

fields

years

verdicts

representative citing papers

citing papers explorer