These hypotheses are based on the user’s writing style and tone, as well as the topics and themes they tend to explore in their writing

The user is empathetic and understanding, often using phrases and sentences that convey a sense of shared experience and camaraderie, and may use rhetorical devices such as rhetori

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

Personalized Benchmarking: Evaluating LLMs by Individual Preferences

cs.AI · 2026-04-21 · unverdicted · novelty 6.0

Personalized LLM rankings using Elo and Bradley-Terry on 115 users show low correlation with aggregate rankings (BT ρ=0.04), highlighting the need for user-specific benchmarks.

citing papers explorer

Personalized Benchmarking: Evaluating LLMs by Individual Preferences cs.AI · 2026-04-21 · unverdicted · none · ref 69