Question interpretation diversity outperforms model diversity for LLM ensembling on binary QA tasks using majority voting.
Rain gauges are used to collect and measure precipitation, and the size of the gauge can affect the accuracy of the measure- ment
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Diverse LLMs or Diverse Question Interpretations? That is the Ensembling Question
Question interpretation diversity outperforms model diversity for LLM ensembling on binary QA tasks using majority voting.