It surpasses all existing open-source state-of-the-art models, showcasing the effectiveness and robustness of the RLEIF approach proposed in our study.

WizardMath demonstrates outstanding performance across a wide range of model scales, from 100M to 1B, 70B parameters, on benchmarks such as GSM8k and MATH, and on out-of-distribution (OOD) tasks like MWPBench (Tang et al., 2024

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct

cs.CL · 2023-08-18 · conditional · novelty 6.0

WizardMath applies RLEIF to produce open-source LLMs that reach new state-of-the-art math reasoning scores on GSM8k and MATH, with the 70B variant surpassing GPT-3.5-Turbo, Claude 2, Gemini Pro, and early GPT-4.
