Title resolution pending

meta-math, “Metamathqa,” Hugging Face Datasets · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Hard Negative Sample-Augmented DPO Post-Training for Small Language Models

cs.LG · 2025-12-17 · unverdicted · novelty 5.0

A six-dimensional MathVerifier supplies hard negatives and per-sample weights that improve DPO performance on math reasoning for a 1.5B Qwen2.5 model over standard SFT and unweighted DPO.

citing papers explorer

Showing 1 of 1 citing paper.

Hard Negative Sample-Augmented DPO Post-Training for Small Language Models cs.LG · 2025-12-17 · unverdicted · none · ref 18
A six-dimensional MathVerifier supplies hard negatives and per-sample weights that improve DPO performance on math reasoning for a 1.5B Qwen2.5 model over standard SFT and unweighted DPO.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer