arXiv preprint arXiv:2006.08198 , year=

· 2006 · arXiv 2006.08198

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Uncertainty-Aware End-to-End Co-Design of Neural Network Processors: From Training and Mapping to Fabrication

cs.LG · 2026-06-03 · unverdicted · novelty 6.0

A monotone co-design framework for neural network processors that treats uncertainty via Confidence as a tunable resource and allows modular block refinement without structural changes.

Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

cs.CL · 2024-02-20 · conditional · novelty 6.0

DPOP is a new loss function that prevents DPO from lowering preferred response likelihoods and outperforms standard DPO on diverse datasets, MT-Bench, and enables Smaug-72B to exceed 80% on the Open LLM Leaderboard.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive cs.CL · 2024-02-20 · conditional · none · ref 172
DPOP is a new loss function that prevents DPO from lowering preferred response likelihoods and outperforms standard DPO on diverse datasets, MT-Bench, and enables Smaug-72B to exceed 80% on the Open LLM Leaderboard.

arXiv preprint arXiv:2006.08198 , year=

fields

years

verdicts

representative citing papers

citing papers explorer