Then max q∈∆V−1 F(q) = 11 √ 33−59 768 ≤0.00546, and the maximizer is attained by a vector with qi =q j = 9− √ 33 24 ,all remaining mass1−2q i placed on one coordinate

For fixed distincti̸=j, consider F(q) :=q iqj −qi −q j +∥q∥ 2

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum

cs.CL · 2025-10-01 · unverdicted · novelty 5.0

Prior-leaning probability objectives outperform NLL for strong base models on SFT while NLL dominates for weak models, with the switch governed by a model-capability continuum.

citing papers explorer

Showing 1 of 1 citing paper.

Beyond Log Likelihood: Probability-Based Objectives for Supervised Fine-Tuning across the Model Capability Continuum cs.CL · 2025-10-01 · unverdicted · none · ref 2
Prior-leaning probability objectives outperform NLL for strong base models on SFT while NLL dominates for weak models, with the switch governed by a model-capability continuum.

Then max q∈∆V−1 F(q) = 11 √ 33−59 768 ≤0.00546, and the maximizer is attained by a vector with qi =q j = 9− √ 33 24 ,all remaining mass1−2q i placed on one coordinate

fields

years

verdicts

representative citing papers

citing papers explorer