Individual choice behavior: A theoretical analysis

R Duncan Luce · 2005

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration

cs.LG · 2025-06-02 · unverdicted · novelty 6.0

Variance-aware neural dueling bandit algorithms achieve sublinear regret of order O(d sqrt(sum sigma_t^2) + sqrt(d T)) for wide networks on nonlinear utilities.

Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive

cs.CL · 2024-02-20 · conditional · novelty 6.0

DPOP is a new loss function that prevents DPO from lowering preferred response likelihoods and outperforms standard DPO on diverse datasets, MT-Bench, and enables Smaug-72B to exceed 80% on the Open LLM Leaderboard.

citing papers explorer

Showing 2 of 2 citing papers.

Neural Variance-aware Dueling Bandits with Deep Representation and Shallow Exploration cs.LG · 2025-06-02 · unverdicted · none · ref 23
Variance-aware neural dueling bandit algorithms achieve sublinear regret of order O(d sqrt(sum sigma_t^2) + sqrt(d T)) for wide networks on nonlinear utilities.
Smaug: Fixing Failure Modes of Preference Optimisation with DPO-Positive cs.CL · 2024-02-20 · conditional · none · ref 36
DPOP is a new loss function that prevents DPO from lowering preferred response likelihoods and outperforms standard DPO on diverse datasets, MT-Bench, and enables Smaug-72B to exceed 80% on the Open LLM Leaderboard.

Individual choice behavior: A theoretical analysis

fields

years

verdicts

representative citing papers

citing papers explorer