LLM4Branch discovers branching policies for MILP solvers as LLM-generated executable programs whose parameters are tuned via zeroth-order optimization on solver performance.
Deepseek-r1 incentivizes reasoning in llms through reinforcement learning
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
SelectiveRM applies optimal transport with a joint consistency discrepancy and partial mass relaxation to produce reward models that optimize a tighter upper bound on clean risk while autonomously dropping noisy preference samples.
TRUST is a decentralized AI auditing framework that decomposes reasoning into HDAGs, maps agent interactions via the DAAN protocol to CIGs, and uses stake-weighted multi-tier consensus to achieve 72.4% accuracy while proving a Safety-Profitability Theorem that rewards honest auditors.
citing papers explorer
-
LLM4Branch: Large Language Model for Discovering Efficient Branching Policies of Integer Programs
LLM4Branch discovers branching policies for MILP solvers as LLM-generated executable programs whose parameters are tuned via zeroth-order optimization on solver performance.
-
TRUST: A Framework for Decentralized AI Service v.0.1
TRUST is a decentralized AI auditing framework that decomposes reasoning into HDAGs, maps agent interactions via the DAAN protocol to CIGs, and uses stake-weighted multi-tier consensus to achieve 72.4% accuracy while proving a Safety-Profitability Theorem that rewards honest auditors.