arXiv preprint arXiv:2504.16918 (2025)

Thind, R · 2025 · arXiv 2504.16918

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1 baseline 1

citation-polarity summary

background 1 baseline 1

representative citing papers

PARM: Pipeline-Adapted Reward Model

cs.AI · 2026-04-20 · unverdicted · novelty 6.0

PARM adapts reward models to multi-stage LLM pipelines via pipeline data and direct preference optimization, improving execution rate and solving accuracy on optimization benchmarks and showing transfer to GSM8K.

AgenticRecTune: Multi-Agent with Self-Evolving Skillhub for Recommendation System Optimization

cs.IR · 2026-04-21 · unverdicted · novelty 5.0 · 2 refs

AgenticRecTune deploys five LLM agents (Actor, Critic, Insight, Skill, Online) and a self-evolving Skillhub to handle end-to-end configuration optimization for multi-stage recommendation systems.

Large Language Models for Operations Research: A Comprehensive Survey

math.OC · 2026-05-20 · unverdicted · novelty 2.0

A survey compiling roles, applications, benchmarks, challenges, and future directions for large language models in operations research.

citing papers explorer

Showing 3 of 3 citing papers.

PARM: Pipeline-Adapted Reward Model cs.AI · 2026-04-20 · unverdicted · none · ref 36
PARM adapts reward models to multi-stage LLM pipelines via pipeline data and direct preference optimization, improving execution rate and solving accuracy on optimization benchmarks and showing transfer to GSM8K.
AgenticRecTune: Multi-Agent with Self-Evolving Skillhub for Recommendation System Optimization cs.IR · 2026-04-21 · unverdicted · none · ref 16 · 2 links
AgenticRecTune deploys five LLM agents (Actor, Critic, Insight, Skill, Online) and a self-evolving Skillhub to handle end-to-end configuration optimization for multi-stage recommendation systems.
Large Language Models for Operations Research: A Comprehensive Survey math.OC · 2026-05-20 · unverdicted · none · ref 45
A survey compiling roles, applications, benchmarks, challenges, and future directions for large language models in operations research.

arXiv preprint arXiv:2504.16918 (2025)

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer