AIGP combines LLMs with offline RL and DPO to produce interpretable pricing policies that improved GMV by 13.21%, ROI by 7.59%, and milestone achievement by 8.20% in 14-day online tests versus baseline.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
AIGP: An LLM-Based Framework for Long-Term Value Alignment in E-Commerce Pricing
AIGP combines LLMs with offline RL and DPO to produce interpretable pricing policies that improved GMV by 13.21%, ROI by 7.59%, and milestone achievement by 8.20% in 14-day online tests versus baseline.