Title resolution pending

Proceedings of Machine Learning Research · 2026 · arXiv 2603.24361

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

DGLight: DQN-Guided GRPO Fine-Tuning of Large Language Models for Traffic Signal Control

cs.LG · 2026-04-28 · unverdicted · novelty 4.0

DGLight uses a frozen CoLight DQN critic to score LLM-generated actions and optimize the policy via GRPO, yielding the strongest LLM-based traffic signal controller on Jinan and Hangzhou benchmarks while remaining competitive with RL baselines.

citing papers explorer

Showing 1 of 1 citing paper.

DGLight: DQN-Guided GRPO Fine-Tuning of Large Language Models for Traffic Signal Control cs.LG · 2026-04-28 · unverdicted · none · ref 2
DGLight uses a frozen CoLight DQN critic to score LLM-generated actions and optimize the policy via GRPO, yielding the strongest LLM-based traffic signal controller on Jinan and Hangzhou benchmarks while remaining competitive with RL baselines.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer