HydroAgent fine-tunes Qwen3-4B on 2,576 expert calibration trajectories and applies Group-Relative Policy Optimization with NSE reward from live CREST simulations to improve hydrologic model calibration over frontier LLMs.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
HydroAgent: Closing the Gap Between Frontier LLMs and Human Experts in Hydrologic Model Calibration via Simulator-Grounded RL
HydroAgent fine-tunes Qwen3-4B on 2,576 expert calibration trajectories and applies Group-Relative Policy Optimization with NSE reward from live CREST simulations to improve hydrologic model calibration over frontier LLMs.