Reinforcement learning with imitation learning and reward shaping improves online workload shifting in a one-turbine one-data-center simulation but remains below an offline optimizer that sees the full day.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.LG 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Toward an Energy-Optimized Operation of Data Centers Located in Wind Farms Using Reinforcement Learning
Reinforcement learning with imitation learning and reward shaping improves online workload shifting in a one-turbine one-data-center simulation but remains below an offline optimizer that sees the full day.