arxiv: 1902.09467 · v1 · pith:CSVP7URCnew · submitted 2019-01-24 · 📡 eess.SP · cs.IT · cs.NI · cs.SI · math.IT

Reinforcement Learning to Minimize Age of Information with an Energy Harvesting Sensor with HARQ and Sensing Cost

classification 📡 eess.SP · cs.IT · cs.NI · cs.SI · math.IT
keywords energy · harvesting · information · learning · policy · proposed · reinforcement · status

The time-average expected age of information (AoI) is studied for status updates sent from an energy-harvesting transmitter with a finite-capacity battery. The optimal scheduling policy is first studied under different feedback mechanisms when the channel and energy harvesting statistics are known. For the case of unknown environments, an average-cost reinforcement learning algorithm is proposed that learns the system parameters and the status update policy in real time. The effectiveness of the proposed methods is verified through numerical results.
