Multiobjective Home Appliances Scheduling Considering Customer Thermal Discomfort: A Multistep Look-ahead ADP-Based Approach
Pith reviewed 2026-05-25 01:41 UTC · model grok-4.3
The pith
A multistep look-ahead ADP approach solves multiobjective home energy management to trade off electricity payment against thermal discomfort.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The MO_HEMU is formulated as a dynamic program and a method based on the approximated dynamic programming is used as the scheduling algorithm. The developed method, called multistep look ahead algorithm, is an iterative algorithm that overcomes the curse of dimensionality of the exact DP by choosing a decision based on a limited number of stages ahead. The effectiveness of the proposed model is investigated through numerical simulations. The proposed MO_HEMU enables customers to find the desired trade off between electricity payment and a discomfort level.
What carries the argument
Multistep look-ahead algorithm, an iterative approximation to dynamic programming that selects each decision by evaluating a limited future horizon instead of the full horizon.
If this is right
- Customers obtain explicit trade-off curves between electricity payment and thermal discomfort.
- The algorithm schedules solar PV, battery, deferrable and thermal appliances in response to time-varying prices.
- Thermal discomfort is modeled as temperature deviation for both space heating and hot water.
- The approach makes dynamic programming tractable for realistic household instances by limiting the look-ahead depth.
Where Pith is reading between the lines
- The same approximation technique might apply to other multiobjective stochastic control problems in energy systems.
- Varying the look-ahead horizon length could be used as a tunable parameter to adjust computation time versus solution quality.
- Incorporating uncertainty in user behavior or weather forecasts would be a natural next extension of the model.
Load-bearing premise
The multistep look-ahead approximation produces decisions close enough to the true optimal dynamic program that the reported cost-discomfort trade-offs remain meaningful.
What would settle it
Solve the exact dynamic program for a reduced problem with short time horizon and few devices, then check if the multistep look-ahead method recovers similar cost and discomfort values.
Figures
read the original abstract
This paper proposes a multiobjective home energy management unit (MO_HEMU) to balance the electricity payment and thermal discomfort of a household by properly scheduling devices in a time varying price environment. The thermal discomfort is measured by the deviation of indoor and hot water temperature from the users ideal temperature. The home devices include solar Photovoltaics, a battery storage system, deferrable, and thermal appliances. The proposed MOHEMU is formulated as a dynamic program and a method based on the approximated dynamic programming is used as the scheduling algorithm. The developed method, called multistep look ahead algorithm, is an iterative algorithm that overcomes the curse of dimensionality of the exact DP by choosing a decision based on a limited number of stages ahead. The effectiveness of the proposed model is investigated through numerical simulations. The proposed MO_HEMU enables customers to find the desired trade off between electricity payment and a discomfort level.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a multiobjective home energy management unit (MO_HEMU) to schedule home appliances including solar PV, battery storage, deferrable and thermal appliances under time-varying electricity prices. The goal is to balance electricity payment and thermal discomfort, where discomfort is quantified by deviations of indoor and hot water temperatures from user ideals. The problem is formulated as a finite-horizon stochastic dynamic program, solved approximately using a multistep look-ahead approximated dynamic programming (ADP) algorithm that is iterative and claimed to overcome the curse of dimensionality. Numerical simulations are presented to show the effectiveness, with the claim that the approach enables customers to achieve desired trade-offs between cost and discomfort.
Significance. If the multistep look-ahead ADP is validated to yield decisions close to the optimal dynamic programming solution, this work would offer a practical computational method for multi-objective scheduling in residential energy systems that accounts for customer thermal comfort preferences. Such methods could be relevant for implementing demand response in smart grids with renewable integration and storage. The explicit multiobjective formulation and use of ADP for thermal load scheduling represent a reasonable technical contribution, though the lack of approximation quality assessment reduces its current significance.
major comments (1)
- [Abstract] The claim that the proposed MO_HEMU enables customers to find the desired trade-off between electricity payment and discomfort level rests on the multistep look-ahead ADP producing decisions sufficiently close to the true optimal dynamic program. However, the manuscript provides no bound on the value-function approximation error, no convergence argument as the lookahead depth or iteration count increases, and no numerical comparison against an exactly solvable small instance. This is a load-bearing issue for the central claim, as the reported cost-discomfort trade-offs may be unreliable if the approximation introduces systematic bias.
minor comments (1)
- The abstract mentions 'numerical simulations' but does not specify the simulation setup, such as the time horizon, price signals, or thermal model parameters used.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address the major comment regarding the approximation quality of the multistep look-ahead ADP below.
read point-by-point responses
-
Referee: [Abstract] The claim that the proposed MO_HEMU enables customers to find the desired trade-off between electricity payment and discomfort level rests on the multistep look-ahead ADP producing decisions sufficiently close to the true optimal dynamic program. However, the manuscript provides no bound on the value-function approximation error, no convergence argument as the lookahead depth or iteration count increases, and no numerical comparison against an exactly solvable small instance. This is a load-bearing issue for the central claim, as the reported cost-discomfort trade-offs may be unreliable if the approximation introduces systematic bias.
Authors: We agree that the manuscript does not provide theoretical bounds on the value-function approximation error or convergence arguments as the lookahead depth or iteration count increases. The multistep look-ahead ADP is presented as a practical heuristic to address the curse of dimensionality for the finite-horizon stochastic dynamic program with continuous states arising from thermal dynamics. Its effectiveness is supported by numerical simulations demonstrating achievable cost-discomfort trade-offs under time-varying prices. We will revise the abstract to qualify the central claim as providing a practical computational approach validated through simulations, rather than implying proximity to the optimal DP solution. We will also add a small-scale numerical comparison on a simplified deterministic instance to illustrate approximation behavior where an exact solution is computable. Deriving general error bounds is outside the scope of this applied work. revision: partial
- Providing a bound on the value-function approximation error or a convergence argument for the multistep look-ahead ADP as lookahead depth or iteration count increases.
Circularity Check
No circularity: algorithmic approximation stands on its own formulation
full rationale
The paper states the MO_HEMU problem as a finite-horizon stochastic dynamic program and replaces exact solution with an iterative multistep look-ahead ADP procedure that selects decisions by limited forward simulation. No parameter is fitted to a data subset and then relabeled a prediction; no self-citation supplies a uniqueness theorem or ansatz that defines the reported trade-off surface; the algorithm is described without reference to prior author work that would make the outcome tautological. The numerical simulations are presented as external validation rather than internal re-derivation, so the derivation chain does not reduce to its own inputs by construction.
Axiom & Free-Parameter Ledger
free parameters (2)
- look-ahead horizon length
- discomfort weighting factor
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The proposed MO-HEMU is formulated as a dynamic program and a method based on the approximated dynamic programming is used as the scheduling algorithm. The developed method, called multistep look-ahead algorithm...
-
IndisputableMonolith/Foundation/AbsoluteFloorClosure.leanabsolute_floor_iff_bare_distinguishability unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The MO formulation is based on the ε-constrained technique...
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
BSS: The state of charge (SOC) defines the state variable, i.e., ௧ ௧ , where ௫ሿ. The decision variable ௧ controls charging (௧ 0) and discharging (௧ ൏ 0 ) constrained by charging/discharging rates, i.e.,௧ ∈ ௫ሿ. Transition function of BSS state is. The battery power is obtained by: ௧ ݐ ௗ ݐ ௧ିଵ)ି (1) where { 0 , . } and (. )ି = min{0, . } . E...
-
[2]
Dishwasher and laundry appliances belong to this class
Deferrable non-interruptible appliances: The starting time of these appliances can be delayed across the day, but once the appliance ܣis turned on, it has to work for Iୟ time slots uninterruptedly to finish its task. Dishwasher and laundry appliances belong to this class. The power consumption at each time might be different. Assuming ଵ ଶ ூೌ } and ሿ as ...
-
[3]
Deferrable interruptible appliances: These appliances, such as Plug-in electrical vehicles (PEVs), can be turned on and off multiple times. The controller finds the best time slots within ሿ for the operation of appliance ܨwith total job length. At t ∉ ሾα,β ሿ, s୲ =u ୲ =0 . If the appliance is switched on during +1 ) , the state moves to st+1 f =s ...
-
[4]
The state variables are defined by the indoor and hot water temperature
Thermal appliances: The controller adjusts the power consumption of AC and EWH to keep the indoor temperature and temperature of the water inside the tank within the customer desired range. The state variables are defined by the indoor and hot water temperature. That is s୲ େ≜T e m୲ ୧୬ and ௧ ௧ ௐ where ௧ ௫ ൧ and ௧ ௐ ∈ ௫ௐ ൧. The decision varia...
-
[5]
+ܬ௧)൯ൟ (10) ௧ାଵ=݂௧) ௧ ௧ ௧ ௫} ௧ ௧ ௧ ଵ ூಶೈಹ ௧ ௫ ௐ ൟ ௧ ௧ ௧ ଵ ூಲ ௧ ௫ ൟ where ௧ ௧ ௧ ௧ ௧ ൯ is the total state space and u୲=൫ u୲ ୟ ,u ୲ ,u ୲ େ,u ୲ ୌ,u ୲ ୠ ൯ is the total decision space at time slot t. gt(st,L t) is the one-stage objective function (which can be either of CoEC or TDL) and J୲(s୲) is the optimal cost-to-go at state ௧ and tim...
-
[6]
The determination of ࢚is often problem-dependent
instead of ௧). The determination of ࢚is often problem-dependent. One potential option is to select some important variables of the problem and do minimization over the feasible decisions related to these variables, while keeping the decisions of the remaining variables at some nominal values. This approach can be specifically useful in MLA in which the or...
-
[7]
is used for the one-stage look-ahead approximations, while the reduced decision set ௧) is used for the multi-step look-ahead approximations. In the formulated HEMU, for example, the sets of charging/discharging decisions for the BSS, i.e., ௧ , can be nominated to build such a ௧). C. Multiobjective Formulation The proposed MO-HEMU is stated as follows: mi...
-
[8]
solutions as the Pareto list are found. An advantage of this technique is that it allows the decision maker to have control over the Pareto front by correctly choosing the values of kଶ. The choice of a higher value for ଶ results in more solutions in the Pareto list with better density but at the expense of higher calculation time. In this paper kଶ=6 . IV....
-
[9]
Single objective optimization: First, in order to evaluate the performance of the developed MLA in terms of the solution quality and computational time, different case studies with a different combination of devices and thus a different number of state and decision variables are defined as follows: • Case #1: only deferrable appliances are considered; • C...
-
[10]
Multiobjective optimization: Finally, the MO-HEMU model is solved by the TSLA considering all devices (case #5) and the obtained Pareto list is displayed in Fig. 6. The choice of the best compromise solution from this list depends on the preferences of the customer; whether minimizing the electricity cost is her priority or minimizing thermal discomfort i...
-
[11]
Demand Response for Residential Appliances via Customer Reward Scheme,
C. Vivekananthan, Y. Mishra, G. Ledwich, and F. Li, "Demand Response for Residential Appliances via Customer Reward Scheme," IEEE Trans. Smart Grid, vol. 5, no. 2, pp. 809-820, 2014
work page 2014
-
[12]
Home energy management systems: A review of modelling and complexity,
M. Beaudin and H. Zareipour, "Home energy management systems: A review of modelling and complexity," Renew. Sust. Energ. Rev., vol. 45, pp. 318-335, 2015
work page 2015
-
[13]
Dynamic Programming Based Home Energy Management Unit Incorporating PVs and Batteries,
B. Jeddi, Y. Mishra, and G. Ledwich, "Dynamic Programming Based Home Energy Management Unit Incorporating PVs and Batteries," Power and Energy Society General Meeting, Chicago, IL, USA, 2017
work page 2017
-
[14]
Real-Time Price Based Home Energy Management Scheduler,
C. Vivekananthan, Y. Mishra, and F. Li, "Real-Time Price Based Home Energy Management Scheduler," IEEE Trans. Power Syst., vol. 30, no. 4, pp. 2149- 2159, 2015
work page 2015
-
[15]
S. Althaher, P. Mancarella, and J. Mutale, "Automated Demand Response From Home Energy Management System Under Dynamic Pricing and Power and Comfort Constraints," IEEE Trans. Smart Grid, vol. 6, no. 4, pp. 1874- 1883, 2015
work page 2015
-
[16]
Modeling and Stochastic Control for Home Energy Management,
Z. Yu, L. Jia, M. C. Murphy-Hoye, A. Pratt, and L. Tong, "Modeling and Stochastic Control for Home Energy Management," IEEE Trans. Smart Grid, vol. 4, no. 4, pp. 2244-2255, 2013
work page 2013
-
[17]
M. A. A. Pedrasa, T. D. Spooner, and I. F. MacGill, "Coordinated Scheduling of Residential Distributed Energy Resources to Optimize Smart Home Energy Services," IEEE Trans. Smart Grid, vol. 1, no. 2, pp. 134-143, 2010
work page 2010
-
[18]
Optimal Power Flow Management for Grid Connected PV Systems With Batteries,
Y. Riffonneau, S. Bacha, F. Barruel, and S. Ploix, "Optimal Power Flow Management for Grid Connected PV Systems With Batteries," IEEE Trans. Sust. Energy, vol. 2, no. 3, pp. 309-320, 2011
work page 2011
-
[19]
W. B. Powell, Approximate Dynamic Programming: Solving the Curses of Dimensionality. John Wiley & Sons, 2007
work page 2007
-
[20]
A Fast Technique for Smart Home Management: ADP with Temporal Difference Learning,
C. Keerthisinghe, G. Verbic, and A. C. Chapman, "A Fast Technique for Smart Home Management: ADP with Temporal Difference Learning," IEEE Trans. Smart Grid, vol. PP, no. 99, pp. 1-1, 2016
work page 2016
-
[21]
An Optimal Power Scheduling Method for Demand Response in Home Energy Management System,
Z. Zhao, W. C. Lee, Y. Shin, and K. B. Song, "An Optimal Power Scheduling Method for Demand Response in Home Energy Management System," IEEE Trans. Smart Grid, vol. 4, no. 3, pp. 1391-1400, 2013
work page 2013
-
[22]
Optimal Smart Home Energy Management Considering Energy Saving and a Comfortable Lifestyle,
A. Anvari-Moghaddam, H. Monsef, and A. Rahimi-Kian, "Optimal Smart Home Energy Management Considering Energy Saving and a Comfortable Lifestyle," IEEE Trans. Smart Grid, vol. 6, no. 1, pp. 324-332, 2015
work page 2015
-
[23]
Optimal operation scheduling of electric water heaters under dynamic pricing,
V. Kapsalis and L. Hadellis, "Optimal operation scheduling of electric water heaters under dynamic pricing," Sust. Cities Soc., vol. 31, pp. 109-121, 2017
work page 2017
-
[24]
Modeling of the Cost of EV Battery Wear Due to V2G Application in Power Systems,
C. Zhou, K. Qian, M. Allan, and W. Zhou, "Modeling of the Cost of EV Battery Wear Due to V2G Application in Power Systems," IEEE Trans. Energy Conv., vol. 26, no. 4, pp. 1041-1050, 2011
work page 2011
-
[25]
C. Bordin, H. O. Anuta, A. Crossland, I. L. Gutierrez, C. J. Dent, and D. Vigo, "A linear programming approach for battery degradation analysis and optimization in offgrid power systems with solar energy integration," Renew. Energy, vol. 101, pp. 417-430, 2017
work page 2017
-
[26]
B. Jeddi, A. H. Einaddin, and R. Kazemzadeh, "A novel multi-objective approach based on improved electromagnetism-like algorithm to solve optimal power flow problem considering the detailed model of thermal generators," Int. Trans. Electric. Energy Syst., vol. 27, no. 4, 2017
work page 2017
-
[27]
V. Vahidinasab, "Optimal distributed energy resources planning in a competitive electricity market: Multiobjective optimization and probabilistic design," Renew. Energy, vol. 66, pp. 354-363, 2014
work page 2014
-
[28]
Network impact of multiple HEMUs with PVs and BESS in a low voltage distribution feeder,
B. Jeddi, Y. Mishra, and G. Ledwich, "Network impact of multiple HEMUs with PVs and BESS in a low voltage distribution feeder," Australasian Universities Power Engineering Conference (AUPEC), 2017, pp. 1-6 Pareto solutions for case #5 obtained by the TSLA Scheduling of devices corresponding to (a) minimum CoEC, (b) average CoEC and TDL, (c) minimum TDL so...
work page 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.