Recognition: unknown
A Q-learning-based QoS-aware multipath routing protocol in IoMT-based wireless body area network
Pith reviewed 2026-05-10 09:33 UTC · model grok-4.3
The pith
QQMR applies Q-learning to prioritized data paths to raise delivery rates and cut energy use in body-area medical networks.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
QQMR classifies data into three priority levels, employs adaptive multi-level queuing and fuzzy C-means clustering to optimize routing decisions, maintains separate learning policies for each data type, and selects primary and backup paths, resulting in higher packet delivery ratios together with lower delay, routing overhead, and energy consumption than existing protocols.
What carries the argument
QQMR, the Q-learning-based QoS-aware multipath routing protocol that classifies traffic by priority, runs independent learning agents per class, and combines fuzzy C-means clustering with multi-level queues to pick stable primary and backup routes.
If this is right
- High-priority medical packets experience measurably lower end-to-end delay.
- Overall network lifetime increases because sensor nodes consume less energy for routing.
- Routing overhead drops because backup paths are pre-learned rather than discovered on demand.
- The protocol remains functional when topology changes rapidly due to body motion.
Where Pith is reading between the lines
- The separate-policy design could be reused in other IoT domains that mix urgent and routine traffic.
- Real deployments would need to verify whether the learned policies remain stable when new sensor nodes join or leave.
- Extending the three-level classification to more granular QoS classes might further improve performance for mixed vital-sign streams.
Load-bearing premise
The simulation scenarios used to test QQMR accurately capture the dynamic topology, energy constraints, and traffic patterns of real-world IoMT-based wireless body area networks.
What would settle it
A physical testbed experiment showing equal or lower packet delivery ratio and higher energy use under realistic patient movement and mixed-priority traffic would disprove the claimed gains.
Figures
read the original abstract
The Internet of Medical Things (IoMT) enables intelligent healthcare services but faces challenges such as dynamic topology, energy constraints, and diverse QoS requirements. This paper proposes QQMR, a Q-learning-based QoS-aware multipath routing method for WBANs. QQMR classifies data into three priority levels and employs adaptive multi-level queuing and fuzzy C-means clustering to optimize routing decisions. It maintains separate learning policies for each data type and selects primary and backup paths accordingly. Experimental results demonstrate improved packet delivery ratio and significant reductions in delay, routing overhead, and energy consumption compared to existing methods.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes QQMR, a Q-learning-based QoS-aware multipath routing protocol for IoMT-based wireless body area networks. Data packets are classified into three priority levels; the protocol applies fuzzy C-means clustering for node organization, adaptive multi-level queuing, and maintains independent Q-learning policies per class to select primary and backup paths. Simulations are reported to demonstrate gains in packet delivery ratio together with reductions in end-to-end delay, routing overhead, and energy consumption relative to existing methods.
Significance. If the performance gains are shown to be robust under realistic WBAN conditions, the work would provide a concrete reinforcement-learning approach to QoS differentiation and multipath reliability in energy-constrained, topology-dynamic medical sensor networks. The combination of per-class learning with fuzzy clustering is a plausible direction, but the significance hinges on whether the evaluation scenarios faithfully reproduce body-induced mobility, heterogeneous energy budgets, and priority traffic patterns.
major comments (3)
- [Experimental Results] The experimental section provides no quantitative results, simulation parameters, baseline protocol names, or statistical analysis (e.g., confidence intervals or number of runs). This absence prevents verification of the claimed improvements in PDR, delay, overhead, and energy and makes the central performance claim impossible to assess.
- [Simulation Environment] The simulation setup does not specify the mobility model used to generate body-induced topology changes, the energy dissipation function for heterogeneous sensors, or the traffic generator for priority-differentiated flows. Without these details the reported gains cannot be distinguished from artifacts of an idealized simulator.
- [Proposed QQMR Protocol] The Q-learning formulation (state space, action space, reward function, and update rule) is described only at a high level; the integration with fuzzy C-means clustering and the separate policies per data class therefore cannot be evaluated for correctness or novelty.
minor comments (2)
- [Abstract] The abstract asserts 'significant reductions' without any numerical values; these should be replaced by concrete percentages or absolute deltas drawn from the results tables.
- [Introduction] Ensure all compared baseline protocols are cited with full references and that any acronyms (e.g., QQMR) are defined on first use.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed feedback. We have revised the manuscript to address each major comment by expanding the experimental results, simulation environment, and protocol description sections with the requested specifics. Point-by-point responses follow.
read point-by-point responses
-
Referee: [Experimental Results] The experimental section provides no quantitative results, simulation parameters, baseline protocol names, or statistical analysis (e.g., confidence intervals or number of runs). This absence prevents verification of the claimed improvements in PDR, delay, overhead, and energy and makes the central performance claim impossible to assess.
Authors: We agree that the original manuscript lacked explicit quantitative values, parameter tables, baseline names, and statistical measures. In the revised version we have added a dedicated 'Performance Evaluation' subsection that reports concrete PDR, delay, overhead, and energy figures, names the baselines (AODV, DSR, and a recent priority-aware WBAN protocol), lists all simulation parameters, and presents results averaged over 50 independent runs with 95% confidence intervals shown as error bars. revision: yes
-
Referee: [Simulation Environment] The simulation setup does not specify the mobility model used to generate body-induced topology changes, the energy dissipation function for heterogeneous sensors, or the traffic generator for priority-differentiated flows. Without these details the reported gains cannot be distinguished from artifacts of an idealized simulator.
Authors: The referee is correct that these implementation details were omitted. We have extended the 'Simulation Environment' section to specify: a body-mobility model driven by real accelerometer traces to produce realistic topology dynamics; the standard first-order radio energy model with heterogeneous initial energies (0.5 J–2 J) across sensor types; and a priority-aware traffic generator using independent Poisson processes with class-specific rates and packet sizes. These additions make the evaluation reproducible and tied to realistic WBAN conditions. revision: yes
-
Referee: [Proposed QQMR Protocol] The Q-learning formulation (state space, action space, reward function, and update rule) is described only at a high level; the integration with fuzzy C-means clustering and the separate policies per data class therefore cannot be evaluated for correctness or novelty.
Authors: We acknowledge the high-level presentation limited evaluation. The revised manuscript now contains an explicit 'Q-Learning Formulation' subsection that defines the state vector (residual energy, queue length, link quality, priority class), the action set (next-hop selection for primary and backup paths), the multi-objective reward function (weighted sum of delay, reliability, and energy cost), and the standard Q-update equation with learning rate and discount factor. We also describe how fuzzy C-means clustering partitions nodes for route discovery and how each priority class maintains an independent Q-table updated only on class-specific experiences. revision: yes
Circularity Check
No circularity: new protocol design evaluated against external baselines
full rationale
The paper proposes QQMR, a novel Q-learning-based QoS-aware multipath routing protocol that classifies traffic into priority levels, applies adaptive queuing and fuzzy C-means clustering, and maintains separate learning policies per data type. Performance gains in PDR, delay, overhead, and energy are shown via simulation comparisons to existing methods. No equations, fitted parameters renamed as predictions, self-citations, or ansatzes are present in the provided text that would reduce any claim to its own inputs by construction. The derivation chain is self-contained and externally benchmarked.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Internet of medical things: A systematic review,
Huang, C., Wang, J., Wang, S. and Zhang, Y ., 2023. Inter- net of medical things: A systematic review. Neurocomput- ing, 557, p.126719. doi: 10.1016/j.neucom.2023.126719
-
[2]
Narwal, B., Malik, M., Mohapatra, A.K., Baliyan, N., Shukla, V . and Kumar, M., 2024. Dissecting wireless body area networks routing protocols: Classification, com- parative analysis, and research challenges. International Journal of Communication Systems, 37(1), p.e5637. doi: 10.1002/dac.5637
-
[3]
A. Si-Ahmed, M.A. Al-Garadi, N. Boustia, Survey of ma- chine learning based intrusion detection methods for Internet of Medical Things, Appl. Soft Comput. 140 (2022), 110227. doi: 10.1016/j.asoc.2023.110227
-
[4]
M. Papaioannou, M. Karageorgou, G. Mantas, V . Sucasas, I. Essop, J. Rodriguez, D. Lymberopoulos, A survey on se- curity threats and countermeasures in Internet of Medical Things (IoMT), Trans. Emerg. Telecommun. Technol. 33 (2022). doi: 10.1002/ett.4049
-
[5]
B. Bhushan, A. Kumar, A.K. Agarwal, A. Kumar, P. Bhat- tacharya, A. Kumar, Towards a secure and sustainable Inter- net of Medical Things (IoMT): requirements, design chal- lenges, security techniques, and future trends, Sustain 15 (2023). doi: 10.3390/su15076177
-
[6]
B. Priya Prathaban, R. Balasubramanian, R. Kalpana, Fore- Seiz: an IoMT based headband for real-time epileptic seizure forecasting, Expert Syst. Appl. 188 (2022), 116083. doi: 10.1016/j.eswa.2021.116083
-
[7]
J. Srivastava, S. Routray, S. Ahmad, M.M. Waris, Internet of Medical Things (IoMT)-based smart healthcare system: trends and progress, Comput. Intell. Neurosci 2022 (2022). doi: 10.1155/2022/7218113
- [8]
- [9]
-
[10]
Aski, V .J., Dhaka, V .S., Parashar, A. and Rida, I., 2023. Internet of Things in healthcare: A survey on protocol standards, enabling technologies, WBAN architectures and open issues. Physical Communication, 60, p.102103. doi: 10.1016/j.phycom.2023.102103
-
[11]
Ahmed, S.F., Alam, M.S.B., Afrin, S., Rafa, S.J., Rafa, N. and Gandomi, A.H., 2024. Insights into Internet of Med- ical Things (IoMT): Data fusion, security issues and po- tential solutions. Information Fusion, 102, p.102060. doi: 10.1016/j.inffus.2023.102060
-
[12]
He, P., Huang, D., Wu, D., He, H., Wei, Y ., Cui, Y ., Wang, R. and Peng, L., 2024. A survey of inter- net of medical things: technology, application and fu- ture directions. Digital Communications and Networks. doi: 10.1016/j.dcan.2024.11.013
-
[13]
and Ruiz-Martnez, A.,
Lpez Martnez, A., Gil Perez, M. and Ruiz-Martnez, A.,
-
[14]
ACM Computing Surveys, 55(12), pp.1-38
A comprehensive review of the state-of-the-art on se- curity and privacy issues in healthcare. ACM Computing Surveys, 55(12), pp.1-38. doi: 10.1145/3571156
-
[15]
Cornet, B., Fang, H., Ngo, H., Boyer, E.W. and Wang, H., 2022. An overview of wireless body area networks for mobile health applications. IEEE Network, 36(1), pp.76-82. doi: 10.1109/MNET.103.2000761
-
[16]
Singh, A.K. and Kumar, S., 2024. An efficient and secure CLAKA protocol for blockchain-aided wireless body area networks. Expert Systems with Applications, 242, p.122740. doi: 10.1016/j.eswa.2023.122740
-
[17]
Arora, N., Gupta, S.H. and Kumar, B., 2024. Energy- efficiency optimization and the comparative performance analysis for Wireless Body Area Networks (WBANs). Sustainable Computing: Informatics and Systems, 42, p.100975. doi: 10.1016/j.suscom.2024.100975. 23
-
[18]
Javaid, S., Zeadally, S., Fahim, H. and He, B., 2022. Medical sensors and their integration in wireless body area networks for pervasive healthcare delivery: A re- view. IEEE Sensors Journal, 22(5), pp.3860-3877. doi: 10.1109/JSEN.2022.3141064
-
[19]
Guo, W., Hou, Y ., Gan, Y . and Guo, W., 2024. Efficient data transmission mechanisms in energy harvesting wireless body area networks: A survey. Computer Networks, 254, p.110769. doi: 10.1016/j.comnet.2024.110769
-
[20]
Wang, D., Zhou, J., Masdari, M., Qasem, S.N. and Sayed, B.T., 2024. Security in wireless body area networks via anonymous authentication: Comprehen- sive literature review, scheme classification, and fu- ture challenges. Ad Hoc Networks, 153, p.103332. doi: 10.1016/j.adhoc.2023.103332
-
[21]
Kaleem, M. and Devarajan, G.G., 2024. Energy- efficient classification strategy for detecting interference and malicious sensor nodes in wireless body area Net- works. Cyber Security and Applications, 2, p.100048. doi: 10.1016/j.csa.2024.100048
-
[22]
Lansky, J., Rahmani, A.M. and Hosseinzadeh, M., 2022. Reinforcement learning-based routing protocols in vehic- ular Ad Hoc Networks for Intelligent Transport System (ITS): A survey. Mathematics, 10(24), p.4673. doi:1270 10.3390/math10244673
-
[23]
Khoshvaght, P., Tanveer, J., Rahmani, A.M., Altulyan, M., Alkhrijah, Y ., Yousefpoor, M.S., Yousefpoor, E., Mo- hammadi, M. and Hosseinzadeh, M., 2025. Computational intelligence-based routing schemes in flying ad-hoc net- works (FANETs): A review. Vehicular Communications, p.100913. doi: 10.1016/j.vehcom.2025.100913
-
[24]
Zhang, L. and Lin, P., 2023. Reinforcement learning based energy-neutral operation for hybrid EH powered TBAN. Fu- ture Generation Computer Systems, 140, pp.311-320. doi: 10.1016/j.future.2022.10.037
-
[25]
Zhang, Z., Li, Z., Lai, C.S., Wu, R., Li, X., Lin, J., Ren, X. and Qiao, H., 2025. Joint Optimization of Data Urgency and Freshness in Wireless Body Area Networks for Enhanced eHealth Monitoring. IEEE Transactions on Consumer Elec- tronics. doi: 10.1109/TCE.2025.3544917
-
[26]
Abdellatif, A.A., Mhaisen, N., Mohamed, A., Er- bad, A. and Guizani, M., 2023. Reinforcement learn- ing for intelligent healthcare systems: A review of chal- lenges, applications, and open research issues. IEEE In- ternet of Things Journal, 10(24), pp.21982-22007. doi: 10.1109/JIOT.2023.3288050
-
[27]
Rahmani, A.M., Haider, A., Zaidi, M.M., Alanazi, A., Alsubai, S., Alqahtani, A., Yousefpoor, M.S., Yousefpoor, E. and Hosseinzadeh, M., 2025. QRCF: A new Q-learning- based routing approach using a smart cylindrical filtering system in flying ad hoc networks. Vehicular Communica- tions, 53, p.100905. doi: 10.1016/j.vehcom.2025.100905
-
[28]
and Al-Kadri, M.O.,
Hajar, M.S., Kalutarage, H.K. and Al-Kadri, M.O.,
-
[29]
Computer Networks, 237, p.110073
3R: A reliable multi agent reinforcement learn- ing based routing protocol for wireless medical sen- sor networks. Computer Networks, 237, p.110073. doi: 10.1016/j.comnet.2023.110073
-
[30]
Mohammadi, R. and Shirmohammadi, Z., 2023. RLS2: An energy efficient reinforcement learning-based sleep scheduling for energy harvesting WBANs. Computer Net- works, 229, p.109781. doi: 10.1016/j.comnet.2023.109781
-
[31]
Morphological and cytoskeleton changes in cells after EMT
Hosseinzadeh, M., Yoo, J., Ali, S., Lansky, J., Mildeova, S., Yousefpoor, M.S., Ahmed, O.H., Rahmani, A.M., Tigh- tiz, L., 2023. A cluster-based trusted routing method us- ing fire hawk optimizer (FHO) in wireless sensor networks (WSNs). Sci. Rep. 13 (1), 13046. doi: 10.1038/s41598-023- 40273-8
-
[32]
Rahmani-Sane, G., Azad, S., Ameli, M.T. and Haghani, S., 2025. The Applications of Artificial Intelligence and Dig- ital Twin in Power Systems: An In-Depth Review. IEEE Ac- cess. doi: 10.1109/ACCESS.2025.3580340
-
[33]
Z.A. Khan, S.C. Sivakumar, W.J. Phillips, B. Robertson, QPRR: qosaware peering routing protocol for reliability sen- sitive data in body area network communication, Comput. J. 58 (2014) 17011716. doi: 10.1093/comjnl/bxu114
-
[34]
Nazari, A., Kordabadi, M., Mohammadi, R. and Lal, C., 2023. EQRSRL: an energy-aware and QoS-based rout- ing schema using reinforcement learning in IoMT. Wireless Networks, 29(7), pp.3239-3253. doi: 10.1007/s11276-023- 03367-9
-
[35]
Arafat, M.Y ., Pan, S. and Bak, E., 2024. QQAR: A Q- learning-based QoS-aware routing for IoMT-enabled wire- less body area networks for smart healthcare. Internet of Things, 26, p.101151. doi: 10.1016/j.iot.2024.101151
-
[36]
Bedi, P., Das, S., Goyal, S.B., Shukla, P.K., Mirjalili, S. and Kumar, M., 2022. A novel routing protocol based on grey wolf optimization and Q learning for wireless body area network. Expert Systems with Applications, 210, p.118477. doi: 10.1016/j.eswa.2022.118477
-
[37]
Fatima, N., Tariq, M., Iqbal, Z., Shahid, M., Alzahrani, A.I., Alalwan, N., Khan, M.A., Kadry, S. and Nam, Y ., 2024. Thermal aware high throughput routing protocol for Wire- less Body Area Network. Alexandria Engineering Journal, 104, pp.306-313. doi: 10.1016/j.aej.2024.06.047
-
[38]
and del Carmen Romero- Ternero, M., 2024
Memarian, S., Behmanesh-Fard, N., Aryai, P., Shok- ouhifar, M., Mirjalili, S. and del Carmen Romero- Ternero, M., 2024. TSFIS-GWO: Metaheuristic-driven takagi-sugeno fuzzy system for adaptive real-time routing in WBANs. Applied Soft Computing, 155, p.111427. doi: 10.1016/j.asoc.2024.111427
-
[39]
Arafat, M.Y ., Pan, S. and Bak, E., 2024. An adaptive rein- forcement learning-based mobility-aware routing for hetero- geneous wireless body area networks. IEEE Sensors Journal. doi: 10.1109/JSEN.2024.3440412. 24
-
[40]
doi: 10.1007/s11276-022-02997-9
-
[41]
Karp, B. and Kung, H.T., 2000, August. GPSR: Greedy perimeter stateless routing for wireless networks. In Pro- ceedings of the 6th annual international conference on Mobile computing and networking (pp. 243-254). doi: 10.1145/345910.345953
-
[42]
Hosseinzadeh, M., Haider, A., Rahmani, A.M., Au- rangzeb, K., Liu, Z., Yousefpoor, M.S., Yousefpoor, E., Lee, S.W. and Khoshvaght, P., 2025. A Q-learning- based trust model in underwater acoustic sensor net- works (UASNs). Ad Hoc Networks, 178, p.103918. doi: 10.1016/j.adhoc.2025.103918
-
[43]
M. Hosseinzadeh, S. Ali, H.J. Ahmad, F. Alanazi, M.S. Yousefpoor, E. Yousefpoor, O.H. Ahmed, A.M. Rahmani, S.W. Lee, A novel Q-learning-based secure routing scheme with a robust defensive system against wormhole attacks in flying ad hoc networks, Veh. Commun. (2024) 100826. doi: 10.1016/j.vehcom.2024.100826
-
[44]
M. Hosseinzadeh, J. Tanveer, F. Alanazi, K. Au- rangzeb, M.S. Yousefpoor, E. Yousefpoor, A. Darwesh, S.W. Lee, A.M. Rahmani, An intelligent clustering scheme based on whale optimization algorithm in fly- ing ad hoc networks, Veh. Commun. (2024) 100805. doi: 10.1016/j.vehcom.2024.100805
-
[45]
A.M. Rahmani, R.A. Naqvi, E. Yousefpoor, M.S. Yousefpoor, O.H. Ahmed, M. Hosseinzadeh, K. Sid- dique, A Q-learning and fuzzy logic-based hierarchical routing scheme in the intelligent transportation system for smart cities, Mathematics 10 (22) (2022) 4192. doi: 10.3390/math10224192
-
[46]
A.E. Ezugwu, A.M. Ikotun, O.O. Oyelade, L. Abuali- gah, J.O. Agushaka, C.I. Eke, A.A. Akinyelu, A compre- hensive survey of clustering algorithms: state-of-theart ma- chine learning applications, taxonomy, challenges, and fu- ture research prospects, Eng. Appl. Artif. Intell. 110 (2022) 104743. doi: 10.1016/j.engappai.2022.104743
-
[47]
B. Ji, Y . Wang, K. Song, C. Li, H. Wen, V .G. Menon, S. Mumtaz, A survey of computational intelli- gence for 6G: key technologies, applications and trends, IEEE Trans. Ind. Inform. 17 (10) (2021) 7145–7154. doi: 10.1109/TII.2021.3052531
-
[48]
Z. Li, J. Wang, J. Chen, Z. Fang, Y . Ren, Flying ad-hoc network covert communications with deep rein- forcement learning, IEEE Wirel. Commun. (2024). doi: 10.1109/MWC.010.2300200
-
[49]
Lansky, J., Ali, S., Rahmani, A.M., Yousefpoor, M.S., Yousefpoor, E., Khan, F. and Hosseinzadeh, M., 2022. Rein- forcement learning-based routing protocols in flying ad hoc networks (FANET): A review. Mathematics, 10(16), p.3017. doi: 10.3390/math10163017
-
[50]
Han, D., Du, X., Liu, X., Tian, X., 2024. Fuzzy C-means clustering and improved arithmetic optimization algorithm- based layering cooperative routing protocol for UASNs. IEEE Sens. J. doi: 10.1109/JSEN.2024.3413793
-
[51]
Bede, B., 2013. Fuzzy clustering. In Mathematics of Fuzzy Sets and Fuzzy Logic (pp. 213-219). Berlin, Heidel- berg: Springer Berlin Heidelberg. doi: 10.1007/978-3-642- 35221-8_12
-
[52]
Ramonet, A.G., Pecorella, T., Picano, B. and Kinoshita, K., 2024. Perspectives on IoT-oriented network simula- tion systems. Computer Networks, 253, p.110749. doi: 10.1016/j.comnet.2024.110749. 25
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.