BRAVR: An AP-Assisted Online DRL Mechanism for Interactive VR Bitrate Adaptation over Wi-Fi

Boris Bellalta; Francesc Wilhelmi; Miguel Casasnovas

arxiv: 2606.24389 · v1 · pith:6IYJQSPSnew · submitted 2026-06-23 · 💻 cs.NI


Pith reviewed 2026-06-25 21:59 UTC · model grok-4.3

classification 💻 cs.NI

keywords interactive VR streaming · bitrate adaptation · deep reinforcement learning · Wi-Fi networks · access point assistance · QoS · airtime fairness · online learning


The pith

BRAVR integrates Wi-Fi access point statistics into a decentralized DRL agent to adapt VR bitrates while preserving QoS and airtime fairness.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper frames interactive VR streaming over Wi-Fi as a network-aware online control problem under dynamic channels and contention. It introduces BRAVR, a DRL mechanism that augments client-side observations with lightweight wireless statistics collected at the serving access point. The goal is to select bitrates that sustain visual quality and low latency while avoiding sustained overutilization of shared airtime in multi-user settings. Physical testbed experiments compare BRAVR against a heuristic baseline and an ablated version lacking AP input, confirming consistent QoS and fairness gains from the added network visibility. This shows that partial network information at the edge can strengthen decentralized learning loops for latency-sensitive wireless applications.

Core claim

BRAVR is a decentralized deep reinforcement learning mechanism for interactive VR bitrate adaptation that incorporates lightweight wireless network statistics collected at the Wi-Fi access point. By integrating these AP-provided inputs with application-layer observations, it enables more informed decisions that optimize visual quality while maintaining streaming performance and promoting airtime fairness in multi-user scenarios. Evaluation in a real VR streaming system on a physical Wi-Fi testbed confirms that BRAVR achieves robust QoS, prevents sustained airtime overutilization, and outperforms an ablated variant without AP assistance.

What carries the argument

The AP-assisted online DRL control loop that fuses application observations with lightweight wireless network statistics for real-time bitrate decisions.
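
As a rough illustration of what fusing the two information sources could look like, the sketch below concatenates client-side and AP-side measurements into one normalized observation vector. Everything named here is hypothetical: the fields (`frame_latency_ms`, `packet_loss`, `bitrate_mbps`, `airtime_fraction`, `rssi_dbm`), the normalization constants, and the `build_observation` helper are assumptions made for the example, since this summary does not enumerate the metrics BRAVR actually uses.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class AppStats:
    """Application-layer measurements seen by the client (hypothetical fields)."""
    frame_latency_ms: float
    packet_loss: float       # fraction in [0, 1]
    bitrate_mbps: float


@dataclass
class ApStats:
    """Lightweight statistics reported by the serving AP (hypothetical fields)."""
    airtime_fraction: float  # share of channel time in use, in [0, 1]
    rssi_dbm: float


def clip01(x: float) -> float:
    """Clamp a value to the [0, 1] range."""
    return max(0.0, min(1.0, x))


def build_observation(app: AppStats,
                      ap: Optional[ApStats] = None,
                      max_latency_ms: float = 100.0,
                      max_bitrate_mbps: float = 100.0) -> List[float]:
    """Return a normalized observation vector.

    With ap=None only application-layer features are used, which mirrors
    the idea of an ablated variant without AP assistance.
    """
    obs = [
        clip01(app.frame_latency_ms / max_latency_ms),
        clip01(app.packet_loss),
        clip01(app.bitrate_mbps / max_bitrate_mbps),
    ]
    if ap is not None:
        obs.append(clip01(ap.airtime_fraction))
        # Map an assumed RSSI range of [-90, -30] dBm onto [0, 1].
        obs.append(clip01((ap.rssi_dbm + 90.0) / 60.0))
    return obs
```

Passing `ap=None` yields a shorter vector, so in this sketch the assisted and ablated agents would need networks with different input sizes.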

If this is right

BRAVR delivers robust quality of service under dynamic channel conditions and shared-medium contention.
It prevents sustained airtime overutilization among multiple VR users on the same access point.
It outperforms a version without AP assistance, confirming value from network-level inputs in the control loop.
AP-assisted online DRL is effective for decentralized interactive VR streaming over commodity Wi-Fi hardware.
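
The figure excerpts further down mention Jain's fairness index as the fairness measure across active VR streams. For reference, the standard index for values x_1..x_n is (sum of x_i)^2 / (n * sum of x_i^2), ranging from 1/n (one user takes everything) to 1 (equal shares). The sketch below computes it; applying it to per-user airtime shares, and treating an all-zero input as perfectly fair, are assumptions of this example.

```python
from typing import Sequence


def jain_index(values: Sequence[float]) -> float:
    """Jain's fairness index: (sum x)^2 / (n * sum x^2)."""
    n = len(values)
    if n == 0:
        raise ValueError("need at least one value")
    total = sum(values)
    sum_sq = sum(v * v for v in values)
    if sum_sq == 0.0:
        # Convention chosen for this sketch: nobody uses airtime, so call it fair.
        return 1.0
    return (total * total) / (n * sum_sq)
```

With two users, equal airtime shares give an index of 1, while one user holding all the airtime gives 0.5.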

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same lightweight AP-statistic approach could extend to other latency-sensitive traffic such as cloud gaming or remote robotics.
Minimal network visibility at the access point may reduce the need for fully client-only or fully centralized adaptation schemes.
The method suggests a practical path for edge-assisted learning in future wireless standards that already expose basic contention metrics.

Load-bearing premise

Lightweight wireless network statistics collected at the AP are sufficient, reliable, and non-disruptive to meaningfully improve the DRL agent's decisions under dynamic channel conditions and contention.

What would settle it

A controlled experiment in which BRAVR shows no improvement or degraded performance relative to the ablated version without AP statistics across varying levels of contention and channel dynamics would falsify the claimed benefit.

Figures

Figures reproduced from arXiv: 2606.24389 by Boris Bellalta, Francesc Wilhelmi, Miguel Casasnovas.

**Figure 1.** Overview of the VR streaming system architecture. Among all traffic flows, video traffic is the dominant contributor to network load, and its encoding bitrate, whether fixed or adaptive, largely determines channel occupancy and airtime consumption. Consequently, bitrate decisions directly influence contention and resource availability in the shared wireless medium, coupling the performance of concurrent VR …

**Figure 2.** Safe reinforcement learning via preemptive shielding. The agent–environment interaction loop. … across all active VR streams, measured using Jain's fairness index. C. Action Space. At each decision epoch $t$, the agent selects an action $a_t \in \mathcal{A}$ based on the observation $o_t$. The action space is defined as $\mathcal{A} = \{-1, 0, +1\}$ (2), corresponding to decreasing, maintaining, or increasing the current encoding bitrate. B…

**Figure 3.** Schematic of the neural network architecture. The observation $o_t$ is mapped through two fully connected layers with ReLU activations to estimate the action–value function for all actions. … dedicated to VR streaming. Depending on whether the scenario is single-player or multi-player, the setup includes one or two laptops as streaming servers, a single Wi-Fi AP running OpenWrt [32], and one or two Meta Quest …

**Figure 5.** Airtime-aware utility per user in the multi-user near–near scenario. Values are averaged over evaluation sessions. Error bars indicate standard deviation. … target bounds while achieving comparable average per-user bitrate, indicating efficient utilization of the available capacity. BRAVR+, in particular, operates at slightly lower bitrate levels but delivers more reliable performance, achieving QoS satisfac…

**Figure 6.** Airtime-aware utility per user in the multi-user near–far scenario. Values are averaged over evaluation sessions. Error bars indicate standard deviation. … the frequency of such events. As a result, it attains the highest airtime-aware utility across users. Bitrate adaptation. Learning-based approaches perform more frequent target bitrate adjustments, resulting in less stable bitrate trajectories than the n…

**Figure 7.** Per-user airtime evolution in the multi-user near–far scenario during representative evaluation sessions. VI. CONCLUSIONS. This paper presents BRAVR, a decentralized, AP-assisted reinforcement learning approach for real-time virtual reality bitrate adaptation over Wi-Fi, leveraging both application- and AP-level information to jointly optimize visual quality, latency, reliability, and multi-user airtime fa…
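
The excerpts attached to Figures 2 and 3 describe a three-action space {−1, 0, +1}, a network with two fully connected ReLU layers that estimates action values, and preemptive shielding. The sketch below shows how those pieces could fit together. It is not the authors' implementation: the layer width, the random initial weights, the linear output head, the bitrate ladder indexed by `level`, and the shield rule (never step outside the ladder, never increase while AP airtime is at or above `airtime_cap`) are all assumptions made for illustration.

```python
import random
from typing import List, Sequence, Tuple

ACTIONS = (-1, 0, +1)  # decrease, maintain, increase the encoding bitrate

Layer = Tuple[List[List[float]], List[float]]  # (weights, biases)


def init_layer(n_in: int, n_out: int, rng: random.Random) -> Layer:
    """Small random weights and zero biases (placeholder for trained values)."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def dense(x: Sequence[float], layer: Layer) -> List[float]:
    """Fully connected layer: W x + b."""
    weights, biases = layer
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, biases)]


def relu(x: Sequence[float]) -> List[float]:
    return [max(0.0, v) for v in x]


class QNetwork:
    """Two hidden ReLU layers followed by a linear head, one output per action."""

    def __init__(self, obs_dim: int, hidden: int = 16, seed: int = 0) -> None:
        rng = random.Random(seed)
        self.l1 = init_layer(obs_dim, hidden, rng)
        self.l2 = init_layer(hidden, hidden, rng)
        self.head = init_layer(hidden, len(ACTIONS), rng)

    def q_values(self, obs: Sequence[float]) -> List[float]:
        hidden = relu(dense(obs, self.l1))
        hidden = relu(dense(hidden, self.l2))
        return dense(hidden, self.head)


def safe_actions(level: int, n_levels: int,
                 airtime: float, airtime_cap: float) -> List[int]:
    """Preemptive shield: compute the allowed actions before the agent chooses."""
    allowed = []
    for action in ACTIONS:
        nxt = level + action
        if nxt < 0 or nxt >= n_levels:
            continue  # would leave the (hypothetical) bitrate ladder
        if action > 0 and airtime >= airtime_cap:
            continue  # hypothetical rule: no increase while airtime is at the cap
        allowed.append(action)
    return allowed


def select_action(q: Sequence[float], allowed: Sequence[int]) -> int:
    """Greedy choice restricted to the shielded action set."""
    return max(allowed, key=lambda a: q[ACTIONS.index(a)])


if __name__ == "__main__":
    net = QNetwork(obs_dim=5)
    level = 2
    allowed = safe_actions(level, n_levels=6, airtime=0.7, airtime_cap=0.6)
    action = select_action(net.q_values([0.5, 0.1, 0.5, 0.7, 0.5]), allowed)
    print("allowed:", allowed, "chosen:", action, "next level:", level + action)
```

Because the shield filters actions before selection, the agent never executes an unsafe action; "maintain" is always permitted here, so the allowed set is never empty.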

Original abstract

Interactive virtual reality (VR) streaming over Wi-Fi requires stringent latency and reliability guarantees, which become increasingly difficult to achieve under dynamic channel conditions and shared medium contention. These challenges make real-time bitrate adaptation a critical yet fundamentally difficult control problem, particularly under limited visibility of the underlying network conditions. This paper formulates VR bitrate adaptation as a network-aware, online decision-making problem and proposes BRAVR, a decentralized deep reinforcement learning (DRL) mechanism designed to optimize visual quality while maintaining streaming performance and promoting airtime fairness in multi-user scenarios. BRAVR integrates application-layer observations with lightweight wireless network statistics collected at the Wi-Fi access point (AP) serving the VR client, enabling more informed bitrate adaptation decisions. We implement BRAVR in a real VR streaming system and evaluate it on a physical Wi-Fi testbed against a strong heuristic baseline and an ablated BRAVR variant without AP assistance. Experimental results show that BRAVR consistently achieves its design objectives, delivering robust quality of service (QoS) and preventing sustained airtime overutilization. It also outperforms its ablated counterpart, highlighting the benefits of incorporating network-level information into the bitrate adaptation control loop. Overall, these results demonstrate the effectiveness of AP-assisted online learning for decentralized interactive VR streaming over commodity Wi-Fi and provide practical insights into bitrate adaptation in shared wireless environments.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note · private letter to a colleague

BRAVR adds AP-collected stats to an online DRL agent for VR bitrate adaptation and shows measurable gains over an ablated version on a physical Wi-Fi testbed.


The main takeaway is that BRAVR integrates lightweight wireless stats from the access point into a decentralized DRL controller for interactive VR streaming. The physical testbed results indicate it maintains QoS and airtime fairness better than both a heuristic baseline and the version without AP input.

What stands out is the real implementation and the direct comparison to the ablated variant. That comparison isolates the contribution of the network-level observations under actual contention and channel variation, which is more convincing than simulation-only claims in this area.

The work stays incremental rather than foundational. It builds on existing DRL-for-bitrate ideas by adding one specific information source, and the scope is limited to commodity Wi-Fi and one application. No equations or derivations appear to introduce circularity or hidden fitting.

Soft spots are modest. The abstract does not detail the DRL state space, reward shaping, or training procedure, so it is hard to judge how sensitive the gains are to those choices. The testbed is physical but single-environment; broader channel conditions or denser deployments would strengthen the case. Still, nothing in the reported design or results points to a load-bearing flaw.

This paper is for people working on practical wireless multimedia systems and applied DRL. It gives a clear data point on the value of AP assistance. It deserves a serious referee because the experimental setup directly tests the central claim and the results are falsifiable.

Referee Report

0 major / 2 minor

Summary. The paper proposes BRAVR, a decentralized deep reinforcement learning mechanism for interactive VR bitrate adaptation over Wi-Fi. It integrates application-layer observations with lightweight wireless network statistics collected at the AP to optimize visual quality, maintain QoS, and promote airtime fairness under dynamic conditions and multi-user contention. The system is implemented in a real VR streaming setup and evaluated on a physical Wi-Fi testbed against a heuristic baseline and an ablated variant without AP assistance, with results indicating consistent achievement of design objectives and performance gains from the network-level inputs.

Significance. If the experimental results hold under the reported conditions, the work provides concrete evidence that AP-assisted network statistics can meaningfully improve DRL-based bitrate decisions for latency-sensitive VR over commodity Wi-Fi. The physical testbed evaluation and direct comparison to the ablated variant are strengths that support the central claim about the value of network-level information in the control loop.

minor comments (2)

[Abstract] The abstract refers to 'lightweight wireless network statistics' without enumerating the specific metrics (e.g., airtime utilization, RSSI, or contention indicators); adding this detail in §3 or the evaluation section would improve reproducibility.
The claim of 'preventing sustained airtime overutilization' would benefit from a precise definition or threshold in the problem formulation section to allow readers to assess how this is measured and enforced.
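
To make the second comment concrete, one possible definition is offered here purely as an illustration and not taken from the paper: a user's airtime is "sustainedly overutilized" when its share stays above a threshold for at least a given number of consecutive measurement windows. Both `threshold` (for example, a fair share of 1/N for N users) and `min_windows` are hypothetical parameters.

```python
from typing import Sequence


def sustained_overuse(airtime: Sequence[float],
                      threshold: float,
                      min_windows: int) -> bool:
    """Return True if airtime exceeds threshold for min_windows windows in a row."""
    if min_windows < 1:
        raise ValueError("min_windows must be at least 1")
    run = 0
    for share in airtime:
        # Extend the current run of violations, or reset it on a compliant window.
        run = run + 1 if share > threshold else 0
        if run >= min_windows:
            return True
    return False
```

Under this definition brief spikes are tolerated, which matches the wording "sustained"; only a run of consecutive violations triggers the flag.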

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for the positive assessment of our work and the recommendation for minor revision. The provided summary accurately captures the core contributions of BRAVR, including the integration of AP-assisted network statistics into the DRL-based bitrate adaptation loop and the physical testbed evaluation.

Circularity Check

0 steps flagged

No significant circularity detected


The paper describes an experimental DRL implementation for VR bitrate adaptation evaluated on a physical Wi-Fi testbed, with comparisons to a heuristic baseline and an ablated variant. No equations, derivations, or parameter-fitting steps are referenced in the provided material that reduce any claimed prediction or result to its own inputs by construction. The central claims rest on direct empirical measurements rather than self-referential definitions or self-citation chains, rendering the work self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The abstract provides no information on free parameters, axioms, or invented entities.



Reference graph

Works this paper leans on

37 extracted references · 1 canonical work pages

[1] M. F. Hossain, A. Jamalipour, K. Munasinghe, A Survey on Virtual Reality over Wireless Networks: Fundamentals, QoE, Enabling Technologies, Research Trends and Open Issues, Authorea Preprints (2023)

[2] X. Yin, A. Jindal, V. Sekar, B. Sinopoli, A control-theoretic approach for dynamic adaptive video streaming over HTTP, in: Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication, 2015, pp. 325–338

[3] K. Spiteri, R. Urgaonkar, R. K. Sitaraman, BOLA: Near-optimal bitrate adaptation for online videos, IEEE/ACM Transactions on Networking 28 (4) (2020) 1698–1711

[4] H. Mao, R. Netravali, M. Alizadeh, Neural adaptive video streaming with Pensieve, in: Proceedings of the Conference of the ACM Special Interest Group on Data Communication, 2017, pp. 197–210

[5] N. A. Hafez, M. S. Hassan, T. Landolsi, Reinforcement learning-based rate adaptation in dynamic video streaming, Telecommunication Systems 83 (4) (2023) 395–407

[6] M. Naresh, P. Saxena, M. Gupta, PPO-ABR: Proximal policy optimization based deep reinforcement learning for adaptive bitrate streaming, in: 2023 International Wireless Communications and Mobile Computing (IWCMC), IEEE, 2023, pp. 199–204

[7] Information technology — Dynamic adaptive streaming over HTTP (DASH) — Part 5: Server and network assisted DASH (SAND), https://www.iso.org/standard/78448.html, amendment 1 to ISO/IEC 23009-5:2017 (2020)

[8] A. Mehrabi, M. Siekkinen, A. Ylä-Jääski, Edge computing assisted adaptive mobile video streaming, IEEE Transactions on Mobile Computing 18 (4) (2018) 787–800

[9] J. W. Kleinrouweler, S. Cabrero, P. Cesar, Delivering stable high-quality video: An SDN architecture with DASH assisting network elements, in: Proceedings of the 7th International Conference on Multimedia Systems, 2016, pp. 1–10

[10] W. Wu, J. Yuan, S. Ma, M. Yang, AP-assisted adaptive video streaming in wireless networks with high-density clients, Computer Communications 219 (2024) 53–63

[11] M. Liubogoshchev, E. Korneev, E. Khorov, EVeREst: Bitrate adaptation for cloud VR, Electronics 10 (6) (2021) 678

[12] E. Korneev, M. Liubogoshchev, D. Bankov, E. Khorov, How to Model Cloud VR: An Empirical Study of Features That Matter, IEEE Open Journal of the Communications Society (2024)

[13] F. Maura, M. Casasnovas, B. Bellalta, Experimenting with adaptive bitrate algorithms for virtual reality streaming over Wi-Fi, in: Proceedings of the 30th Annual International Conference on Mobile Computing and Networking, 2024, pp. 1930–1937

[14] M. Casasnovas, F. Maura, I. Vandebroeck, H. Sukmawanto, E. Joris, B. Bellalta, NeSt-VR: An Adaptive Bitrate Algorithm for Virtual Reality Streaming over Wi-Fi, arXiv preprint arXiv:2502.14947 (2025)

[15] G. Kougioumtzidis, V. K. Poulkov, P. I. Lazaridis, Z. D. Zaharis, Deep Reinforcement Learning-Based Resource Allocation for QoE Enhancement in Wireless VR Communications, IEEE Access (2025)

[16] Y. Sun, J. Chen, Z. Wang, M. Peng, S. Mao, Enabling mobile virtual reality with open 5G, fog computing and reinforcement learning, IEEE Network 36 (6) (2022) 142–149

[17] R. Singh, R. Sukapuram, S. Chakraborty, Mobility-aware multi-access edge computing for multiplayer augmented and virtual reality gaming, in: 2022 IEEE 21st International Symposium on Network Computing and Applications (NCA), Vol. 21, IEEE, 2022, pp. 191–200

[18] S.-J. Lu, W.-X. Chen, Y.-S. Su, Y.-S. Chang, Y.-W. Liu, C.-Y. Li, G.-H. Tu, Practical Latency-Aware Scheduling for Low-Latency Elephant VR Flows in Wi-Fi Networks, in: 2024 IEEE International Conference on Pervasive Computing and Communications (PerCom), IEEE, 2024, pp. 57–68

[19] Z. Jiang, X. Zhang, Y. Xu, Z. Ma, J. Sun, Y. Zhang, Reinforcement learning based rate adaptation for 360-degree video streaming, IEEE Transactions on Broadcasting 67 (2) (2020) 409–423

[20] N. Kan, J. Zou, C. Li, W. Dai, H. Xiong, RAPT360: Reinforcement learning-based rate adaptation for 360-degree video streaming with adaptive prediction and tiling, IEEE Transactions on Circuits and Systems for Video Technology 32 (3) (2021) 1607–1623

[21] W. Quan, Y. Pan, B. Xiang, L. Zhang, Reinforcement learning driven adaptive VR streaming with optical flow based QoE, arXiv preprint arXiv:2003.07583 (2020)

[22] Y. Li, Federated deep reinforcement learning-based caching and bitrate adaptation for VR panoramic video in clustered MEC networks, Electronics 11 (23) (2022) 3968

[23] ALVR Project, ALVR (Air Light VR), https://github.com/alvr-org/ALVR, accessed: May 20, 2026 (2026)

[24] R. S. Sutton, A. G. Barto, et al., Reinforcement learning: An introduction, Vol. 1, MIT Press, Cambridge, 1998

[25] M. Alshiekh, R. Bloem, R. Ehlers, B. Könighofer, S. Niekum, U. Topcu, Safe reinforcement learning via shielding, in: Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 32, 2018

[26] H. S. Rossi, K. Mitra, C. Åhlund, I. Cotanis, QoE Models for Virtual Reality Cloud-based First Person Shooter Game over Mobile Networks, in: 2024 20th International Conference on Network and Service Management (CNSM), IEEE, 2024, pp. 1–5

[27] H. Van Seijen, H. Van Hasselt, S. Whiteson, M. Wiering, A theoretical and empirical analysis of expected SARSA, in: 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, IEEE, 2009, pp. 177–184

[28] B. T. Polyak, A. B. Juditsky, Acceleration of stochastic approximation by averaging, SIAM Journal on Control and Optimization 30 (4) (1992) 838–855

[29] P. J. Huber, Robust estimation of a location parameter, in: Breakthroughs in Statistics: Methodology and Distribution, Springer, 1992, pp. 492–518

[30] D. P. Kingma, J. Ba, Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980 (2014)

[31] R. Pascanu, T. Mikolov, Y. Bengio, On the difficulty of training recurrent neural networks, in: International Conference on Machine Learning, PMLR, 2013, pp. 1310–1318

[32] OpenWrt Project, OpenWrt, https://openwrt.org, accessed: May 20, 2026 (2026)

[33] M. Casasnovas, BRAVR-DRL: AP-Assisted Deep Reinforcement Learning for VR Bitrate Adaptation over Wi-Fi, https://github.com/miguelcUPF/BRAVRDRL (2026)

[34] M. Casasnovas, M. Carrascosa-Zamacois, B. Bellalta, Can cloud-based VR streaming handle Wi-Fi OBSS contention?, in: 2025 IEEE Conference on Standards for Communications and Networking (CSCN), IEEE, 2025, pp. 1–6

[35] B. Bellalta, M. Casasnovas, F. Maura, A. Rodríguez, J. S. Marquerie, P. L. García, F. Wilhelmi, J. Blat, Understanding the Wi-Fi and VR streaming interplay: A comprehensible simulation and experimental study, Journal of Network and Computer Applications (2025) 104391

[36] M. Casasnovas, F. Wilhelmi, B. Bellalta, AP-Assisted VR Streaming Dataset (BRAVR) (2026). doi:10.5281/zenodo.20072438. URL https://doi.org/10.5281/zenodo.20072438

[37] C. Michaelides, M. Casanovas, D. Núñez, B. Bellalta, Lessons learned from a large-scale virtual reality experience over Wi-Fi, IEEE Transactions on Networking (2025)
