An Empirical Analysis of Mobile Energy Consumption Across User Configurations
Pith reviewed 2026-05-07 16:06 UTC · model grok-4.3
The pith
Automated tests across popular apps quantify the energy consumption impacts of user settings on mobile devices.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Employing an automated monitoring framework, a series of user interface tests that simulate realistic usage patterns across popular applications was conducted to systematically evaluate the energy impact of user-controllable factors, including device settings such as screen brightness, refresh rate, connectivity status, interface themes, and battery-saving profiles, combined with app-specific variables like video resolution and message size. By analyzing over 12,000 data points, the paper quantifies the real-world impact of common settings, revealing the trade-offs between user experience and device autonomy.
What carries the argument
Automated monitoring framework with UI tests simulating realistic usage patterns on selected mobile apps while varying user settings.
If this is right
- Adjusting screen brightness and refresh rate produces different levels of battery drain during app use.
- Selecting lower video resolutions in streaming apps reduces overall energy consumption.
- Activating battery-saving profiles delivers consistent power savings across communication and media apps.
- Managing connectivity status and interface themes offers additional levers for conserving device energy.
- The quantified data enables users to make explicit choices prioritizing experience features or extended autonomy.
Where Pith is reading between the lines
- App developers could apply comparable testing methods to set energy-efficient defaults in their software.
- Mobile operating systems might use similar empirical data to generate personalized setting recommendations.
- Extending the approach to additional device models or emerging apps would strengthen the generalizability of the trade-off findings.
Load-bearing premise
The automated UI tests simulating realistic usage patterns on selected apps and devices accurately capture energy consumption under actual user behavior and varied real-world conditions.
What would settle it
Direct comparison of energy logs from real users performing similar tasks on the same devices and apps would test if the simulated results match observed consumption.
Figures
read the original abstract
Mobile devices have become ubiquitous tools for communication, entertainment, and productivity, yet battery autonomy remains a constraint. While energy-saving tips exist, they are often generic, anecdotal, or focused on software development rather than end-user behavior, leaving users to rely on grey literature or tacit knowledge to optimize their device energy consumption, lacking the academic rigor to ensure their effectiveness. This research aims to bridge the gap between technical energy analysis and practical user application by quantifying the energy consumption of different user-controlled parameters. Employing an automated monitoring framework, a series of user interface tests that simulate realistic usage patterns across popular applications (i.e., WhatsApp, Instagram, TikTok, and YouTube) was conducted. The objective is to have a systematic evaluation of the energy impact of user-controllable factors, including device settings, such as screen brightness, refresh rate, connectivity status, interface themes, and battery-saving profiles, combined with more app-specific variables (e.g., video resolution and message size). By analyzing over 12,000 data points, this paper quantifies the real-world impact of common settings, revealing the trade-offs between user experience and device autonomy.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents an empirical study employing an automated monitoring framework with UI tests that simulate realistic usage patterns on WhatsApp, Instagram, TikTok, and YouTube. It collects and analyzes over 12,000 data points to quantify the energy consumption effects of user-controllable parameters including screen brightness, refresh rate, connectivity status, interface themes, battery-saving profiles, video resolution, and message size, with the goal of identifying trade-offs between user experience and device battery autonomy.
Significance. If the automated tests are shown to validly represent real-world usage distributions and the measurements prove robust to hardware and environmental variation, the work could supply practical, data-driven guidance for end-users seeking to optimize mobile battery life beyond anecdotal advice. The scale of the dataset is a positive feature for an empirical measurement study in software engineering.
major comments (2)
- [Abstract] Abstract: The central claim of quantifying 'real-world impact' from the >12,000 data points rests on the unvalidated assumption that scripted UI tests on selected devices and apps reproduce the statistical distribution of actual user sessions (timing, content, background activity, network state). No description of validation, calibration against real traces, or sensitivity analysis to these factors is provided, which directly undermines generalizability of the reported energy deltas.
- [Abstract] Abstract: No information is given on the statistical methods used to aggregate or compare the energy measurements, the presence or absence of error bars, device models tested, exclusion criteria for data points, or controls for measurement artifacts. These omissions are load-bearing because the headline trade-off quantifications cannot be assessed for reliability or reproducibility without them.
minor comments (1)
- [Abstract] The abstract lists the apps and parameters but does not specify the exact number of devices, operating system versions, or total test duration, which would aid clarity.
Simulated Author's Rebuttal
We thank the referee for their detailed and constructive feedback. We address each major comment below, indicating revisions where appropriate while being transparent about the scope of changes possible in this revision.
read point-by-point responses
-
Referee: [Abstract] Abstract: The central claim of quantifying 'real-world impact' from the >12,000 data points rests on the unvalidated assumption that scripted UI tests on selected devices and apps reproduce the statistical distribution of actual user sessions (timing, content, background activity, network state). No description of validation, calibration against real traces, or sensitivity analysis to these factors is provided, which directly undermines generalizability of the reported energy deltas.
Authors: We agree that the manuscript does not include direct validation or calibration of the scripted tests against real-world user traces. The usage patterns were derived from publicly documented typical behaviors for the selected apps rather than empirical user logs. In the revised manuscript we have added a new 'Limitations and Assumptions' subsection that explicitly describes the simulation design choices, acknowledges the lack of trace-based calibration, and discusses implications for generalizability. We have also incorporated a sensitivity analysis varying interaction timing and content size within plausible ranges, with results now reported in Section 4.3. These additions improve transparency without claiming broader validation. revision: partial
-
Referee: [Abstract] Abstract: No information is given on the statistical methods used to aggregate or compare the energy measurements, the presence or absence of error bars, device models tested, exclusion criteria for data points, or controls for measurement artifacts. These omissions are load-bearing because the headline trade-off quantifications cannot be assessed for reliability or reproducibility without them.
Authors: We acknowledge that the original presentation of these details was insufficient. The full manuscript already specifies the two device models (Samsung Galaxy S21 and Google Pixel 6) and basic aggregation as means, but we agree that statistical procedures, error reporting, exclusion rules, and artifact controls require expansion. The revised Methods section now includes: (i) explicit use of paired t-tests and ANOVA for comparisons with p-value thresholds, (ii) error bars defined as standard error of the mean, (iii) exclusion criteria (runs discarded if network latency exceeded 200 ms or temperature rose above 40 °C), and (iv) controls (fixed lab environment, disabled background services, and repeated measurements under identical conditions). These changes are incorporated in the updated version. revision: yes
- Direct empirical validation or calibration of the scripted UI tests against real user session distributions from diverse populations, which would require a separate large-scale user study and trace collection not feasible within the current revision.
Circularity Check
Purely empirical measurement study with no derivations or self-referential reductions
full rationale
The paper performs an empirical analysis by running automated UI tests on selected apps and devices to collect over 12,000 data points on energy consumption under varying user settings. It then reports observed differences and trade-offs directly from those measurements. No equations, fitted parameters, predictions, or first-principles derivations appear in the abstract or described methodology; results are not obtained by reducing any quantity to a prior input or self-citation. The central claims rest on external data collection rather than any definitional or fitted loop, making the work self-contained as a measurement study.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Automated UI tests on selected apps accurately represent real-world energy consumption patterns
Reference graph
Works this paper leans on
-
[1]
A. Javed et al. Energy consumption in mobile phones.International Journal of Computer Network and Information Security, 9(12):18–28, 2017
work page 2017
-
[2]
A. Almasri, T. Y. El-Kour, L. Silva, and Y. Abdulfattah. Evaluating the energy efficiency of popular us smartphone health care apps: Comparative analysis study toward sustainable health and nutrition apps practices.JMIR Human Factors, 11:e58311, 2024
work page 2024
-
[3]
T. Zaragoza, T. Soulancé, A. Noureddine, and E. Exposito. Understanding and influencing end-user behavior in software energy consumption. InProceedings of the 29th International Conference on Evaluation and Assessment in Software Engineering (EASE 2025), Istanbul, Turkey, June 2025. ACM
work page 2025
-
[4]
W. Oliveira, B. Moraes, F. Castor, and J. P. Fernandes. Analyzing the resource usage overhead of mobile app development frameworks. InProceedings of the 27th International Conference on Evaluation and Assessment in Software Engineering, pages 152–161, 2023
work page 2023
-
[5]
D. Flores-Martin, S. Laso, and J. L. Herrera. Enhancing smartphone battery life: A deep learning model based on user-specific application and network behavior. Electronics, 13(24):4897, 2024
work page 2024
-
[6]
A. Schuler and G. Kotsis. A systematic review on techniques and approaches to es- timate mobile software energy consumption.Sustainable Computing: Informatics and Systems, 41:100919, 2024
work page 2024
-
[7]
W. Oliveira, B. Moraes, F. Castor, and J. P. Fernandes. Ebserver: Automating resource-usage data collection of android applications. In2023 IEEE/ACM 10th In- ternational Conference on Mobile Software Engineering and Systems (MOBILESoft), pages 55–59. IEEE, 2023
work page 2023
-
[8]
Monsoon high voltage power monitor
Monsoon. Monsoon high voltage power monitor. https://www.msoon.com/ online-store, 2025. Accessed: 2026-04-27
work page 2025
-
[9]
E. Guégain, R. Raes, N. Chachignot, C. Quinton, and R. Rouvoy. Androwatts: Unpacking the power consumption of mobile device’s components. In2025 IEEE/ACM 12th International Conference on Mobile Software Engineering and Systems (MOBILESoft), pages 71–81. IEEE, 2025
work page 2025
-
[10]
P. Dash and Y. C. Hu. How much battery does dark mode save? an accurate oled display power profiler for modern smartphones. InProceedings of the 19th Annual International Conference on Mobile Systems, Applications, and Services, pages 323–335, 2021
work page 2021
-
[11]
R. Rua, M. Couto, and J. Saraiva. Greensource: a large-scale collection of android code, tests and energy metrics. In2019 IEEE/ACM 16th International Conference on Mining Software Repositories (MSR), pages 176–180. IEEE, 2019
work page 2019
- [12]
-
[13]
D. Ferreira, A. K. Dey, and V. Kostakos. Understanding human-smartphone con- cerns: a study of battery life. InInternational Conference on Pervasive Computing, pages 19–33. Springer, 2011
work page 2011
-
[14]
S. Lee, D. R. Jeong, J. Choi, J. Kwak, S. Son, J. Y. Song, and I. Shin. Serenus: Alleviating low-battery anxiety through real-time, accurate, and user-friendly energy consumption prediction of mobile applications. InProceedings of the 37th Annual ACM Symposium on User Interface Software and Technology, pages 1–20, 2024
work page 2024
- [15]
- [16]
- [17]
- [18]
-
[19]
R. Kazantsev and D. Vatolin. Power consumption of video-decoders on various android devices. InProceedings of the 2021 Picture Coding Symposium (PCS), pages 1–5, 2021
work page 2021
- [20]
- [21]
-
[22]
E. Monteiro, H. Cavalcante, R. Barreto, and R. de Freitas. Analysis of energy consumption on android devices for developers: A systematic mapping study. SBC Reviews on Computer Science, 3(1):1–18, 2023
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.