An Edge-Cloud Collaborative Architecture for Proactive Elderly Care: Real-Time Risk Assessment and Three-Level Emergency Response

Lijie Zhou; Luran Wang

arxiv: 2604.14154 · v1 · submitted 2026-03-20 · 📡 eess.SP · cs.AI· cs.CY

An Edge-Cloud Collaborative Architecture for Proactive Elderly Care: Real-Time Risk Assessment and Three-Level Emergency Response

Lijie Zhou , Luran Wang This is my paper

Pith reviewed 2026-05-15 08:52 UTC · model grok-4.3

classification 📡 eess.SP cs.AIcs.CY

keywords elderly careedge computingsensor fusionrisk assessmentfall detectionemergency responsemulti-modal monitoringactivity recognition

0 comments

The pith

Edge-cloud fusion fuses multi-modal sensors for real-time elderly risk scoring and three-tier alerts under three seconds.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes moving elderly monitoring computation to the edge to combine data from multiple sensors into a single risk score. A five-layer design keeps raw data local, computes a four-dimensional risk value from fall likelihood, physiology, behavior, and anomalies, and triggers family, doctor, or volunteer responses based on dynamic thresholds. This setup delivers end-to-end alerts in under three seconds on low-cost hardware while outperforming single-sensor baselines on three public datasets. The approach directly tackles latency and privacy problems that limit cloud-only elderly care systems. Experiments confirm 91 percent activity recognition accuracy and 84 percent anomaly detection F1-score.

Core claim

The architecture fuses five sensor types at the edge using a weighted multi-modal algorithm, produces a unified risk score from fall probability, physiological indicators, behavioral patterns, and sensor anomalies, and routes alerts through a three-level system that coordinates family, community doctors, and volunteers, achieving sub-three-second latency and higher accuracy than single-sensor methods on CASAS, MIMIC-III, and SisFall data.

What carries the argument

The five-layer edge-cloud architecture with weighted multi-modal fusion at the edge that integrates sensor data and computes a four-dimensional risk score to drive dynamic three-level emergency notifications.

If this is right

Real-time risk evaluation becomes possible without sending raw data to the cloud.
Privacy improves because only alerts and aggregated scores leave the edge gateway.
Three-tier responses scale notifications to match risk severity.
Sub-100 ms inference on Raspberry Pi shows the system can run on inexpensive hardware.
Accuracy gains over single-sensor methods hold across fall, activity, and vital-sign datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same edge layer could incorporate additional wearable or environmental sensors without changing the cloud side.
Dynamic thresholds might be made user-specific by adding a short personalization phase on first deployment.
The architecture could extend to other home health scenarios such as post-surgical monitoring or chronic disease alerts.
Latency budgets under three seconds open the door to closed-loop interventions like automatic light activation during detected falls.

Load-bearing premise

The weighted fusion rules and risk thresholds tuned on the three public datasets will deliver similar accuracy and low latency when used with real elderly people whose sensor streams contain more noise or different activity patterns.

What would settle it

A field deployment with actual elderly users wearing or living with the sensors, measuring end-to-end alert latency, false-positive rate, and response effectiveness against the dataset benchmarks.

Figures

Figures reproduced from arXiv: 2604.14154 by Lijie Zhou, Luran Wang.

**Figure 2.** Figure 2: Edge gateway processing pipeline showing the flow from sensor data [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Multi-database architecture showing PostgreSQL for business data, [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 4.** Figure 4: Four-dimensional risk scoring model showing the integration of fall [PITH_FULL_IMAGE:figures/full_fig_p008_4.png] view at source ↗

**Figure 5.** Figure 5: Dynamic threshold-based alert level determination showing the four [PITH_FULL_IMAGE:figures/full_fig_p008_5.png] view at source ↗

**Figure 7.** Figure 7: Web dashboard interface showing real-time health monitoring, alert [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗

**Figure 8.** Figure 8: ROC curve for the four-dimensional risk scoring model showing AUC [PITH_FULL_IMAGE:figures/full_fig_p010_8.png] view at source ↗

read the original abstract

The rapid aging of global populations has created an urgent need for intelligent healthcare monitoring systems to ensure the safety of elderly individuals living independently. Existing cloud-centric platforms face critical limitations, including high latency unsuitable for emergency response, privacy risks from continuous transmission of sensitive data, and limited, single-channel alert mechanisms lacking scalability and context awareness. This paper proposes an edge-cloud collaborative architecture that addresses these challenges through real-time multi-modal sensor fusion, a four-dimensional risk assessment model, and a three-level emergency response system. The framework adopts a five-layer design - device, edge, service, data, and application layers - enabling real-time risk evaluation with end-to-end alert latency under three seconds. At the edge, a weighted multi-modal fusion algorithm integrates data from five sensor types with confidence propagation. A unified risk score is generated by combining fall probability, physiological indicators, behavioral patterns, and sensor anomaly metrics. Based on dynamic thresholds, a three-tier notification system coordinates responses among family members, community doctors, and nearby volunteers. Experiments on CASAS, MIMIC-III, and SisFall datasets show that the approach achieves 91% activity recognition accuracy and an 84% anomaly detection F1-score, outperforming single-sensor methods. Deployment on Raspberry Pi 4 gateways demonstrates sub-100 ms inference latency while preserving privacy by keeping raw data local. This architecture advances practical, privacy-preserving, and responsive elderly care systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper combines standard edge techniques into a five-layer elderly monitoring system with reported 91% accuracy on public datasets and fast hardware runs, but real-world generalization from clean data to noisy homes is untested.

read the letter

The main point is a practical edge-cloud setup for elderly care that fuses multiple sensors at the edge, computes a four-dimensional risk score, and triggers three levels of alerts while keeping raw data local. It reports concrete numbers from CASAS, MIMIC-III, and SisFall plus a Raspberry Pi demo, which is useful for applied work even if the pieces are not brand new on their own.

Referee Report

3 major / 2 minor

Summary. The manuscript proposes an edge-cloud collaborative architecture for proactive elderly care using a five-layer design (device, edge, service, data, application). At the edge, a weighted multi-modal fusion algorithm integrates five sensor types with confidence propagation to produce a unified four-dimensional risk score (fall probability, physiological indicators, behavioral patterns, sensor anomalies). Dynamic thresholds trigger a three-level emergency response coordinating family, community doctors, and volunteers. Experiments on CASAS, MIMIC-III, and SisFall datasets report 91% activity recognition accuracy and 84% anomaly detection F1-score, outperforming single-sensor baselines, with sub-100 ms inference latency on Raspberry Pi 4 and end-to-end alert latency under 3 seconds while keeping raw data local.

Significance. If the performance and generalization claims hold, the architecture provides a concrete, deployable template for privacy-preserving, low-latency elderly monitoring that scales response coordination beyond single-channel alerts. The explicit hardware measurements and multi-dataset evaluation strengthen its practical relevance over purely theoretical edge-computing proposals.

major comments (3)

[Experiments] Experiments section: the headline metrics (91% activity recognition accuracy, 84% anomaly F1) are reported without train/test split details, cross-validation procedure, or statistical significance tests against the single-sensor baselines, preventing assessment of whether the claimed outperformance is robust.
[Risk Assessment Model] Risk assessment and threshold selection: the dynamic risk thresholds and fusion weights are stated to be tuned on the three public datasets, yet no sensitivity analysis or ablation under added noise, missing samples, or sensor drift is provided; this directly bears on the weakest assumption that sub-3 s latency and three-level response reliability will transfer to real elderly home deployments.
[System Deployment] Deployment evaluation: sub-100 ms inference is shown on Raspberry Pi 4, but the manuscript does not report end-to-end latency measurements that include network handoff to the cloud layer or the three-level notification pipeline, leaving the central <3 s claim unsupported by the presented hardware results.

minor comments (2)

[Introduction] The abstract and introduction use 'four-dimensional risk assessment' and 'unified risk score' interchangeably; a single consistent term and a brief equation or pseudocode for the combination step would improve clarity.
[System Architecture] Figure captions for the architecture diagram should explicitly label the five layers and the data flow between edge and cloud to match the textual description.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive and detailed comments, which help strengthen the manuscript. We address each major point below and will revise the paper to improve reproducibility, robustness analysis, and evaluation completeness.

read point-by-point responses

Referee: [Experiments] Experiments section: the headline metrics (91% activity recognition accuracy, 84% anomaly F1) are reported without train/test split details, cross-validation procedure, or statistical significance tests against the single-sensor baselines, preventing assessment of whether the claimed outperformance is robust.

Authors: We agree that these methodological details are necessary for assessing robustness. In the revised manuscript we will expand the Experiments section to explicitly report: (i) the train/test split ratios and random seeds used for CASAS, MIMIC-III, and SisFall; (ii) the cross-validation scheme (5-fold stratified CV); and (iii) statistical significance tests (paired t-tests with p-values) comparing the multi-modal fusion against each single-sensor baseline. These additions will directly substantiate the reported 91% accuracy and 84% F1-score. revision: yes
Referee: [Risk Assessment Model] Risk assessment and threshold selection: the dynamic risk thresholds and fusion weights are stated to be tuned on the three public datasets, yet no sensitivity analysis or ablation under added noise, missing samples, or sensor drift is provided; this directly bears on the weakest assumption that sub-3 s latency and three-level response reliability will transfer to real elderly home deployments.

Authors: The referee correctly identifies a gap in demonstrating real-world robustness. We will add a dedicated sensitivity-analysis subsection that includes ablation experiments under Gaussian noise, random missing samples (10–30%), and simulated sensor drift. The results will quantify degradation in the four-dimensional risk score and threshold stability, thereby supporting the transferability of the three-level response mechanism. revision: yes
Referee: [System Deployment] Deployment evaluation: sub-100 ms inference is shown on Raspberry Pi 4, but the manuscript does not report end-to-end latency measurements that include network handoff to the cloud layer or the three-level notification pipeline, leaving the central <3 s claim unsupported by the presented hardware results.

Authors: We acknowledge that the current hardware results focus on edge inference only. In the revision we will augment the deployment evaluation with end-to-end timing that incorporates simulated network handoff (typical home-to-cloud delays) and measured dispatch times for the three-level notification pipeline. This will provide concrete support for the sub-3-second claim while noting that exhaustive field trials under variable network conditions remain future work. revision: partial

Circularity Check

0 steps flagged

No circularity in architecture description or dataset evaluation

full rationale

The paper describes a five-layer edge-cloud system, weighted multi-modal fusion, four-dimensional risk model, and dynamic thresholds. Reported metrics (91% activity recognition, 84% anomaly F1) are direct empirical results from experiments on the external public datasets CASAS, MIMIC-III, and SisFall. No equations reduce the unified risk score or accuracy figures to a parameter defined in terms of itself. Fusion weights and thresholds are tuned on the evaluation sets, but the paper presents these as measured performance rather than a self-referential prediction. No load-bearing self-citations or uniqueness theorems are invoked. The derivation chain is self-contained and externally benchmarked.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The architecture depends on standard assumptions about sensor reliability and a small number of fitted parameters for fusion weights and risk thresholds; no new physical entities are introduced.

free parameters (2)

sensor fusion weights
Weights used in the weighted multi-modal fusion algorithm are chosen or tuned to combine the five sensor types.
dynamic risk thresholds
Thresholds that define the three emergency response levels are dynamic and therefore fitted or calibrated on data.

axioms (1)

domain assumption Multi-modal sensor data from the five types reliably reflect fall risk, physiological state, behavior, and anomalies
Invoked when the four-dimensional risk score is formed from fall probability, physiological indicators, behavioral patterns, and sensor anomaly metrics.

pith-pipeline@v0.9.0 · 5558 in / 1402 out tokens · 47416 ms · 2026-05-15T08:52:36.787521+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

12 extracted references · 12 canonical work pages

[1]

Edge computing: Vision and challenges,

W. Shi, J. Cao, Q. Zhang, Y . Li, and L. Xu, “Edge computing: Vision and challenges,”IEEE Internet of Things Journal, vol. 3, no. 5, pp. 637–646, 2016

work page 2016
[2]

Multisensor data fusion: A review of the state-of-the-art,

B. Khaleghi, A. Khamis, F. O. Karray, and S. N. Razavi, “Multisensor data fusion: A review of the state-of-the-art,”Information Fusion, vol. 14, no. 1, pp. 28–44, 2013

work page 2013
[3]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,”Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997

work page 1997
[4]

Focal loss for dense object detection,

T.-Y . Lin, P. Goyal, R. Girshick, K. He, and P. Doll ´ar, “Focal loss for dense object detection,” inProceedings of the IEEE International Conference on Computer Vision, 2017, pp. 2980–2988

work page 2017
[5]

A multi-modal cnn-lstm framework with multi- head attention and focal loss for real-time elderly fall detection,

L. Zhou and L. Wang, “A multi-modal cnn-lstm framework with multi- head attention and focal loss for real-time elderly fall detection,” ResearchGate, 2025

work page 2025
[6]

Fastapi framework,

S. Ram ´ırez, “Fastapi framework,” https://fastapi.tiangolo.com/, 2019

work page 2019
[7]

Influxdb documentation,

InfluxData Inc., “Influxdb documentation,” https://docs.influxdata.com/, 2023

work page 2023
[8]

Mqtt-s—a pub- lish/subscribe protocol for wireless sensor networks,

U. Hunkeler, H. L. Truong, and A. Stanford-Clark, “Mqtt-s—a pub- lish/subscribe protocol for wireless sensor networks,” in2008 3rd International Conference on Communication Systems Software and Middleware and Workshops (COMSWARE). IEEE, 2008, pp. 791–798

work page 2008
[9]

You, “Vue.js,” https://vuejs.org/, 2014

E. You, “Vue.js,” https://vuejs.org/, 2014

work page 2014
[10]

Sisfall: A fall and movement dataset,

A. Sucerquia, J. D. L ´opez, and J. F. Vargas-Bonilla, “Sisfall: A fall and movement dataset,”Sensors, vol. 17, no. 1, p. 198, 2017

work page 2017
[11]

Casas: A smart home in a box,

D. J. Cook, A. S. Crandall, B. L. Thomas, and N. C. Krishnan, “Casas: A smart home in a box,”Computer, vol. 46, no. 7, pp. 62–69, 2013

work page 2013
[12]

Mimic-iii, a freely accessible critical care database,

A. E. Johnson, T. J. Pollard, L. Shen, L.-w. H. Lehman, M. Feng, M. Ghassemi, B. Moody, P. Szolovits, L. A. Celi, and R. G. Mark, “Mimic-iii, a freely accessible critical care database,”Scientific Data, vol. 3, no. 1, pp. 1–9, 2016

work page 2016

[1] [1]

Edge computing: Vision and challenges,

W. Shi, J. Cao, Q. Zhang, Y . Li, and L. Xu, “Edge computing: Vision and challenges,”IEEE Internet of Things Journal, vol. 3, no. 5, pp. 637–646, 2016

work page 2016

[2] [2]

Multisensor data fusion: A review of the state-of-the-art,

B. Khaleghi, A. Khamis, F. O. Karray, and S. N. Razavi, “Multisensor data fusion: A review of the state-of-the-art,”Information Fusion, vol. 14, no. 1, pp. 28–44, 2013

work page 2013

[3] [3]

Long short-term memory,

S. Hochreiter and J. Schmidhuber, “Long short-term memory,”Neural Computation, vol. 9, no. 8, pp. 1735–1780, 1997

work page 1997

[4] [4]

Focal loss for dense object detection,

T.-Y . Lin, P. Goyal, R. Girshick, K. He, and P. Doll ´ar, “Focal loss for dense object detection,” inProceedings of the IEEE International Conference on Computer Vision, 2017, pp. 2980–2988

work page 2017

[5] [5]

A multi-modal cnn-lstm framework with multi- head attention and focal loss for real-time elderly fall detection,

L. Zhou and L. Wang, “A multi-modal cnn-lstm framework with multi- head attention and focal loss for real-time elderly fall detection,” ResearchGate, 2025

work page 2025

[6] [6]

Fastapi framework,

S. Ram ´ırez, “Fastapi framework,” https://fastapi.tiangolo.com/, 2019

work page 2019

[7] [7]

Influxdb documentation,

InfluxData Inc., “Influxdb documentation,” https://docs.influxdata.com/, 2023

work page 2023

[8] [8]

Mqtt-s—a pub- lish/subscribe protocol for wireless sensor networks,

U. Hunkeler, H. L. Truong, and A. Stanford-Clark, “Mqtt-s—a pub- lish/subscribe protocol for wireless sensor networks,” in2008 3rd International Conference on Communication Systems Software and Middleware and Workshops (COMSWARE). IEEE, 2008, pp. 791–798

work page 2008

[9] [9]

You, “Vue.js,” https://vuejs.org/, 2014

E. You, “Vue.js,” https://vuejs.org/, 2014

work page 2014

[10] [10]

Sisfall: A fall and movement dataset,

A. Sucerquia, J. D. L ´opez, and J. F. Vargas-Bonilla, “Sisfall: A fall and movement dataset,”Sensors, vol. 17, no. 1, p. 198, 2017

work page 2017

[11] [11]

Casas: A smart home in a box,

D. J. Cook, A. S. Crandall, B. L. Thomas, and N. C. Krishnan, “Casas: A smart home in a box,”Computer, vol. 46, no. 7, pp. 62–69, 2013

work page 2013

[12] [12]

Mimic-iii, a freely accessible critical care database,

A. E. Johnson, T. J. Pollard, L. Shen, L.-w. H. Lehman, M. Feng, M. Ghassemi, B. Moody, P. Szolovits, L. A. Celi, and R. G. Mark, “Mimic-iii, a freely accessible critical care database,”Scientific Data, vol. 3, no. 1, pp. 1–9, 2016

work page 2016