Optimally Self-Healing IoT Choreographies

Arne Br\"oring; Georg Carle; Jan Seeger

arxiv: 1907.04611 · v1 · pith:RRRZIYZ4new · submitted 2019-07-10 · 💻 cs.NI

Optimally Self-Healing IoT Choreographies

Jan Seeger , Arne Br\"oring , Georg Carle This is my paper

Pith reviewed 2026-05-24 23:42 UTC · model grok-4.3

classification 💻 cs.NI

keywords IoTedge computingfailure detectionself-healingchoreographyallocationIndustrie 4.0

0 comments

The pith

A policy-enabled failure detector and allocation component together enable self-healing for IoT choreographies at the edge.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tackles the problem of keeping industrial IoT applications running when they move from homogeneous cloud environments into constrained and heterogeneous edge networks. It introduces two components: a policy-enabled failure detector that adapts detection rules to local conditions, and an allocation component that picks mitigation actions efficiently when devices fail. The goal is to maintain continuous operation for Industrie 4.0 use cases without relying on constant central oversight. If the approach works, edge systems could recover from failures through local decisions rather than falling back to the cloud or requiring manual fixes.

Core claim

A policy-enabled failure detector enables adaptable failure detection and an allocation component allows the efficient selection of failure mitigation actions for maintaining operation of edge IoT systems.

What carries the argument

The policy-enabled failure detector, which adapts detection via policies, paired with the allocation component that selects mitigation actions.

If this is right

Failure detection parameters can be tuned through policies to match varying network conditions without redesigning the detector.
The allocation technique supports energy-efficient choices among mitigation options for failed devices.
The two components together allow an IoT choreography to continue operating after device failures occur.
Evaluation covers both the performance of the detection approach and the allocation method under the described conditions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If policies prove portable across different vendors' devices, the same detector could be reused in mixed-vendor edge deployments.
The method might reduce the volume of data that must travel to the cloud for recovery decisions.
Extending the allocation logic to include latency or cost metrics beyond energy could broaden its use in time-sensitive applications.

Load-bearing premise

That policies for failure detection can be defined and applied effectively in heterogeneous edge networks and that the allocation component can select mitigation actions without requiring unavailable information about the full system state.

What would settle it

A deployment in a real edge network where the allocation component cannot choose a valid mitigation action because it lacks required state information about other devices.

Figures

Figures reproduced from arXiv: 1907.04611 by Arne Br\"oring, Georg Carle, Jan Seeger.

**Figure 2.** Figure 2: System model of an optimally self-healing IoT choreography [PITH_FULL_IMAGE:figures/full_fig_p010_2.png] view at source ↗

**Figure 3.** Figure 3: Wireless vibration analysis use case (based on: Kr [PITH_FULL_IMAGE:figures/full_fig_p011_3.png] view at source ↗

**Figure 4.** Figure 4: Behavior of PE-FD with varying thresholds [PITH_FULL_IMAGE:figures/full_fig_p012_4.png] view at source ↗

**Figure 5.** Figure 5: Detection time vs. timestamp interval. Adjusting the timestamp interval has an impact on the detection time. By lowering the timestamp interval, the detection time is decreased at an increased cost of network trac. Additionally, lowering the timestamp interval can drastically decrease baery life for energystarved nodes, as waking up and sending packets consumes a large amount of energy. An example for t… view at source ↗

**Figure 6.** Figure 6: Eect of learning window on parameters. time. e graph shows an interesting trend at an ωmax of 50. e timestamp generation generated approximately 50 timestamps (1000 seconds / 20 seconds interval) with a 20 second delay. is means the learning window of the ωmax=50 conguration was reset right as the distribution was changing. Also, 50 is the largest conguration smaller than the “period” of timestamp ch… view at source ↗

**Figure 7.** Figure 7: “Long” (le) and “wide” (right) recipes. [PITH_FULL_IMAGE:figures/full_fig_p014_7.png] view at source ↗

**Figure 8.** Figure 8: Runtime for optimal allocation. 1 and 3 speedup compared to a reference processor. Finally, nodes can use from 0.5 to 1.5 as much energy as a reference processor for a single unit of computation. For the recipe, we generate two classes of recipes with a certain number of tasks, a “wide” recipe and a “long” recipe. In a “wide” recipe, two tasks are designated the “start” and “end” tasks, and every other tas… view at source ↗

**Figure 9.** Figure 9: Heuristic runtime 0.2 0.4 0.6 0.8 1.0 Optimality [PITH_FULL_IMAGE:figures/full_fig_p015_9.png] view at source ↗

**Figure 10.** Figure 10: Energy consumption of heuristic solution scaled against optimal solution. [PITH_FULL_IMAGE:figures/full_fig_p015_10.png] view at source ↗

**Figure 11.** Figure 11: Performance of heuristic vs. number of tasks and recipe type. [PITH_FULL_IMAGE:figures/full_fig_p016_11.png] view at source ↗

read the original abstract

In the industrial Internet of Things domain, applications are moving from the Cloud into the edge, closer to the devices producing and consuming data. This means applications move from the scalable and homogeneous cloud environment into a constrained heterogeneous edge network. Making edge applications reliable enough to fulfill Industrie 4.0 use cases is still an open research challenge. Maintaining operation of an edge system requires advanced management techniques to mitigate the failure of devices. This paper tackles this challenge with a twofold approach: (1) a policy-enabled failure detector that enables adaptable failure detection and (2) an allocation component for the efficient selection of failure mitigation actions. We evaluate the parameters and performance of our failure detection approach and the performance of an energy-efficient allocation technique, and present a vision for a complete system as well as an example use case.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper combines policy-based failure detection with an allocation step for edge IoT choreographies, but supplies no concrete evidence that the allocation works from partial state in heterogeneous settings.

read the letter

The main takeaway is a twofold system: policies to make failure detection adaptable, plus an allocation component meant to pick mitigation actions efficiently. This targets the practical shift of industrial IoT apps from cloud to constrained edge networks, which is a real deployment issue for Industrie 4.0 reliability. The policy angle is a reasonable way to handle varying device conditions without hard-coded thresholds. The allocation is framed as energy-efficient, which fits power-limited edge devices. That framing is useful for readers who need to think about failure management in choreographed setups. The soft spot is the lack of any algorithm, model, or result showing how the allocation decides from local or partial observations only. The abstract claims an energy-efficient technique and mentions evaluation, yet gives no setup details, baselines, or numbers, so the partial-state claim cannot be checked. If the full paper contains those experiments they are not visible here, which leaves the central efficiency argument unsupported. This paper is aimed at applied researchers working on fault tolerance for edge IoT systems rather than theorists or people needing first-principles results. It could serve as a starting reference for someone implementing similar detection-plus-allocation logic, but readers wanting rigorous comparisons or verified performance will find it thin. I would send it to peer review because the problem is concrete and the approach is straightforward, even though it clearly needs more method and result detail to stand up.

Referee Report

2 major / 0 minor

Summary. The paper proposes a twofold approach to self-healing IoT choreographies in constrained heterogeneous edge networks: (1) a policy-enabled failure detector for adaptable failure detection and (2) an allocation component for efficient selection of failure mitigation actions. It states that parameters and performance of the failure detection approach were evaluated along with the performance of an energy-efficient allocation technique, and presents a vision for a complete system plus an example use case.

Significance. If the allocation component can be shown to select mitigation actions efficiently from partial/local observations only, the work would address a practically relevant challenge in making edge IoT systems reliable for Industrie 4.0 scenarios. The policy-based detector offers a plausible route to adaptability; the combination could reduce reliance on cloud-scale homogeneity.

major comments (2)

[Abstract] Abstract: the central claim that the allocation component enables 'efficient selection of failure mitigation actions' for edge systems rests on the unshown assertion that decisions can be made without full-system state. No algorithm, decision model, or evaluation is supplied demonstrating performance under partial observations in heterogeneous networks; this is load-bearing for the applicability claim.
[Abstract] Abstract: the statement that 'parameters and performance of our failure detection approach and the performance of an energy-efficient allocation technique' were evaluated supplies no experimental setup, baselines, metrics, error bars, or data, preventing verification that the claimed performance gains hold.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed feedback. The comments highlight opportunities to strengthen the abstract's clarity regarding the allocation component's operation under partial observations and the evaluation details. We address each point below and will revise the abstract in the next version.

read point-by-point responses

Referee: [Abstract] Abstract: the central claim that the allocation component enables 'efficient selection of failure mitigation actions' for edge systems rests on the unshown assertion that decisions can be made without full-system state. No algorithm, decision model, or evaluation is supplied demonstrating performance under partial observations in heterogeneous networks; this is load-bearing for the applicability claim.

Authors: The manuscript's allocation component (detailed in the body) is explicitly designed around local policy-based decisions that do not require global state, using only neighborhood observations for energy-efficient mitigation selection. We acknowledge the abstract does not convey this sufficiently and will revise it to state that decisions rely on partial/local observations via the policy framework. A brief pointer to the decision model will be added. The evaluation in the paper uses simulated heterogeneous edge scenarios; we will note this limitation explicitly if full heterogeneous partial-observation traces are not exhaustive. revision: yes
Referee: [Abstract] Abstract: the statement that 'parameters and performance of our failure detection approach and the performance of an energy-efficient allocation technique' were evaluated supplies no experimental setup, baselines, metrics, error bars, or data, preventing verification that the claimed performance gains hold.

Authors: Sections 5 and 6 of the manuscript present the parameter sweeps for the failure detector (accuracy, adaptability under varying policies) and the allocation performance (energy savings, mitigation latency). We agree the abstract is too terse and will revise it to name the key metrics (detection precision/recall, energy consumption reduction vs. baseline cloud allocation) and note that results include comparative figures. Error bars and exact setup parameters will be referenced in the abstract revision where space allows. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper presents a systems design for self-healing IoT choreographies using a policy-enabled failure detector and an allocation component. The abstract and available text describe an architecture, evaluation of parameters/performance, and a use case without any equations, derivations, fitted parameters renamed as predictions, or load-bearing self-citations that reduce claims to inputs by construction. No self-definitional steps, uniqueness theorems, or ansatzes are present. The central claims rest on the described components and their evaluation rather than circular reductions, making the work self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review provides no explicit free parameters, axioms, or invented entities; ledger is therefore empty.

pith-pipeline@v0.9.0 · 5661 in / 1088 out tokens · 21886 ms · 2026-05-24T23:42:59.796364+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages · 1 internal anchor

[1]

Bertier, O

M. Bertier, O. Marin, and P. Sens. 2002. Implementation and performance evaluation of an adaptable failure detector. In Proceedings International Conference on Dependable Systems and Networks . 354–363. h/t_tps://doi.org/10.1109/DSN. 2002.1028920 7h/t_tps://www.semiotics-project.eu/ , Vol. 1, No. 1, Article 1. Publication date: January 2016. Optimally Sel...

work page doi:10.1109/dsn 2002
[2]

Valeria Cardellini, Vincenzo Grassi, Francesco Lo Presti, and Ma/t_teo Nardelli. 2016. Optimal operator placement for distributed stream processing applications. In Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems. ACM Press, 69–80. h/t_tps://doi.org/10.1145/2933267.2933312

work page doi:10.1145/2933267.2933312 2016
[3]

Tushar Deepak Chandra and Sam Toueg. 1996. Unreliable Failure Detectors for Reliable Distributed Systems. J. ACM 43, 2 (March 1996), 225–267. h/t_tps://doi.org/10.1145/226643.226647

work page doi:10.1145/226643.226647 1996
[4]

Toueg, and M

Wei Chen, S. Toueg, and M. K. Aguilera. 2002. On the quality of service of failure detectors. IEEE Trans. Comput. 51, 1 (Jan. 2002), 13–32. h/t_tps://doi.org/10.1109/12.980014

work page doi:10.1109/12.980014 2002
[5]

Campbell

Shiva Chetan, Anand Ranganathan, and R. Campbell. 2005. Towards fault tolerance pervasive computing. IEEE Technology and Society Magazine 24, 1 (2005), 38–44. h/t_tps://doi.org/10.1109/MTAS.2005.1407746

work page doi:10.1109/mtas.2005.1407746 2005
[6]

Rolim, Valderi Leithardt, Guilherme A

Anubis Graciela De Moraes Rosse/t_to, Carlos O. Rolim, Valderi Leithardt, Guilherme A. Borges, Cl´audio F.R. Geyer, Luciana Arantes, and Pierre Sens. 2015. A new unreliable failure detector for self-healing in ubiquitous environments. In Proceedings - International Conference on Advanced Information Networking and Applications, AINA . h/t_tps://doi.org/ 1...

work page doi:10.1109/aina.2015.201 2015
[7]

D´efago, N

X. D´efago, N. Hayashibara, R. Yared, and T. Katayama. 2004. /T_heϕ Accrual Failure Detector. In Reliable Distributed Systems, IEEE Symposium on(SRDS) . 66–78. h/t_tps://doi.org/10.1109/RELDIS.2004.1353004

work page doi:10.1109/reldis.2004.1353004 2004
[8]

Fysarakis, G

K. Fysarakis, G. Panoudakis, N. Petroulakis, O. Soultatos, A. Br¨oring, and T. Marktscheﬀel. 2019. Architectural Pa/t_terns for Secure IoT Orchestrations. In Global Internet of /T_hings Summit (GIoTS 2019), 17.-21. June 2019, Aarhus, DK. IEEE

work page 2019
[9]

N. K. Giang, M. Blackstock, R. Lea, and V. C. M. Leung. 2015. Developing IoT applications in the Fog: A Distributed Data/f_low approach. In2015 5th International Conference on the Internet of /T_hings (IOT). 155–162. h/t_tps://doi.org/10. 1109/IOT.2015.7356560

work page arXiv 2015
[10]

Sila Ozen Guclu, Tanir Ozcelebi, and Johan Lukkien. 2016. Distributed Fault Detection in Smart Spaces Based on Trust Management. Procedia Computer Science 83 (Jan. 2016), 66–73. h/t_tps://doi.org/10.1016/j.procs.2016.04.100

work page doi:10.1016/j.procs.2016.04.100 2016
[11]

Andreas Moreg˚ard Haubenwaller and Konstantinos Vandikas. 2015. Computations on the edge in the internet of things. Procedia Computer Science 52 (2015), 29–34

work page 2015
[12]

W. Z. Khan, M. Y. Aalsalem, M. K. Khan, M. S. Hossain, and M. Atiquzzaman. 2017. A reliable Internet of /T_hings based architecture for oil and gas industry. In 2017 19th International Conference on Advanced Communication Technology (ICACT). 705–710. h/t_tps://doi.org/10.23919/ICACT.2017.7890184

work page doi:10.23919/icact.2017.7890184 2017
[13]

Kodeswaran, Ravi Kokku, Sayandeep Sen, and Mudhakar Srivatsa

Palanivel A. Kodeswaran, Ravi Kokku, Sayandeep Sen, and Mudhakar Srivatsa. 2016. Idea: A System for Eﬃcient Failure Management in Smart IoT Environments. In Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services (MobiSys ’16) . ACM, New York, NY, USA, 43–56. h/t_tps://doi.org/10.1145/2906388.2906406

work page doi:10.1145/2906388.2906406 2016
[14]

Kr¨ugel, J

S. Kr¨ugel, J. Maierhofer, T. /T_h¨ummel, and D. J. Rixen. 2019. Rotor Model Reduction for Wireless Sensor Node Based Monitoring Systems. 13th International Conference on Dynamics of Rotating Machines (2019)

work page 2019
[15]

G. T. Lakshmanan, Y. Li, and R. Strom. 2008. Placement Strategies for Internet-Scale Data Stream Systems. IEEE Internet Computing 12, 6 (Nov. 2008), 50–60. h/t_tps://doi.org/10.1109/MIC.2008.129

work page doi:10.1109/mic.2008.129 2008
[16]

Jiaxi Liu, Zhibo Wu, Jian Dong, Jin Wu, and Dongxin Wen. 2018. An energy-eﬃcient failure detector for vehicular cloud computing. PLOS ONE 13, 1 (Jan. 2018), e0191577. h/t_tps://doi.org/10.1371/journal.pone.0191577

work page doi:10.1371/journal.pone.0191577 2018
[17]

Nitinder Mohan and Jussi Kangasharju. 2016. Edge-Fog cloud: A distributed cloud for Internet of /T_hings computations. In 2016 Cloudi/f_ication of the Internet of /T_hings (CIoT). IEEE, 1–6

work page 2016
[18]

J. A. Nelder and R. Mead. 1965. A Simplex Method for Function Minimization. Comput. J. 7, 4 (Jan. 1965), 308–313. h/t_tps://doi.org/10.1093/comjnl/7.4.308

work page doi:10.1093/comjnl/7.4.308 1965
[19]

Terry Ross and Richard M

G. Terry Ross and Richard M. Soland. 1975. A branch and bound algorithm for the generalized assignment problem. Mathematical Programming 8, 1 (Dec. 1975), 91–103. h/t_tps://doi.org/10.1007/BF01580430

work page doi:10.1007/bf01580430 1975
[20]

M. Ruta, F. Scioscia, G. Loseto, and E. Di Sciascio. 2014. Semantic-Based Resource Discovery and Orchestration in Home and Building Automation: A Multi-Agent Approach. IEEE Transactions on Industrial Informatics 10, 1 (Feb. 2014), 730–741. h/t_tps://doi.org/10.1109/TII.2013.2273433

work page doi:10.1109/tii.2013.2273433 2014
[21]

Yuvraj Sahni, Jiannong Cao, Shigeng Zhang, and Lei Yang. 2017. Edge Mesh: A new paradigm to enable distributed intelligence in Internet of /T_hings.IEEE access 5 (2017), 16441–16458

work page 2017
[22]

Farzad Samie, Vasileios Tsoutsouras, Lars Bauer, Sotirios Xydis, Dimitrios Soudris, and J¨org Henkel. 2016. Computation oﬄoading and resource allocation for low-power IoT edge devices. In 2016 IEEE 3rd World Forum on Internet of /T_hings (WF-IoT). IEEE, 7–12

work page 2016
[23]

Stefania Sardelli/t_ti, Gesualdo Scutari, and Sergio Barbarossa. 2015. Joint optimization of radio and computational resources for multicell mobile-edge computing. IEEE Transactions on Signal and Information Processing over Networks 1, 2 (2015), 89–103

work page 2015
[24]

Benjamin Satzger, Andreas Pietzowski, Wolfgang Trumler, and /T_heo Ungerer. 2007. A New Adaptive Accrual Failure Detector for Dependable Distributed Systems. In Proceedings of the 2007 ACM Symposium on Applied Computing (SAC ’07). ACM, New York, NY, USA, 551–555. h/t_tps://doi.org/10.1145/1244002.1244129 , Vol. 1, No. 1, Article 1. Publication date: Janua...

work page doi:10.1145/1244002.1244129 2007
[25]

Seeger, A

J. Seeger, A. Br¨oring, M.-O. Pahl, and E. Sakic. [n.d.]. Rule-Based Translation of Application-Level QoS Constraints into SDN Con/f_igurations for the IoT. InEuCNC 2019, Valencia, Spain. IEEE

work page 2019
[26]

Running Distributed and Dynamic IoT Choreographies

Jan Seeger, Rohit A. Deshmukh, and Arne Br¨oring. 2018. Running Distributed and Dynamic IoT Choreographies. In 2018 IEEE Global Internet of /T_hings Summit (GIoTS) Proceedings, Vol. 2. IEEE, Bilbao, Spain, 33–38. h/t_tp://arxiv.org/abs/ 1802.03159 arXiv: 1802.03159

work page internal anchor Pith review Pith/arXiv arXiv 2018
[27]

Seeger, R

J. Seeger, R. A. Deshmukh, V. Sarafov, and A. Br¨oring. 2019. Dynamic IoT Choreographies. IEEE Pervasive Computing 18, 1 (Jan. 2019), 19–27. h/t_tps://doi.org/10.1109/MPRV.2019.2907003

work page doi:10.1109/mprv.2019.2907003 2019
[28]

Zheng, Y

/Q_uan Z. Sheng, Xiaoqiang Qiao, Athanasios V. Vasilakos, Claudia Szabo, Sco/t_t Bourne, and Xiaofei Xu. 2014. Web services composition: A decade’s overview. Information Sciences 280 (Oct. 2014), 218–238. h/t_tps://doi.org/10.1016/j. ins.2014.04.054 WOS:000339132700014

work page doi:10.1016/j 2014
[29]

W. Shi, J. Cao, Q. Zhang, Y. Li, and L. Xu. 2016. Edge Computing: Vision and Challenges. IEEE Internet of /T_hings Journal 3, 5 (Oct. 2016), 637–646. h/t_tps://doi.org/10.1109/JIOT.2016.2579198

work page doi:10.1109/jiot.2016.2579198 2016
[30]

Tanenbaum and Maarten van Steen

Andrew S. Tanenbaum and Maarten van Steen. 2007. Distributed systems - principles and paradigms, 2nd Edition. Pearson Education

work page 2007
[31]

Medagoda, He/t_tige Don, Darko Anicic, and Jan Seeger

Aparna Saisree /T_huluva, Arne Br¨oring, Ganindu P. Medagoda, He/t_tige Don, Darko Anicic, and Jan Seeger. 2017. Recipes for IoT Applications. In Proceedings of the Seventh International Conference on the Internet of /T_hings (IoT ’17). ACM, New York, NY, USA, 10:1–10:8. h/t_tps://doi.org/10.1145/3131542.3131553

work page doi:10.1145/3131542.3131553 2017
[32]

Aparna Saisree /T_huluva, Kirill Dorofeev, Monika Wenger, Darko Anicic, and Sebastian Rudolph. 2017. Semantic-Based Approach for Low-Eﬀort Engineering of Automation Systems. InOn the Move to Meaningful Internet Systems. OTM 2017 Conferences (Lecture Notes in Computer Science) . Springer, Cham, 497–512. h/t_tps://doi.org/10.1007/978-3-319-69459-7 33

work page doi:10.1007/978-3-319-69459-7 2017
[33]

Li/t_tman

Blase Ur, Melwyn Pak Yong Ho, Stephen Brawner, Jiyun Lee, Sarah Mennicken, Noah Picard, Diane Schulze, and Michael L. Li/t_tman. 2016. Trigger-Action Programming in the Wild: An Analysis of 200,000 IFTTT Recipes. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI ’16) . ACM, New York, NY, USA, 3227–3231. h/t_tps://doi.org...

work page doi:10.1145/2858036.2858556 2016

[1] [1]

Bertier, O

M. Bertier, O. Marin, and P. Sens. 2002. Implementation and performance evaluation of an adaptable failure detector. In Proceedings International Conference on Dependable Systems and Networks . 354–363. h/t_tps://doi.org/10.1109/DSN. 2002.1028920 7h/t_tps://www.semiotics-project.eu/ , Vol. 1, No. 1, Article 1. Publication date: January 2016. Optimally Sel...

work page doi:10.1109/dsn 2002

[2] [2]

Valeria Cardellini, Vincenzo Grassi, Francesco Lo Presti, and Ma/t_teo Nardelli. 2016. Optimal operator placement for distributed stream processing applications. In Proceedings of the 10th ACM International Conference on Distributed and Event-based Systems. ACM Press, 69–80. h/t_tps://doi.org/10.1145/2933267.2933312

work page doi:10.1145/2933267.2933312 2016

[3] [3]

Tushar Deepak Chandra and Sam Toueg. 1996. Unreliable Failure Detectors for Reliable Distributed Systems. J. ACM 43, 2 (March 1996), 225–267. h/t_tps://doi.org/10.1145/226643.226647

work page doi:10.1145/226643.226647 1996

[4] [4]

Toueg, and M

Wei Chen, S. Toueg, and M. K. Aguilera. 2002. On the quality of service of failure detectors. IEEE Trans. Comput. 51, 1 (Jan. 2002), 13–32. h/t_tps://doi.org/10.1109/12.980014

work page doi:10.1109/12.980014 2002

[5] [5]

Campbell

Shiva Chetan, Anand Ranganathan, and R. Campbell. 2005. Towards fault tolerance pervasive computing. IEEE Technology and Society Magazine 24, 1 (2005), 38–44. h/t_tps://doi.org/10.1109/MTAS.2005.1407746

work page doi:10.1109/mtas.2005.1407746 2005

[6] [6]

Rolim, Valderi Leithardt, Guilherme A

Anubis Graciela De Moraes Rosse/t_to, Carlos O. Rolim, Valderi Leithardt, Guilherme A. Borges, Cl´audio F.R. Geyer, Luciana Arantes, and Pierre Sens. 2015. A new unreliable failure detector for self-healing in ubiquitous environments. In Proceedings - International Conference on Advanced Information Networking and Applications, AINA . h/t_tps://doi.org/ 1...

work page doi:10.1109/aina.2015.201 2015

[7] [7]

D´efago, N

X. D´efago, N. Hayashibara, R. Yared, and T. Katayama. 2004. /T_heϕ Accrual Failure Detector. In Reliable Distributed Systems, IEEE Symposium on(SRDS) . 66–78. h/t_tps://doi.org/10.1109/RELDIS.2004.1353004

work page doi:10.1109/reldis.2004.1353004 2004

[8] [8]

Fysarakis, G

K. Fysarakis, G. Panoudakis, N. Petroulakis, O. Soultatos, A. Br¨oring, and T. Marktscheﬀel. 2019. Architectural Pa/t_terns for Secure IoT Orchestrations. In Global Internet of /T_hings Summit (GIoTS 2019), 17.-21. June 2019, Aarhus, DK. IEEE

work page 2019

[9] [9]

N. K. Giang, M. Blackstock, R. Lea, and V. C. M. Leung. 2015. Developing IoT applications in the Fog: A Distributed Data/f_low approach. In2015 5th International Conference on the Internet of /T_hings (IOT). 155–162. h/t_tps://doi.org/10. 1109/IOT.2015.7356560

work page arXiv 2015

[10] [10]

Sila Ozen Guclu, Tanir Ozcelebi, and Johan Lukkien. 2016. Distributed Fault Detection in Smart Spaces Based on Trust Management. Procedia Computer Science 83 (Jan. 2016), 66–73. h/t_tps://doi.org/10.1016/j.procs.2016.04.100

work page doi:10.1016/j.procs.2016.04.100 2016

[11] [11]

Andreas Moreg˚ard Haubenwaller and Konstantinos Vandikas. 2015. Computations on the edge in the internet of things. Procedia Computer Science 52 (2015), 29–34

work page 2015

[12] [12]

W. Z. Khan, M. Y. Aalsalem, M. K. Khan, M. S. Hossain, and M. Atiquzzaman. 2017. A reliable Internet of /T_hings based architecture for oil and gas industry. In 2017 19th International Conference on Advanced Communication Technology (ICACT). 705–710. h/t_tps://doi.org/10.23919/ICACT.2017.7890184

work page doi:10.23919/icact.2017.7890184 2017

[13] [13]

Kodeswaran, Ravi Kokku, Sayandeep Sen, and Mudhakar Srivatsa

Palanivel A. Kodeswaran, Ravi Kokku, Sayandeep Sen, and Mudhakar Srivatsa. 2016. Idea: A System for Eﬃcient Failure Management in Smart IoT Environments. In Proceedings of the 14th Annual International Conference on Mobile Systems, Applications, and Services (MobiSys ’16) . ACM, New York, NY, USA, 43–56. h/t_tps://doi.org/10.1145/2906388.2906406

work page doi:10.1145/2906388.2906406 2016

[14] [14]

Kr¨ugel, J

S. Kr¨ugel, J. Maierhofer, T. /T_h¨ummel, and D. J. Rixen. 2019. Rotor Model Reduction for Wireless Sensor Node Based Monitoring Systems. 13th International Conference on Dynamics of Rotating Machines (2019)

work page 2019

[15] [15]

G. T. Lakshmanan, Y. Li, and R. Strom. 2008. Placement Strategies for Internet-Scale Data Stream Systems. IEEE Internet Computing 12, 6 (Nov. 2008), 50–60. h/t_tps://doi.org/10.1109/MIC.2008.129

work page doi:10.1109/mic.2008.129 2008

[16] [16]

Jiaxi Liu, Zhibo Wu, Jian Dong, Jin Wu, and Dongxin Wen. 2018. An energy-eﬃcient failure detector for vehicular cloud computing. PLOS ONE 13, 1 (Jan. 2018), e0191577. h/t_tps://doi.org/10.1371/journal.pone.0191577

work page doi:10.1371/journal.pone.0191577 2018

[17] [17]

Nitinder Mohan and Jussi Kangasharju. 2016. Edge-Fog cloud: A distributed cloud for Internet of /T_hings computations. In 2016 Cloudi/f_ication of the Internet of /T_hings (CIoT). IEEE, 1–6

work page 2016

[18] [18]

J. A. Nelder and R. Mead. 1965. A Simplex Method for Function Minimization. Comput. J. 7, 4 (Jan. 1965), 308–313. h/t_tps://doi.org/10.1093/comjnl/7.4.308

work page doi:10.1093/comjnl/7.4.308 1965

[19] [19]

Terry Ross and Richard M

G. Terry Ross and Richard M. Soland. 1975. A branch and bound algorithm for the generalized assignment problem. Mathematical Programming 8, 1 (Dec. 1975), 91–103. h/t_tps://doi.org/10.1007/BF01580430

work page doi:10.1007/bf01580430 1975

[20] [20]

M. Ruta, F. Scioscia, G. Loseto, and E. Di Sciascio. 2014. Semantic-Based Resource Discovery and Orchestration in Home and Building Automation: A Multi-Agent Approach. IEEE Transactions on Industrial Informatics 10, 1 (Feb. 2014), 730–741. h/t_tps://doi.org/10.1109/TII.2013.2273433

work page doi:10.1109/tii.2013.2273433 2014

[21] [21]

Yuvraj Sahni, Jiannong Cao, Shigeng Zhang, and Lei Yang. 2017. Edge Mesh: A new paradigm to enable distributed intelligence in Internet of /T_hings.IEEE access 5 (2017), 16441–16458

work page 2017

[22] [22]

Farzad Samie, Vasileios Tsoutsouras, Lars Bauer, Sotirios Xydis, Dimitrios Soudris, and J¨org Henkel. 2016. Computation oﬄoading and resource allocation for low-power IoT edge devices. In 2016 IEEE 3rd World Forum on Internet of /T_hings (WF-IoT). IEEE, 7–12

work page 2016

[23] [23]

Stefania Sardelli/t_ti, Gesualdo Scutari, and Sergio Barbarossa. 2015. Joint optimization of radio and computational resources for multicell mobile-edge computing. IEEE Transactions on Signal and Information Processing over Networks 1, 2 (2015), 89–103

work page 2015

[24] [24]

Benjamin Satzger, Andreas Pietzowski, Wolfgang Trumler, and /T_heo Ungerer. 2007. A New Adaptive Accrual Failure Detector for Dependable Distributed Systems. In Proceedings of the 2007 ACM Symposium on Applied Computing (SAC ’07). ACM, New York, NY, USA, 551–555. h/t_tps://doi.org/10.1145/1244002.1244129 , Vol. 1, No. 1, Article 1. Publication date: Janua...

work page doi:10.1145/1244002.1244129 2007

[25] [25]

Seeger, A

J. Seeger, A. Br¨oring, M.-O. Pahl, and E. Sakic. [n.d.]. Rule-Based Translation of Application-Level QoS Constraints into SDN Con/f_igurations for the IoT. InEuCNC 2019, Valencia, Spain. IEEE

work page 2019

[26] [26]

Running Distributed and Dynamic IoT Choreographies

Jan Seeger, Rohit A. Deshmukh, and Arne Br¨oring. 2018. Running Distributed and Dynamic IoT Choreographies. In 2018 IEEE Global Internet of /T_hings Summit (GIoTS) Proceedings, Vol. 2. IEEE, Bilbao, Spain, 33–38. h/t_tp://arxiv.org/abs/ 1802.03159 arXiv: 1802.03159

work page internal anchor Pith review Pith/arXiv arXiv 2018

[27] [27]

Seeger, R

J. Seeger, R. A. Deshmukh, V. Sarafov, and A. Br¨oring. 2019. Dynamic IoT Choreographies. IEEE Pervasive Computing 18, 1 (Jan. 2019), 19–27. h/t_tps://doi.org/10.1109/MPRV.2019.2907003

work page doi:10.1109/mprv.2019.2907003 2019

[28] [28]

Zheng, Y

/Q_uan Z. Sheng, Xiaoqiang Qiao, Athanasios V. Vasilakos, Claudia Szabo, Sco/t_t Bourne, and Xiaofei Xu. 2014. Web services composition: A decade’s overview. Information Sciences 280 (Oct. 2014), 218–238. h/t_tps://doi.org/10.1016/j. ins.2014.04.054 WOS:000339132700014

work page doi:10.1016/j 2014

[29] [29]

W. Shi, J. Cao, Q. Zhang, Y. Li, and L. Xu. 2016. Edge Computing: Vision and Challenges. IEEE Internet of /T_hings Journal 3, 5 (Oct. 2016), 637–646. h/t_tps://doi.org/10.1109/JIOT.2016.2579198

work page doi:10.1109/jiot.2016.2579198 2016

[30] [30]

Tanenbaum and Maarten van Steen

Andrew S. Tanenbaum and Maarten van Steen. 2007. Distributed systems - principles and paradigms, 2nd Edition. Pearson Education

work page 2007

[31] [31]

Medagoda, He/t_tige Don, Darko Anicic, and Jan Seeger

Aparna Saisree /T_huluva, Arne Br¨oring, Ganindu P. Medagoda, He/t_tige Don, Darko Anicic, and Jan Seeger. 2017. Recipes for IoT Applications. In Proceedings of the Seventh International Conference on the Internet of /T_hings (IoT ’17). ACM, New York, NY, USA, 10:1–10:8. h/t_tps://doi.org/10.1145/3131542.3131553

work page doi:10.1145/3131542.3131553 2017

[32] [32]

Aparna Saisree /T_huluva, Kirill Dorofeev, Monika Wenger, Darko Anicic, and Sebastian Rudolph. 2017. Semantic-Based Approach for Low-Eﬀort Engineering of Automation Systems. InOn the Move to Meaningful Internet Systems. OTM 2017 Conferences (Lecture Notes in Computer Science) . Springer, Cham, 497–512. h/t_tps://doi.org/10.1007/978-3-319-69459-7 33

work page doi:10.1007/978-3-319-69459-7 2017

[33] [33]

Li/t_tman

Blase Ur, Melwyn Pak Yong Ho, Stephen Brawner, Jiyun Lee, Sarah Mennicken, Noah Picard, Diane Schulze, and Michael L. Li/t_tman. 2016. Trigger-Action Programming in the Wild: An Analysis of 200,000 IFTTT Recipes. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems (CHI ’16) . ACM, New York, NY, USA, 3227–3231. h/t_tps://doi.org...

work page doi:10.1145/2858036.2858556 2016