Rethinking Software Misconfigurations in the Real World: An Empirical Study and Literature Analysis

Hanfeng Zhang; Juncheng Hu; Sihan Xu; Wei Wang; Yan Jia; Yingnan Zhou; Yuhao Liu; Zheli Liu; Zhiwei Chang

arxiv: 2412.11121 · v3 · pith:BB54JY2Bnew · submitted 2024-12-15 · 💻 cs.SE

Rethinking Software Misconfigurations in the Real World: An Empirical Study and Literature Analysis

Yuhao Liu , Yingnan Zhou , Hanfeng Zhang , Zhiwei Chang , Sihan Xu , Yan Jia , Wei Wang , Juncheng Hu

show 1 more author

Zheli Liu

This is my paper

classification 💻 cs.SE

keywords softwaremisconfigurationsmisconfigurationliteraturereal-worldresearchtoolsdatasets

0 comments

read the original abstract

Software misconfiguration has consistently been a major reason for software failures. Over the past two decades, much work has been done to detect and diagnose software misconfigurations. However, there is still a gap between real-world misconfigurations and the literature. It is desirable to investigate whether existing taxonomy and tools are applicable for real-world misconfigurations in modern software. In this paper, we conduct an empirical study on 772 real-world misconfiguration issues, based on which we propose a novel classification of the root causes of software misconfigurations, i.e., constraint violation, resource unavailability, component integration error, and configuration semantic misinterpretation. Then, we systematically review the literature on misconfiguration troubleshooting to study the trends of research and the practicality of the tools and datasets in this field. We find that the research targets have changed from system and infrastructure software to advanced applications (e.g., cloud service). Meanwhile, research on non-crash misconfigurations has also grown significantly. Despite the progress, a majority of studies lack reproducibility due to the unavailable tools and evaluation datasets. In total, only eleven tools and four datasets are publicly available. We analyze the trends of existing literature on misconfiguration troubleshooting, summarize the challenges that users are faced with, and highlight the suggestions to mitigate and diagnose software misconfigurations. We release the real-world dataset of misconfiguration issues for follow-up research.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

AI Native Asset Intelligence
cs.CR 2026-05 unverdicted novelty 5.0

The paper presents a modeling-plus-scoring framework that turns fragmented security signals into stable asset-level importance scores by separating intrinsic exposure from business and data context, evaluated on 131k ...
AI Native Asset Intelligence
cs.CR 2026-05 unverdicted novelty 5.0

AI-native asset intelligence framework converts heterogeneous security signals into normalized asset importance scores by separating intrinsic exposure from contextual factors using modeling and deterministic aggregation.