Hidden Dependencies and Component Variants in SBOM-Based Software Composition Analysis
Pith reviewed 2026-05-09 21:24 UTC · model grok-4.3
The pith
SBOMs often miss hidden code dependencies and fail to tag component variants consistently, causing scanners to disagree on which vulnerabilities apply.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Hidden code-level dependencies that are not recorded as component-level relations, together with component variants that lack stable identity across tools, produce inconsistent vulnerability reports and inconsistent VEX handling in current SBOM-based scanners.
What carries the argument
Two mismatch patterns—hidden dependencies (actual code relations absent from the SBOM component graph) and component variants (clones whose identities scanners cannot reconcile)—that directly alter the input to vulnerability matching.
If this is right
- The same software product can receive different vulnerability assessments depending on which scanner consumes its SBOM.
- VEX statements attached to one SBOM may be ignored or interpreted differently by other tools.
- Current SBOM formats limit the reliability of software composition analysis for vulnerability management.
- Richer ways to express dependencies and component identities are required before SBOMs can deliver consistent results.
Where Pith is reading between the lines
- Organizations may need to run multiple independent scanners and reconcile their outputs rather than trust any single one.
- Component identity schemes that survive cloning and variant creation would reduce one source of mismatch.
- Automated checks that detect hidden dependencies from source or binary analysis could be added to SBOM generation pipelines.
Load-bearing premise
The observed mismatches in the examined cases are representative of wider SBOM production and consumption practice and are the direct cause of the differing scanner outputs.
What would settle it
A broad audit that examines many real SBOMs and scanners and finds no measurable difference in reported vulnerabilities or VEX outcomes even when hidden dependencies and variants are present would falsify the central claim.
Figures
read the original abstract
Software Bills of Material (SBOMs) have emerged as an important technology for vulnerability management amid rising supply-chain attacks. They represent component relationships within a software product and support software composition analysis (SCA) by linking components to known vulnerabilities. However, the effectiveness of SBOM-based analysis depends on how accurately SBOMs represent component identities and actual dependencies in software. This paper studies two mismatch patterns: hidden code-level dependencies that are not represented as component-level dependencies, and component variants (clones) that cannot be identified consistently by scanners. We show that these mismatches can lead to inconsistent vulnerability reporting and inconsistent handling of VEX statements across popular SBOM-based vulnerability scanners. These results highlight limitations in current SBOM production and consumption and motivate richer dependency representation and component identity.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper investigates two mismatch patterns in SBOMs for software composition analysis: hidden code-level dependencies not captured as component-level dependencies, and component variants (clones) that cannot be consistently identified by scanners. Through analysis of popular SBOM-based vulnerability scanners, it claims to demonstrate that these mismatches produce inconsistent vulnerability reports and inconsistent handling of VEX statements, thereby highlighting limitations in current SBOM production and consumption practices and motivating richer dependency representations and component identity mechanisms.
Significance. If the observed inconsistencies are shown to be systematic rather than isolated, the work would be significant for the SBOM and supply-chain security community by providing concrete evidence of how representational gaps affect downstream vulnerability management. The observational grounding in real scanner behavior is a positive feature, but the absence of quantification or controlled experiments limits the strength of the broader claims about systemic issues in SBOM ecosystems.
major comments (2)
- [Results and Discussion sections] The central claim that hidden dependencies and component variants lead to inconsistent vulnerability reporting and VEX handling rests on targeted case studies. Without a larger sample, frequency counts, or an ablation that isolates these mismatches from scanner-specific heuristics and CVE database differences, the causal link and generalizability to production SBOMs remain under-supported (see the results and discussion sections describing the scanner comparisons).
- [Introduction and Conclusion] The assumption that the studied mismatches are representative of broader SBOM production/consumption problems is load-bearing for the motivation to adopt richer representations, yet no data on the prevalence of such patterns across open-source or commercial SBOMs is provided to substantiate this.
minor comments (2)
- [Abstract] The abstract states the claims but provides no overview of the methodology, dataset size, or scanner selection criteria; adding a brief methods summary would improve readability.
- [Background] Notation for component variants and hidden dependencies could be formalized earlier (e.g., with a small diagram or table) to make the mismatch patterns clearer before the case studies.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address the two major comments point by point below, clarifying the exploratory scope of our case studies while making targeted revisions to better align the claims with the evidence presented.
read point-by-point responses
-
Referee: [Results and Discussion sections] The central claim that hidden dependencies and component variants lead to inconsistent vulnerability reporting and VEX handling rests on targeted case studies. Without a larger sample, frequency counts, or an ablation that isolates these mismatches from scanner-specific heuristics and CVE database differences, the causal link and generalizability to production SBOMs remain under-supported (see the results and discussion sections describing the scanner comparisons).
Authors: We appreciate this observation. The manuscript presents targeted case studies to demonstrate that the identified mismatch patterns can produce inconsistent outputs across scanners, as shown through concrete examples of vulnerability reporting and VEX handling. We do not assert systematic prevalence or isolate every confounding factor via ablation; the contribution is in surfacing these representational gaps through real scanner behavior. We have revised the Results and Discussion sections to explicitly describe the findings as illustrative, to acknowledge scanner-specific heuristics as a potential influence, and to recommend controlled experiments in future work. revision: partial
-
Referee: [Introduction and Conclusion] The assumption that the studied mismatches are representative of broader SBOM production/consumption problems is load-bearing for the motivation to adopt richer representations, yet no data on the prevalence of such patterns across open-source or commercial SBOMs is provided to substantiate this.
Authors: We acknowledge that the paper provides no prevalence statistics across a broad corpus of SBOMs. The motivation is grounded in the observation that these mismatches exist and affect downstream analysis in the studied cases, thereby indicating limitations in current SBOM practices. We have revised the Introduction and Conclusion to remove any implication of representativeness, to frame the work as identifying specific patterns that warrant richer representations, and to explicitly note the absence of prevalence data as an area for future study. revision: partial
Circularity Check
No circularity: purely observational empirical analysis
full rationale
The paper conducts a case-study examination of SBOM production/consumption mismatches (hidden dependencies and component variants) and their effects on scanner outputs and VEX handling. No equations, derivations, fitted parameters, or predictive models are present. Claims rest on direct inspection of real scanners and SBOMs rather than any self-referential construction or self-citation chain. The work is therefore self-contained with no reduction of results to their own inputs.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Empirical Software Engineering28(3), 59 (2023)
Alfadel, M., Costa, D.E., Shihab, E.: Empirical analysis of security vulnera- bilities in Python packages. Empirical Software Engineering28(3), 59 (2023). https://doi.org/10.1007/s10664-022-10278-4
-
[2]
ACM Transactions on Software Engineering and Methodology33(6), 1–25 (2024)
Bi, T., Xia, B., Xing, Z., Lu, Q., Zhu, L.: On the way to SBOMs: Investigating design issues and solutions in practice. ACM Transactions on Software Engineering and Methodology33(6), 1–25 (2024). https://doi.org/10.1145/3654442
-
[3]
Biden, J.R.: Executive order 14028: Improving the nation’s cybersecurity. Federal Register, Vol. 86, No. 93, pp. 26633–26647 (May 2021), signed 12 May 2021; pub- lished 17 May 2021. Available athttps://www.federalregister.gov/document s/2021/05/17/2021-10460/improving-the-nations-cybersecurity
work page 2021
-
[4]
ACM Transactions on Internet Technology21(1), 1–14 (2021)
Boldi, P., Gousios, G.: Fine-grained network analysis for modern software ecosystems. ACM Transactions on Internet Technology21(1), 1–14 (2021). https://doi.org/10.1145/3418209 Hidden Dependencies... 17
-
[5]
In: 36th European Conference on Object-Oriented Programming (ECOOP)
Chakraborty, M., Olivares, R., Sridharan, M., Hassanshahi, B.: Automatic root cause quantification for missing edges in JavaScript call graphs. In: 36th European Conference on Object-Oriented Programming (ECOOP). LIPIcs, vol. 222, pp. 3:1–3:28. Schloss Dagstuhl – Leibniz-Zentrum für Informatik (2022). https://doi.org/10.4230/LIPIcs.ECOOP.2022.3
-
[6]
arXiv preprint arXiv:2503.14388 (2025)
Churakova,Y.,Ekstedt,M.,Schmid,L.:Vexedbyvextools:Consistencyevaluation of container vulnerability scanners. arXiv preprint arXiv:2503.14388 (2025)
-
[7]
Communications of the ACM62(9), 36–43 (2019)
Cox, R.: Surviving software dependencies. Communications of the ACM62(9), 36–43 (2019)
work page 2019
-
[8]
Cybersecurity and Infrastructure Security Agency (CISA): Vulnerability ex- ploitability eXchange (VEX) – use cases. Technical report, U.S. Cybersecurity and Infrastructure Security Agency (Apr 2022),https://www.cisa.gov/sites/d efault/files/publications/VEX_Use_Cases_Document_508c.pdf
work page 2022
-
[9]
Cybersecurity and Infrastructure Security Agency (CISA): Minimum requirements for vulnerability exploitability eXchange (VEX). Technical report, U.S. Cyberse- curity and Infrastructure Security Agency (Apr 2023),https://www.cisa.gov/s ites/default/files/2023-04/minimum-requirements-for-vex-508c.pdf, vEX Working Group coordinated by CISA
work page 2023
-
[10]
Dann, A., Plate, H., Hermann, B., Ponta, S.E., Bodden, E.: Iden- tifying challenges for OSS vulnerability scanners – a study & test suite. IEEE Transactions on Software Engineering48(9), 3613–3625 (2022). https://doi.org/10.1109/TSE.2021.3101739
-
[11]
An evaluation of open-source software microbenchmark suites for continuous performance assessment,
Decan, A., Mens, T., Constantinou, E.: On the impact of security vulnerabilities in the npm package dependency network. In: Proceedings of the 15th International ConferenceonMiningSoftwareRepositories(MSR).pp.181–191.ACM,NewYork, NY, USA (2018). https://doi.org/10.1145/3196398.3196401
-
[12]
Dietrich, J., Rasheed, S., Jordan, A., White, T.: On the security blind spots of software composition analysis. In: Proceedings of the 2024 Workshop on Software Supply Chain Offensive Research and Ecosystem Defenses (SCORED). pp. 1–11. ACM, New York, NY, USA (2024). https://doi.org/10.1145/3689944.3696165
-
[13]
Dietrich, J., Rasheed, S., Jordan, A., White, T.: On the security blind spots of software composition analysis. In: Proceedings of the 2024 Workshop on Software Supply Chain Offensive Research and Ecosystem Defenses. pp. 77–87 (2024)
work page 2024
-
[14]
Ernst, M.D.: Static and dynamic analysis: Synergy and duality. In: Proceedings of the ACM SIGPLAN-SIGSOFT Workshop on Program Analysis for Software Tools and Engineering (PASTE). pp. 35–35. ACM, New York, NY, USA (2003), invited talk, also presented at WODA 2003
work page 2003
-
[15]
Grove, D., DeFouw, G., Dean, J., Chambers, C.: Call graph construction in object-oriented languages. In: Proceedings of the 12th ACM SIGPLAN Conference on Object-Oriented Programming, Systems, Languages, and Ap- plications (OOPSLA). pp. 108–124. ACM, New York, NY, USA (1997). https://doi.org/10.1145/263698.264352
-
[16]
Empirical Software Engineering27(5), 102 (2022)
Hejderup, J., Beller, M., Triantafyllou, K., Gousios, G.: Präzi: From package-based to call-based dependency networks. Empirical Software Engineering27(5), 102 (2022). https://doi.org/10.1007/s10664-021-10071-9
-
[17]
Hejderup, J., van Deursen, A., Gousios, G.: Software ecosystem call graph for dependency management. In: Proceedings of the 40th International Conference on Software Engineering: New Ideas and Emerging Results (ICSE-NIER). pp. 101–
-
[18]
https://doi.org/10.1145/3183399.3183417 18 Rasheed et al
ACM, New York, NY, USA (2018). https://doi.org/10.1145/3183399.3183417 18 Rasheed et al
-
[19]
Imtiaz, N., Thorn, S., Williams, L.: A comparative study of vulnerabil- ity reporting by software composition analysis tools. In: Proceedings of the 15th ACM/IEEE International Symposium on Empirical Software Engineer- ing and Measurement (ESEM). pp. 1–11. ACM, New York, NY, USA (2021). https://doi.org/10.1145/3475716.3475769
-
[20]
IEEE Transactions on Software Engineering48(10), 3887–3899 (2022)
Jafari, A.J., Costa, D.E., Abdalkareem, R., Shihab, E., Tsantalis, N.: Dependency smells in JavaScript projects. IEEE Transactions on Software Engineering48(10), 3887–3899 (2022). https://doi.org/10.1109/TSE.2021.3106247
-
[21]
Jayasuriya, D., Terragni, V., Dietrich, J., Ou, S., Blincoe, K.: Understanding break- ing changes in the wild. In: Proceedings of the 32nd ACM SIGSOFT International Symposium on Software Testing and Analysis (ISSTA). pp. 1433–1444. ACM, New York, NY, USA (2023). https://doi.org/10.1145/3597926.3598147
-
[22]
Kern, P.: How to ensure the software supply chain security of a product without disclosing it/author philipp kern, bsc (2025)
work page 2025
-
[23]
Frankenstein: fast and lightweight call graph generation for software builds,
Keshani, M., Mir, A.M., Gousios, G., Proksch, S.: Frankenstein: Fast and lightweight call graph generation for software builds. Empirical Software Engi- neering29(1), 10 (2024). https://doi.org/10.1007/s10664-023-10388-7
-
[24]
Empirical Software Engineering23(1), 384–417 (2018)
Kula, R.G., German, D.M., Ouni, A., Ishio, T., Inoue, K.: Do developers update their library dependencies? an empirical study on the impact of security advi- sories on library migration. Empirical Software Engineering23(1), 384–417 (2018). https://doi.org/10.1007/s10664-017-9521-5
-
[25]
Transplantfix: Graph differencing-based code transplantation for automated program repair,
Latendresse, J., Mujahid, S., Costa, D.E., Shihab, E.: Not all dependen- cies are equal: An empirical study on production dependencies in NPM. In: Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering (ASE). pp. 1–12. ACM, New York, NY, USA (2022). https://doi.org/10.1145/3551349.3556896
-
[26]
Lehmann, D., Thalakottur, M., Tip, F., Pradel, M.: That’s a tough call: Study- ing the challenges of call graph construction for WebAssembly. In: Proceed- ings of the 32nd ACM SIGSOFT International Symposium on Software Test- ing and Analysis (ISSTA). pp. 892–903. ACM, New York, NY, USA (2023). https://doi.org/10.1145/3597926.3598104
-
[27]
Lhoták, O.: Comparing call graphs. In: Proceedings of the 7th ACM SIGPLAN-SIGSOFT Workshop on Program Analysis for Software Tools and Engineering (PASTE). pp. 37–42. ACM, New York, NY, USA (2007). https://doi.org/10.1145/1251535.1251542
-
[28]
We Feel Like We’re Winging It:
Liu, C., Chen, S., Fan, L., Chen, B., Liu, Y., Peng, X.: Software composition analy- sis for vulnerability detection: An empirical study on java projects. In: Proceedings of the 31st ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering (ESEC/FSE). ACM, New York, NY, USA (2023). https://doi.org/10.1145...
-
[29]
Nelson Amaral, Bor-Yuh Evan Chang, Samuel Z
Livshits, B., Sridharan, M., Smaragdakis, Y., Lhoták, O., Amaral, J.N., Chang, B.Y.E., Guyer, S.Z., Khedker, U.P., Møller, A., Vardoulakis, D.: In defense of soundiness: A manifesto. Communications of the ACM58(2), 44–46 (2015). https://doi.org/10.1145/2644805
-
[30]
Maratos, A., Lubinsky, E., Methman, J., Surti, J., Peele, L.: Supply chain security: The role of sboms and vex in cyber risk mitigation (2025)
work page 2025
- [31]
-
[32]
In: 2023 IEEE International Hidden Dependencies
Mir, A.M., Keshani, M., Proksch, S.: On the effect of transitivity and granularity on vulnerability propagation in the Maven ecosystem. In: 2023 IEEE International Hidden Dependencies... 19 Conference on Software Analysis, Evolution and Reengineering (SANER). pp. 201–
work page 2023
-
[33]
Mir, Mehdi Keshani, and Sebastian Proksch
IEEE (2023). https://doi.org/10.1109/SANER56733.2023.00028
-
[34]
A landscape study of open source and proprietary tools for software bill of materials (sbom)
Mirakhorli, M., Garcia, D., Dillon, S., Laporte, K., Morrison, M., Lu, H., Koscinski, V., Enoch, C.: A landscape study of open source and proprietary tools for software bill of materials (sbom). arXiv preprint arXiv:2402.11151 (2024)
-
[35]
Murphy, David Notkin, William G
Murphy, G.C., Notkin, D., Griswold, W.G., Lan, E.S.: An empirical study of static call graph extractors. ACM Transactions on Software Engineering and Methodol- ogy7(2), 158–191 (1998). https://doi.org/10.1145/279310.279314
-
[36]
O’Donoghue, E., Reinhold, A.M., Izurieta, C.: Assessing security risks of software supply chains using software bill of materials. In: 2024 IEEE International Confer- ence on Software Analysis, Evolution and Reengineering - Companion (SANER-C). pp. 134–140 (2024). https://doi.org/10.1109/SANER-C62648.2024.00023
-
[37]
Ohm, M., Plate, H., Sykosch, A., Meier, M.: Backstabber’s knife collection: A re- view of open source software supply chain attacks. In: Proceedings of the 17th International Conference on Detection of Intrusions and Malware, and Vulnera- bility Assessment (DIMVA). Lecture Notes in Computer Science, vol. 12223, pp. 23–43. Springer (2020). https://doi.org/...
-
[38]
openvex: Openvex specification (2025),https://github.com/openvex/spec/blob /main/OPENVEX-SPEC.md
work page 2025
-
[39]
OWASP CycloneDX Working Group: CycloneDX: Vulnerability exploitability eX- change (VEX) capabilities.https://cyclonedx.org/capabilities/vex/(2025), part of the CycloneDX Bill of Materials specification
work page 2025
-
[40]
IEEE Transactions on Software Engineering48(5), 1592–1609 (2022)
Pashchenko, I., Plate, H., Ponta, S.E., Sabetta, A., Massacci, F.: Vuln4real: A methodology for counting actually vulnerable dependen- cies. IEEE Transactions on Software Engineering48(5), 1592–1609 (2022). https://doi.org/10.1109/TSE.2020.3025443
-
[41]
Empirical Software Engineering25(5), 3175–3215 (2020)
Ponta, S.E., Plate, H., Sabetta, A.: Detection, assessment and mitigation of vul- nerabilities in open source dependencies. Empirical Software Engineering25(5), 3175–3215 (2020). https://doi.org/10.1007/s10664-020-09830-x
-
[42]
Sánchez, Pedro Delgado-Pérez, Inmaculada Medina-Bulo, and Sergio Segura
Prana, G.A.A., Sharma, A., Shar, L.K., Foo, D., Santosa, A.E., Sharma, A., Lo, D.: Out of sight, out of mind? how vulnerable dependencies affect open-source projects. Empirical Software Engineering26(4), 59 (2021). https://doi.org/10.1007/s10664- 021-09959-3
-
[43]
Rock, L., Hagen, S., Schmidt, T.: Common security advisory framework version 2.0. Oasis standard, OASIS Open (Nov 2022),https://docs.oasis-open.org/cs af/csaf/v2.0/os/csaf-v2.0-os.html, approved 18 November 2022; subsequently published as ISO/IEC 20153:2025
work page 2022
-
[44]
arXiv preprint arXiv:2512.17710 (2025)
Rosso, M., Jaffar, M.A.J., Brighente, A., Conti, M.: A practical solution to sys- tematically monitor inconsistencies in sbom-based vulnerability scanners. arXiv preprint arXiv:2512.17710 (2025)
- [45]
-
[46]
arXiv preprint arXiv:2510.19393 (2025)
Schott, S., Ponta, S.E., Fischer, W., Klauke, J., Bodden, E.: Bytecode-centric de- tection of known-to-be-vulnerable dependencies in java projects. arXiv preprint arXiv:2510.19393 (2025)
-
[47]
Empirical Software Engineering 26(3), 45 (2021)
Soto-Valero, C., Harrand, N., Monperrus, M., Baudry, B.: A comprehensive study of bloated dependencies in the Maven ecosystem. Empirical Software Engineering 26(3), 45 (2021). https://doi.org/10.1007/s10664-020-09914-8 20 Rasheed et al
-
[48]
Dataflow analysis-inspired deep learning for efficient vulnerability detection
Stalnaker, T., Wintersgill, N., Chaparro, O., Di Penta, M., German, D.M., Poshy- vanyk, D.: BOMs away! inside the minds of stakeholders: A comprehensive study of bills of materials for software systems. In: Proceedings of the 46th IEEE/ACM International Conference on Software Engineering (ICSE). pp. 1–13. ACM, New York, NY, USA (2024). https://doi.org/10....
-
[49]
In: Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering (ICSE)
Sui, L., Dietrich, J., Tahir, A., Fourtounis, G.: On the recall of static call graph construction in practice. In: Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering (ICSE). pp. 1049–1060. ACM, New York, NY, USA (2020). https://doi.org/10.1145/3377811.3380441
-
[50]
The OpenVEX Authors: OpenVEX specification.https://github.com/openvex /spec(2023), lightweight JSON-LD VEX implementation meeting the CISA VEX Minimum Requirements
work page 2023
-
[51]
Tip, F., Palsberg, J.: Scalable propagation-based call graph construc- tion algorithms. In: Proceedings of the 15th ACM SIGPLAN Confer- ence on Object-Oriented Programming, Systems, Languages, and Appli- cations (OOPSLA). pp. 281–293. ACM, New York, NY, USA (2000). https://doi.org/10.1145/353171.353190
-
[52]
Empirical Software Engineering 30(4) (2025)
Turcotte, A., Arteca, E., Soto-Valero, C., Monperrus, M.: Detecting and remov- ing bloated dependencies in CommonJS packages. Empirical Software Engineering 30(4) (2025). https://doi.org/10.1007/s10664-025-10618-0
-
[53]
Wu, W., Hu, H., Fan, Z., Qiao, Y., Huang, Y., Li, Y., Zheng, Z., Lyu, M.: An empirical study of code clones from commercial ai code generators. Proc. ACM Softw. Eng.2(FSE) (Jun 2025). https://doi.org/10.1145/3729397,https://doi. org/10.1145/3729397
-
[54]
In: Proceedings of the 45th IEEE/ACM International Conference on Software Engineering (ICSE)
Xia, B., Bi, T., Xing, Z., Lu, Q., Zhu, L.: An empirical study on software bill of materials: Where we stand and the road ahead. In: Proceedings of the 45th IEEE/ACM International Conference on Software Engineering (ICSE). pp. 2630–
-
[55]
IEEE (2023). https://doi.org/10.1109/ICSE48619.2023.00219
-
[56]
Zhan, Y., Sharma, A.: The dynamics of software composition analysis. In: Proceed- ings of the IEEE/ACM International Conference on Software Engineering (ICSE), NIER Track. IEEE (2021). https://doi.org/10.1109/ICSE-NIER52604.2021.00019
-
[57]
In: Proceedings of the 28th USENIX Security Symposium (USENIX Security)
Zimmermann, M., Staicu, C.A., Tenny, C., Pradel, M.: Small world with high risks: A study of security threats in the npm ecosystem. In: Proceedings of the 28th USENIX Security Symposium (USENIX Security). pp. 995–1010. USENIX Association (2019)
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.