pith. sign in

arxiv: 2605.15842 · v1 · pith:SSEYKZZHnew · submitted 2026-05-15 · ⚛️ physics.soc-ph · cs.SI

Reconstructing temporal multi-relational firm networks at scale using large language models. The case of the semiconductor industry

Pith reviewed 2026-05-19 19:32 UTC · model grok-4.3

classification ⚛️ physics.soc-ph cs.SI
keywords semiconductor industrymulti-relational networkslarge language modelssupply chaintemporal networksweb data extractionfirm relationsnetwork reconstruction
0
0 comments X

The pith

Large language models can extract supply-chain, partnership and ownership links from public webpages to build a temporal network of over 1,300 semiconductor firms.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that large language models applied to open web data can identify and classify relations among semiconductor companies at scale, producing an evolving map of more than 1,300 firms connected by supply chains, partnerships, and ownership. This matters because existing proprietary sources are costly, incomplete, and slow to update, leaving gaps in understanding rapid shifts in global dependencies. The resulting network matches validation metrics against proprietary records and aggregate economic figures while documenting concrete changes such as reduced connections during shortages. If the extraction holds, analysts gain a practical way to monitor industry structure without waiting for paid databases.

Core claim

The authors claim that scanning 170 million semiconductor firm webpages with large language models identifies and classifies supply-chain, partnership, and ownership links, yielding a temporal network of over 1,300 linked firms. This network overlaps with and complements proprietary databases, remains consistent with aggregate economic data, and records a temporary 9% decline in edges during the 2022 chip shortage together with rising centrality for AI supply-chain firms such as NVIDIA and geographic realignment of relations amid geopolitical shifts.

What carries the argument

The LLM-based pipeline that reads firm webpages and classifies relational statements into supply-chain, partnership, and ownership categories to assemble the temporal multi-relational graph.

If this is right

  • The network records a temporary 9% decline in edges during the 2022 chip shortage.
  • Centrality rises rapidly for AI supply-chain bottleneck firms such as NVIDIA.
  • Geographic patterns of interfirm relations shift in response to geopolitical turbulence.
  • The framework supplies up-to-date maps for assessing resilience in the semiconductor sector.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same webpage-scanning approach could be applied to map firm relations in other sectors where public data is plentiful but proprietary records lag.
  • Combining the extracted temporal networks with economic flow models might permit simulations of how disruptions at individual firms spread through supply chains.
  • Repeated updates from web sources could support continuous policy tracking of strategic industries beyond one-time studies.

Load-bearing premise

Publicly available firm webpages contain sufficiently complete and unbiased information on supply-chain, partnership, and ownership relations, and the LLM can classify those links without systematic errors that would distort network structure or temporal dynamics.

What would settle it

A side-by-side comparison showing that the extracted network's link counts, centrality rankings, or recorded 9% edge decline during 2022 diverge substantially from a comprehensive proprietary database or independent transaction-volume statistics for the same firms and period.

Figures

Figures reproduced from arXiv: 2605.15842 by Christian Diem, Elma Dervic, Georg Heiler, Hernan Picatto, Jan Hurt, Klaus Friesenbichler, Peter Klimek, Seyda K\"ose.

Figure 1
Figure 1. Figure 1: Evolution and structure of the reconstructed firm-level production network (2015–2025). a Evolution of network size, showing the number of nodes (blue dots), the count of multi-relational edges, where different relationship types between the same firm pair are counted separately (green triangles) and the count of unique directed relationships, where multiple relationship types between the same firm pair ar… view at source ↗
Figure 2
Figure 2. Figure 2: Comparison of our newly constructed supply-chain network with S&P Capital IQ data. The chart compares the total edge counts in the S&P ground truth and the reconstructed network, the number of edges in the subnetworks induced by the overlapping nodes, alongside the observed and expected overlaps using the con￾figuration model as suitable statistical null model. via Monte Carlo rewiring and further decompos… view at source ↗
Figure 3
Figure 3. Figure 3: Centered regional collaboration trajectories of selected firms (2017– 2025). Each line shows the temporal evolution of a firm’s relative regional orienta￾tion, measured as the share of edges with the focal region among its immediate (1-hop) neighbors in the reconstructed network and centered by the global mean over the full observation period. a United States versus European Union. b United States versus J… view at source ↗
Figure 4
Figure 4. Figure 4 [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Methodological pipeline for reconstructing the firm-level network. [PITH_FULL_IMAGE:figures/full_fig_p017_5.png] view at source ↗
read the original abstract

The semiconductor industry is foundational to modern technology, yet its complex global multi-relational firm network remains poorly understood, posing challenges to scientists, firms, and policymakers. Traditional analysis relies on proprietary databases that are often expensive, incomplete, and slowly updated, limiting their ability to capture rapidly evolving dependencies. Here, we demonstrate that a novel, generalizable methodology combining Large Language Models (LLMs) with open web data can reconstruct this network and its structural dynamics at scale. We identify and classify supply-chain, partnership, and ownership links from 170 million semiconductor firm webpages, yielding a temporal network of over 1,300 linked firms. We validate link-extraction quality (Precision: 0.884; F1-score: 0.784), network overlap and complementarity with a proprietary database, and consistency with aggregate economic data. Our network reveals a temporary 9% decline in edges during the 2022 chip shortage, rapid increases in the centrality of AI supply-chain bottleneck firms such as NVIDIA, and geographic realignment of interfirm relations amid geopolitical turbulence. This generalizable framework overcomes barriers to transparency and provides essential, up-to-date maps for assessing resilience and informing policy across strategically relevant sectors.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 2 minor

Summary. The paper claims that a novel LLM-based methodology applied to 170 million open semiconductor firm webpages can reconstruct a temporal multi-relational network of over 1,300 firms, extracting supply-chain, partnership and ownership links at scale. It reports validation via aggregate link-extraction metrics (precision 0.884, F1 0.784), overlap/complementarity with a proprietary database, and consistency with aggregate economic statistics, then uses the resulting network to document a 9% edge decline during the 2022 chip shortage, rising centrality of AI-bottleneck firms such as NVIDIA, and geographic realignment amid geopolitical turbulence.

Significance. If the central extraction and temporal claims hold after bias checks, the work would provide a scalable, low-cost alternative to proprietary databases for mapping strategic industry networks. This could enable timely analysis of supply-chain resilience and structural change in critical sectors, with clear relevance to policy and economic network science.

major comments (1)
  1. [Validation and results sections] Validation and results sections: the reported aggregate precision (0.884) and F1 (0.784) are not shown to be uniform across firm size, geography, relation type or time period. Without a stratified error analysis or temporal-slice overlap statistics against the proprietary database, it remains possible that differential web visibility or LLM classification errors (e.g., under-representation of smaller Asian suppliers) drive the observed 9% edge decline in 2022 and the reported rise in NVIDIA centrality rather than genuine network evolution.
minor comments (2)
  1. [Abstract] The abstract states 'network overlap and complementarity with a proprietary database' without quantitative figures; adding these numbers (or directing readers to the relevant table/figure) would improve immediate clarity.
  2. [Methods] Methods description of the LLM prompting and classification pipeline would benefit from an explicit statement of how temporal information is extracted and dated from webpages.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive feedback, which helps strengthen the validation of our LLM-based network reconstruction methodology. We address the major comment on the uniformity of validation metrics below.

read point-by-point responses
  1. Referee: Validation and results sections: the reported aggregate precision (0.884) and F1 (0.784) are not shown to be uniform across firm size, geography, relation type or time period. Without a stratified error analysis or temporal-slice overlap statistics against the proprietary database, it remains possible that differential web visibility or LLM classification errors (e.g., under-representation of smaller Asian suppliers) drive the observed 9% edge decline in 2022 and the reported rise in NVIDIA centrality rather than genuine network evolution.

    Authors: We acknowledge that the reported validation metrics are aggregate and do not include explicit stratified breakdowns by firm size, geography, relation type, or time period. Our current validation relies on overall precision and F1, combined with overlap/complementarity checks against a proprietary database and consistency with aggregate economic statistics. While these provide support for the network's reliability, we agree that a stratified analysis would more rigorously rule out systematic biases such as differential web visibility for smaller Asian suppliers. In the revised version, we will add a stratified error analysis (including precision/F1 by region and firm size where metadata permits) and temporal-slice overlap statistics with the proprietary database. This will help confirm that the 9% edge decline during the 2022 chip shortage and the rise in NVIDIA centrality reflect genuine structural changes, consistent with independent industry reports on supply disruptions and AI bottlenecks. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical extraction validated against external data

full rationale

The paper describes an LLM-based pipeline to extract multi-relational links from 170 million firm webpages, producing a temporal network of 1,300+ firms. Validation consists of precision/F1 metrics on sampled annotations, overlap checks with an independent proprietary database, and consistency tests against aggregate economic statistics. No equations, fitted parameters, or first-principles derivations are presented as predictions; the central claims rest on external benchmarks rather than internal re-use of the extracted network itself. No self-citation chains or ansatzes are invoked to justify the core reconstruction step.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the premise that web data plus current LLM capabilities can produce accurate relational links at scale; no free parameters or invented entities are stated in the abstract.

axioms (1)
  • domain assumption LLMs can accurately extract and classify supply-chain, partnership, and ownership relations from firm webpages
    The entire reconstruction pipeline depends on this extraction quality; the reported precision and F1 scores are presented as evidence but the assumption is not independently proven in the abstract.

pith-pipeline@v0.9.0 · 5777 in / 1420 out tokens · 49686 ms · 2026-05-19T19:32:07.393133+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

51 extracted references · 51 canonical work pages

  1. [1]

    Choi, Vasco M

    Anton Pichler, Christian Diem, Alexandra Brintrup, Fran¸ cois Lafond, Glenn Mager- man, Gert Buiten, Thomas Y. Choi, Vasco M. Carvalho, J. Doyne Farmer, and Stefan Thurner. Building an alliance to map global supply networks.Science, 382(6668):270–272, 2023

  2. [2]

    The network origins of aggregate fluctuations.Econometrica, 80(5):1977–2016, 2012

    Daron Acemoglu, Vasco M Carvalho, Asuman Ozdaglar, and Alireza Tahbaz-Salehi. The network origins of aggregate fluctuations.Econometrica, 80(5):1977–2016, 2012

  3. [3]

    Sys- temic risk analysis on reconstructed economic and financial networks.Scientific reports, 5(1):15758, 2015

    Giulio Cimini, Tiziano Squartini, Diego Garlaschelli, and Andrea Gabrielli. Sys- temic risk analysis on reconstructed economic and financial networks.Scientific reports, 5(1):15758, 2015. 22

  4. [4]

    Quantifying economic resilience from in- put–output susceptibility to improve predictions of economic growth and recovery

    Peter Klimek and Sebastian Poledna. Quantifying economic resilience from in- put–output susceptibility to improve predictions of economic growth and recovery. Nature Communications, 10(1):1677, 2019

  5. [5]

    Firm-level propagation of shocks through supply-chain networks.Nature Sustainability, 2(9):841–847, 2019

    Hiroyasu Inoue and Yasuyuki Todo. Firm-level propagation of shocks through supply-chain networks.Nature Sustainability, 2(9):841–847, 2019

  6. [6]

    The network origins of aggregate fluctuations.Econometrica, 80, 10 2011

    Daron Acemoglu, Vasco Carvalho, Asuman Ozdaglar, and Alireza Tahbaz-Salehi. The network origins of aggregate fluctuations.Econometrica, 80, 10 2011

  7. [7]

    Sup- ply chain disruptions: Evidence from the great east Japan earthquake*.The Quar- terly Journal of Economics, 136(2):1255–1321, 12 2020

    Vasco M Carvalho, Makoto Nirei, Yukiko U Saito, and Alireza Tahbaz-Salehi. Sup- ply chain disruptions: Evidence from the great east Japan earthquake*.The Quar- terly Journal of Economics, 136(2):1255–1321, 12 2020

  8. [8]

    Estimating the loss of economic predictability from aggregating firm-level produc- tion networks.PNAS Nexus, 3(3):pgae064, 02 2024

    Christian Diem, Andr´ as Borsos, Tobias Reisch, J´ anos Kert´ esz, and Stefan Thurner. Estimating the loss of economic predictability from aggregating firm-level produc- tion networks.PNAS Nexus, 3(3):pgae064, 02 2024

  9. [9]

    Inequality in economic shock exposures across the global firm-level supply network.Nature Communications, 15(1):3348, 2024

    Abhijit Chakraborty, Tobias Reisch, Christian Diem, Pablo Astudillo-Est´ evez, and Stefan Thurner. Inequality in economic shock exposures across the global firm-level supply network.Nature Communications, 15(1):3348, 2024

  10. [10]

    Climate stress test of the global supply chain network: the case of river floods

    Georgios Papadopoulos, Javier Ojea Ferreiro, and Roberto Panzica. Climate stress test of the global supply chain network: the case of river floods. 2025

  11. [11]

    Quantifying firm-level economic systemic risk from nation-wide supply networks

    Christian Diem, Andr´ as Borsos, Tobias Reisch, J´ anos Kert´ esz, and Stefan Thurner. Quantifying firm-level economic systemic risk from nation-wide supply networks. Scientific reports, 12(1):7719, 2022

  12. [12]

    Evolution and determinants of firm-level systemic risk in local production networks.arXiv preprint arXiv:2506.21426, 2025

    Anna Mancini, Bal´ azs Lengyel, Riccardo Di Clemente, and Giulio Cimini. Evolution and determinants of firm-level systemic risk in local production networks.arXiv preprint arXiv:2506.21426, 2025

  13. [13]

    Supply network stress-testing of food security on the establishment-level.International Journal of Production Research, 63(9):3259–3283, 2025

    Christian Diem, William Schueller, Melanie Gerschberger, Johannes Stangl, Beate Conrady, Markus Gerschberger, and Stefan Thurner. Supply network stress-testing of food security on the establishment-level.International Journal of Production Research, 63(9):3259–3283, 2025

  14. [14]

    Carvalho, and J

    James McNerney, Charles Savoie, Francesco Caravelli, Vasco M. Carvalho, and J. Doyne Farmer. How production networks amplify economic growth.Proceedings of the National Academy of Sciences, 119(1):e2106031118, 2022

  15. [15]

    The network of global corporate control.PloS one, 6(10):e25995, 2011

    Stefania Vitali, James B Glattfelder, and Stefano Battiston. The network of global corporate control.PloS one, 6(10):e25995, 2011

  16. [16]

    The power of corporate control in the global ownership network.Plos one, 15(8):e0237862, 2020

    Takayuki Mizuno, Shohei Doi, and Shuhei Kurizaki. The power of corporate control in the global ownership network.Plos one, 15(8):e0237862, 2020. 23

  17. [17]

    Epidemic effects in the diffusion of emerging digital technologies: evidence from artificial intelligence adoption.Research Policy, 53(2):104917, 2024

    Johannes Dahlke, Mathias Beck, Jan Kinne, David Lenz, Robert Dehghan, Martin W¨ orter, and Bernd Ebersberger. Epidemic effects in the diffusion of emerging digital technologies: evidence from artificial intelligence adoption.Research Policy, 53(2):104917, 2024

  18. [18]

    Innovation diffu- sion in global contexts: determinants of post-adoption digital transformation of Eu- ropean companies.European journal of information systems, 15(6):601–616, 2006

    Kevin Zhu, Shutao Dong, Sean Xin Xu, and Kenneth L Kraemer. Innovation diffu- sion in global contexts: determinants of post-adoption digital transformation of Eu- ropean companies.European journal of information systems, 15(6):601–616, 2006

  19. [19]

    Nvidia’s$100bn bet on OpenAI raises more questions than it answers.The Economist, September 2025

    The Economist. Nvidia’s$100bn bet on OpenAI raises more questions than it answers.The Economist, September 2025. Business section. Published September 22, 2025. Accessed 2026-01-09

  20. [20]

    Mapping of the global semiconductor supply chain: Embedding Austria in the global semiconductor inter-firm network

    Peter Klimek, Markus Gerschberger, Christopher Schwarz, Tiberiu-Alexandru Cioban, Agnes K¨ ugler, Elma Dervic, Georg Heiler, Hernan Picatto, Klaus Friesen- bichler, and Lukas Schmoigl. Mapping of the global semiconductor supply chain: Embedding Austria in the global semiconductor inter-firm network. Policy brief, Supply Chain Intelligence Institute Austri...

  21. [21]

    Mapping the semiconductor value chain: Working towards identifying de- pendencies and vulnerabilities

    OECD. Mapping the semiconductor value chain: Working towards identifying de- pendencies and vulnerabilities. Technical Report 182, OECD Publishing, Paris, June 2025

  22. [22]

    Firm-level production networks: what do we (really) know.INET Oxford Working Paper, 2023, 2023

    Andrea Bacilieri, Andr´ as Borsos, Pablo Astudillo-Estevez, and Fran¸ cois Lafond. Firm-level production networks: what do we (really) know.INET Oxford Working Paper, 2023, 2023

  23. [23]

    The construction of world input–output tables in the wiod project.Eco- nomic Systems Research, 25(1):71–98, 2013

    Erik Dietzenbacher, Bart Los, Robert Stehrer, Marcel Timmer, and Gaaitzen de Vries. The construction of world input–output tables in the wiod project.Eco- nomic Systems Research, 25(1):71–98, 2013

  24. [24]

    The Belgian production network 2002-2012

    Emmanuel Dhyne, Glenn Magerman, and Stela Rub´ ınov´ a. The Belgian production network 2002-2012. Technical report, NBB Working Paper, 2015

  25. [25]

    Re- constructing production networks using machine learning.Journal of Economic Dynamics and Control, 148:104607, 2023

    Luca Mungo, Fran¸ cois Lafond, Pablo Astudillo-Est´ evez, and J Doyne Farmer. Re- constructing production networks using machine learning.Journal of Economic Dynamics and Control, 148:104607, 2023

  26. [26]

    Mapping the supply chain: Why, what and how?International Journal of Production Economics, 250:108688, 2022

    Bart L MacCarthy, Wafaa AH Ahmed, and Guven Demirel. Mapping the supply chain: Why, what and how?International Journal of Production Economics, 250:108688, 2022

  27. [27]

    chips shortage

    Xiling Wu, Caihua Zhang, and Wei Du. An analysis on the crisis of “chips shortage” in automobile industry——based on the double influence of COVID-19 and trade friction. InJournal of Physics: Conference Series, volume 1971, page 012100. IOP Publishing, 2021. 24

  28. [28]

    The global semi- conductor chip shortage: Causes, implications, and potential remedies.IFAC- PapersOnLine, 55(10):476–483, 2022

    Wassen Mohammad, Adel Elomri, and Laoucine Kerbache. The global semi- conductor chip shortage: Causes, implications, and potential remedies.IFAC- PapersOnLine, 55(10):476–483, 2022

  29. [29]

    Geopolitical disruptions in global supply chains: a state-of-the-art literature re- view.Production planning & control, 36(4):536–562, 2025

    Lukasz Bednarski, Samuel Roscoe, Constantin Blome, and Martin C Schleper. Geopolitical disruptions in global supply chains: a state-of-the-art literature re- view.Production planning & control, 36(4):536–562, 2025

  30. [30]

    Recon- structing supply networks.Journal of Physics: Complexity, 5(1):012001, 2024

    Luca Mungo, Alexandra Brintrup, Diego Garlaschelli, and Fran¸ cois Lafond. Recon- structing supply networks.Journal of Physics: Complexity, 5(1):012001, 2024

  31. [31]

    Reconstructing firm- level interactions in the dutch input–output network from production constraints

    Leonardo Niccol` o Ialongo, Camille De Valk, Emiliano Marchese, Fabian Jansen, Hicham Zmarrou, Tiziano Squartini, and Diego Garlaschelli. Reconstructing firm- level interactions in the dutch input–output network from production constraints. Scientific reports, 12(1):11847, 2022

  32. [32]

    Extracting supply chain maps from news articles using deep neural networks.International Journal of Production Research, 58(17):5320–5336, 2020

    P Wichmann, A Brintrup, S Baker, P Woodall, and D McFarlane. Extracting supply chain maps from news articles using deep neural networks.International Journal of Production Research, 58(17):5320–5336, 2020

  33. [33]

    Enhancing supply chain visibility with knowledge graphs and large language models.arXiv preprint arXiv:2408.07705,

    Sara AlMahri, Liming Xu, and Alexandra Brintrup. Enhancing supply chain visibility with knowledge graphs and large language models.arXiv preprint arXiv:2408.07705, 2024

  34. [34]

    Supply chain mapping through retrieval-augmented generation: applications to the elec- tronics industry.Journal of the Operational Research Society, pages 1–21, 2025

    Ilya Jackson, Maria Jes´ us Sa´ enz, Dmitry Ivanov, and Benedict Jun Ma. Supply chain mapping through retrieval-augmented generation: applications to the elec- tronics industry.Journal of the Operational Research Society, pages 1–21, 2025

  35. [35]

    Large-scale structure of a nation-wide pro- duction network.The European Physical Journal B, 77(4):565–580, 2010

    Yoshi Fujiwara and Hideaki Aoyama. Large-scale structure of a nation-wide pro- duction network.The European Physical Journal B, 77(4):565–580, 2010

  36. [36]

    Topological robustness of the global automotive industry.Logistics Research, 9(1):1, 2016

    Alexandra Brintrup, Anna Ledwoch, and Jose Barros. Topological robustness of the global automotive industry.Logistics Research, 9(1):1, 2016

  37. [37]

    Geoeconomics.Annual Review of Eco- nomics, 17, 2025

    Cathrin Mohr and Christoph Trebesch. Geoeconomics.Annual Review of Eco- nomics, 17, 2025

  38. [38]

    A data-analytics approach to identifying hidden critical suppliers in supply networks: Development of nexus supplier index.Decision Support Systems, 114:37–48, 2018

    Benjamin BM Shao, Zhan Michael Shi, Thomas Y Choi, and Sangho Chae. A data-analytics approach to identifying hidden critical suppliers in supply networks: Development of nexus supplier index.Decision Support Systems, 114:37–48, 2018

  39. [39]

    Supply chain network rewiring dynamics at the firm-level.arXiv preprint arXiv:2503.20594, 2025

    Tobias Reisch, Andr´ as Borsos, and Stefan Thurner. Supply chain network rewiring dynamics at the firm-level.arXiv preprint arXiv:2503.20594, 2025

  40. [40]

    Orbis database.https://www.bvdinfo.com/en-gb/our-produ cts/data/orbis

    Bureau van Dijk. Orbis database.https://www.bvdinfo.com/en-gb/our-produ cts/data/orbis. Accessed 2025-11-10

  41. [41]

    Eto chip explorer.https://github.com/georgetown-cset/ eto-chip-explorer

    Georgetown CSET. Eto chip explorer.https://github.com/georgetown-cset/ eto-chip-explorer. Accessed 2025-11-10. 25

  42. [42]

    Semiconductor materials and equipment.https://abachy.com/

    Abachy.com. Semiconductor materials and equipment.https://abachy.com/. Accessed 2025-11-10

  43. [43]

    Common crawl dataset.https://commoncrawl.org/

    Common Crawl Foundation. Common crawl dataset.https://commoncrawl.org/. Accessed 2025-11-10

  44. [44]

    Cost-effective big data orchestra- tion using dagster: A multi-platform approach.arXiv preprint arXiv: 2408.11635, 2024

    Hernan Picatto, Georg Heiler, and Peter Klimek. Cost-effective big data orchestra- tion using dagster: A multi-platform approach.arXiv preprint arXiv: 2408.11635, 2024

  45. [45]

    S&P capital IQ database.https://www.capita liq.com

    S&P Global Market Intelligence. S&P capital IQ database.https://www.capita liq.com. Accessed 2025-11-10

  46. [46]

    Inter-country input–output (ICIO) tables.https://www.oecd.org/sti/i nd/inter-country-input-output-tables.htm, 2023

    OECD. Inter-country input–output (ICIO) tables.https://www.oecd.org/sti/i nd/inter-country-input-output-tables.htm, 2023. Accessed 2025-11-10

  47. [47]

    Baci: International trade database at the product-level (the 2020 version).https://www.cepii.fr/CEPII/en/bdd_modele /presentation.asp?id=37, 2020

    Guillaume Gaulier and Soledad Zignago. Baci: International trade database at the product-level (the 2020 version).https://www.cepii.fr/CEPII/en/bdd_modele /presentation.asp?id=37, 2020. Accessed 2025-11-10. 26 Supplementary Information S1 Consistency Filtering and Robustness Analysis To ensure the temporal reliability of domains used in our analysis, we c...

  48. [48]

    The activity string ended with at least three consecutive 1’s (111), ensuring reten- tion of domains consistently active in the later years

  49. [49]

    The total number of 1’s (active years) exceeded four

  50. [50]

    The activity string contained at least one sequence of three or more consecutive 1’s

  51. [51]

    Applying these criteria yielded a core set of 2015 temporally consistent domains

    The activity string did not contain any sequence of three or more consecutive 0’s, except in cases where the pattern ended in111. Applying these criteria yielded a core set of 2015 temporally consistent domains. These domains were used in the main analysis. S1.1 Robustness Check Using a Broader Domain Set To assess the robustness of our results, we repeat...