Artificial Intelligence Specialization in the European Union: Underexplored Role of the Periphery at NUTS-3 Level
Pith reviewed 2026-05-15 21:22 UTC · model grok-4.3
The pith
Peripheral NUTS-3 regions lead in relative AI specialization across the EU.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Using bibliometric data classified into Electrical Engineering, Electronics & Computer Science at the macro level and Artificial Intelligence & Machine Learning at the meso level, the study applies RSI and RCI metrics to reveal that relative AI specialization concentrates in peripheral NUTS-3 regions rather than metropolitan centers, with selected peripheral areas combining high specialization and citation performance while overall specialization correlates only weakly with impact.
What carries the argument
Relative Specialization Index (RSI) applied to meso-level AI & Machine Learning Citation Topics at the NUTS-3 scale, which quantifies how much a region's AI output deviates from the expected share based on its total research activity.
Load-bearing premise
The Clarivate Citation Topics system correctly identifies AI and Machine Learning publications without significant misclassification or coverage differences across regions.
What would settle it
Re-running the RSI calculations on the same publications but with an independent AI topic classification that eliminates the reported concentration of high specialization in Eastern European and Spanish peripheral regions.
Figures
read the original abstract
This study examines the distribution of Artificial Intelligence (AI) research across European NUTS-3 regions during the period 2015-2024. Using bibliometric data from Clarivate InCites and the Citation Topics classification system, we analyse two hierarchical thematic levels: Electrical Engineering, Electronics & Computer Science (Macro Citation Topic 4) and Artificial Intelligence & Machine Learning (Meso Citation Topic 4.61). Relative Specialization Index (RSI) and Relative Citation Impact (RCI) indicators are calculated for 781 European NUTS-3 regions. While major metropolitan hubs such as Paris, Warszawa, and Madrid dominate in absolute publication volume, the results reveal that the highest levels of relative AI specialization are concentrated in peripheral regions, particularly in Eastern Europe and Spain. Granada and Vilniaus apskritis stand out as regions combining high specialization with strong citation visibility. The analysis further suggests a weak relationship between regional specialization and citation impact, revealing multiple regional profiles, including highly specialized regions with limited citation visibility, highly visible regions with comparatively low specialization, and diversified scientific systems combining moderate specialization with strong citation impact. Fyn emerges as an extreme case of very high citation impact despite relatively low specialization.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript analyzes the geographic distribution of AI research across 781 EU NUTS-3 regions (2015-2024) using Clarivate InCites bibliometric data and the Citation Topics taxonomy. It distinguishes macro topic 4 (Electrical Engineering, Electronics & Computer Science) from meso topic 4.61 (Artificial Intelligence & Machine Learning), computes Relative Specialization Index (RSI) and Relative Citation Impact (RCI) for each region, and reports that absolute publication volume is concentrated in hubs such as Paris, Warszawa and Madrid while the highest RSI values occur in peripheral regions, notably Granada and Vilniaus apskritis. The study further finds only a weak relationship between regional specialization and citation impact and identifies several distinct regional profiles.
Significance. If the underlying classification and data handling are reliable, the work supplies a policy-relevant descriptive map at an unusually fine NUTS-3 scale, shifting emphasis from absolute output in core cities to relative specialization in the periphery. The identification of multiple profiles (high-specialization/low-visibility, high-visibility/low-specialization, and balanced) adds nuance beyond simple rankings. The scale of the analysis (781 regions) and the use of established RSI/RCI formulas are strengths, but the absence of validation or robustness checks limits the strength of the headline claim.
major comments (2)
- [Data and Methods] Data and Methods section: The central RSI rankings rest entirely on the accuracy of Clarivate meso-level topic 4.61 labels for AI/ML. No validation against alternative taxonomies (arXiv cs.AI, Scopus AI keywords), manual coding checks, or sensitivity tests for regional coverage or language bias is reported. Because NUTS-3 units often have low publication counts, even modest misclassification rates can produce large swings in the regional AI share that enters the RSI numerator while the EU-wide denominator remains fixed.
- [Results] Results and Discussion: The headline finding that peripheral regions (Granada, Vilniaus apskritis) exhibit the highest relative specialization is presented without confidence intervals, bootstrap standard errors, or robustness checks to alternative normalizations or exclusion of multi-affiliated papers. For small-count regions this omission leaves open whether the reported top ranks are stable or artifacts of classification or counting conventions.
minor comments (2)
- The manuscript would benefit from an explicit table or appendix listing the top 10-15 regions by RSI together with their raw AI and total publication counts, RCI values, and population or researcher-base denominators used in any normalization.
- [Methods] Clarify in the Methods whether fractional counting was applied to multi-affiliated publications and whether any threshold on minimum publications per region was imposed before computing RSI.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments, which have helped us strengthen the robustness and transparency of our analysis. We address each major comment point by point below, indicating where revisions have been made to the manuscript.
read point-by-point responses
-
Referee: [Data and Methods] Data and Methods section: The central RSI rankings rest entirely on the accuracy of Clarivate meso-level topic 4.61 labels for AI/ML. No validation against alternative taxonomies (arXiv cs.AI, Scopus AI keywords), manual coding checks, or sensitivity tests for regional coverage or language bias is reported. Because NUTS-3 units often have low publication counts, even modest misclassification rates can produce large swings in the regional AI share that enters the RSI numerator while the EU-wide denominator remains fixed.
Authors: We acknowledge the referee's concern regarding the reliance on the proprietary Clarivate Citation Topics taxonomy. In the revised manuscript, we have expanded the Methods section to include a dedicated discussion of the taxonomy's construction and known limitations, citing prior validation studies on similar bibliometric classifications. We performed and now report sensitivity tests by applying minimum publication thresholds (excluding regions with fewer than 5 or 10 AI publications) and confirm that the top-ranked peripheral regions (Granada, Vilniaus apskritis) remain stable. Full cross-validation with arXiv cs.AI or Scopus keyword-based definitions is not feasible due to licensing restrictions on the InCites data; this limitation is now explicitly stated in the Discussion section along with a call for future multi-source studies. revision: partial
-
Referee: [Results] Results and Discussion: The headline finding that peripheral regions (Granada, Vilniaus apskritis) exhibit the highest relative specialization is presented without confidence intervals, bootstrap standard errors, or robustness checks to alternative normalizations or exclusion of multi-affiliated papers. For small-count regions this omission leaves open whether the reported top ranks are stable or artifacts of classification or counting conventions.
Authors: We agree that uncertainty measures are essential for interpreting rankings in low-count regions. The revised Results section now includes bootstrap standard errors (1,000 resamples) for RSI values of the top 20 regions, demonstrating that Granada and Vilniaus apskritis retain their leading positions with non-overlapping intervals relative to core hubs. We have also added a robustness check excluding multi-affiliated papers, which yields consistent top rankings. These additions are presented in new tables and figures, directly addressing concerns about stability. revision: yes
- Complete external validation of all Clarivate meso-topic 4.61 labels via manual coding or direct comparison to arXiv/Scopus taxonomies across the full dataset, due to proprietary data access restrictions.
Circularity Check
No circularity; standard indices on external bibliometric data
full rationale
The derivation uses the standard RSI formula (regional AI share divided by EU-wide AI share) and RCI on raw publication and citation counts from Clarivate InCites Citation Topics 4.61. No parameters are fitted to the target result, no self-citations form the load-bearing premise, and no step renames or re-derives the input data. The classification taxonomy is treated as an external input; any coverage concerns are external validity issues, not circularity. The chain from data to specialization rankings is direct and non-reductive.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption NUTS-3 regions form comparable units for measuring scientific specialization and impact
- domain assumption The Citation Topics hierarchical system validly separates AI & Machine Learning from broader engineering topics
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Clarivate Citation Topics 4.61 (Artificial Intelligence & Machine Learning) at NUTS-3 level, 2015-2024
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
D. Arranz et al. Trends in the use of AI in science: a bibliometric analysis. European Commission, R&D Paper Series, Working Paper 2023/04, March 2023
work page 2023
-
[2]
Bibliometric analysis of scientific production on artificial intelligence from 1960 to 2021,
X. Liu et al., “Bibliometric analysis of scientific production on artificial intelligence from 1960 to 2021,” Technol. Forecast. Soc. Change, 2024
work page 1960
-
[3]
An extensive bibliometric analysis of artificial intelligence techniques from 2013 to 2023,
H. Saeidnia et al., “An extensive bibliometric analysis of artificial intelligence techniques from 2013 to 2023,” J. Supercomput., 2025
work page 2013
-
[4]
Bibliometric Mining of Research Trends in Machine Learning,
P. Waltman et al., “Bibliometric Mining of Research Trends in Machine Learning,” AI, vol. 5, no. 1, art. 12, 2024
work page 2024
-
[5]
A Decade of Artificial Intelligence Research in the European Union: A Bibliometric Analysis,
A. Frankowska and B. Pawlik, “A Decade of Artificial Intelligence Research in the European Union: A Bibliometric Analysis,” in C. Biele et al. (eds.) Digital Interaction and Machine Intelligence. MIDI 2021. LNNS, vol 440, 2022, pp. 52-62
work page 2021
-
[6]
J. Hautala, “Reviewing research on regional development in the AI era — new there(s), new actors, and an old call “from cluster to process,” Fennia vol. 202, no. 2, pp. 299- 312, 2024
work page 2024
-
[7]
Research into journalism in Spain: Sizeable, but neither international nor impactful,
B. Salvador -Mata, S. Cortiñas -Rovira, and V. Herrero - Solana, "Research into journalism in Spain: Sizeable, but neither international nor impactful," Journalism, vol. 25, no. 1, pp. 1–22, 2024
work page 2024
-
[8]
A new bibliometric approach to assess the scientific specialization of regions,
G. Abramo, C. A. D’Angelo, and F. Di Costa, “A new bibliometric approach to assess the scientific specialization of regions,” Research Evaluation, vol. 23, no. 2, pp. 183–194, 2014
work page 2014
-
[9]
Mainstream research in Latin America and the Caribbean,
J. D. Frame, “Mainstream research in Latin America and the Caribbean,” Interciencia, vol. 2, no. 3, pp. 143 –148, 1977
work page 1977
-
[10]
A. Schubert and T. Braun, “Relative indicators and relational charts for comparative assessment of publication output and citation impact,” Scientometrics, vol. 9, pp. 281–291, 1986
work page 1986
-
[11]
Trade liberalisation and ‘revealed’ comparative advantage,
B. Balassa, “Trade liberalisation and ‘revealed’ comparative advantage,” The Manchester School, vol. 33, pp. 99–123, 1965
work page 1965
-
[12]
Reflections on the activity index and related indicators,
R. Rousseau and L.Y. Yang, “Reflections on the activity index and related indicators,” J. Informetrics, vol. 6, pp. 413–421, 2012
work page 2012
-
[13]
Balassa = revealed competitive advantage = activity,
R. Rousseau, “Balassa = revealed competitive advantage = activity,” Scientometrics, vol. 121, pp. 1835 –1836, 2019
work page 2019
-
[14]
A note on using revealed comparative advantages in scientometrics studies,
M. J. Mansourzadeh et al. “A note on using revealed comparative advantages in scientometrics studies,” Scientometrics, vol. 121, pp. 595–599, 2019
work page 2019
-
[15]
From Louvain to Leiden: Guaranteeing well -connected communities,
V.A. Traag, L. Waltman, and N.J. van Eck, “From Louvain to Leiden: Guaranteeing well -connected communities,” Scientific Reports, vol. 9, art. 5233, 2019
work page 2019
-
[16]
Revealed comparative advantage and the alternatives as measures of international specialization,
K. Laursen, “Revealed comparative advantage and the alternatives as measures of international specialization,” Eurasian Business Review, vol. 5, pp. 99–115, 2015
work page 2015
-
[17]
Computer science in Eastern Europe 1989 –2014: a bibliometric study,
D. Fiala, and P. Willett, “Computer science in Eastern Europe 1989 –2014: a bibliometric study,” Aslib J of Inform Management, vol. 67, pp. 526–541, 2015
work page 1989
-
[18]
The creation and integration of AI in Europe,
V. Buarque et al. "The creation and integration of AI in Europe," Cambridge J. of Regions, Econ and Soc, vol. 13, pp. 175-193, 2020
work page 2020
-
[19]
The emergence of artificial intelligence in European regions: the role of a local ICT base,
J. Xiao and T. Boschma, “The emergence of artificial intelligence in European regions: the role of a local ICT base,” Annals of Regional Sci, vol. 71, pp. 747 –773, 2023
work page 2023
-
[20]
EU- funded investment in Artificial Intelligence and regional specialization,
A.M. Santos, F. Molica, and C. Torrecilla -Salinas, “EU- funded investment in Artificial Intelligence and regional specialization,” Regional Sci Policy & Practice, vol. 17, 100190, 2025
work page 2025
-
[21]
G. Cicerone, A. Faggian, S. Montresor, and F. Rentocchini, “Regional artificial intelligence and the geography of environmental technologies: does local AI knowledge help regional green -tech specialization?” Regional Studies, vol. 57, pp. 330–343, 2022
work page 2022
-
[22]
Patents of Industry 4.0 in Spain: topics and actors,
V. Herrero -Solana and B. Jürgens, “Patents of Industry 4.0 in Spain: topics and actors,” Ibersid, vol. 19, no. 1, pp. 45–53, 2025
work page 2025
-
[23]
Innovation policy for a complex world,
P.A. Balland, “Innovation policy for a complex world,” in Science, Research and Innovation Performance of the EU 2022. Brussels: European Commission, 2022. 6 Fig. 3. Relative Specialization Index (RSI) vs. Relative Citation Impact (RCI) in regions with at least 100 papers in AI
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.