pith. sign in

arxiv: 2605.16333 · v1 · pith:X6DLDDXJnew · submitted 2026-05-05 · 💻 cs.DL · cs.IR

SotA Lens: A Network-Augmented Methodology and Tool for Exploratory State-of-the-Art Reviews

Pith reviewed 2026-05-20 23:52 UTC · model grok-4.3

classification 💻 cs.DL cs.IR
keywords state of the art reviewcitation networkcommunity detectionexploratory reviewbibliometricsresearch mappingsystematic reviewmultidisciplinary fields
0
0 comments X

The pith

SotA Lens combines citation graphs and community detection to support exploratory state-of-the-art reviews.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Researchers often need to explore a broad field before they can define a precise protocol for a systematic review, especially in multidisciplinary topics where terms vary. SotA Lens offers a workflow that starts with seed searches, expands through citations to build a directed graph, applies community detection, and lets users label the resulting clusters. In a demonstration on Dynamic Projection-Mapping and Spatial Augmented Reality, this produced a graph with nearly 2,200 papers and 16 interpretable communities from a modest starting set. The method generates auditable artifacts that help identify clusters and gaps to guide the next steps of a review. It is meant to complement rather than replace established protocols like PRISMA.

Core claim

The paper claims that constructing a citation network from bounded expansions of seed search results and then applying community detection allows for the identification of human-interpretable research communities, thereby enabling a structured exploratory review that maps the state of the art in fields with dispersed or unstable terminology, as shown by labeling sixteen communities in the proof-of-concept case.

What carries the argument

The SotA Lens workflow that builds a directed citation graph from seed results and uses community detection to reveal research clusters.

Load-bearing premise

That citation connections and community detection will group papers into clusters that correspond to meaningful and distinguishable research communities.

What would settle it

If domain experts reviewing the output communities report that they do not reflect actual divisions in the research landscape, the method's mapping capability would be called into question.

Figures

Figures reproduced from arXiv: 2605.16333 by Diogo Peralta Cordeiro.

Figure 1
Figure 1. Figure 1: Initial conceptual map of Dynamic Projection-Mapping within neighbouring extended [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Simplified article-search and citation-expansion workflow implemented by [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Largest connected component of the 2010–2023 filtered citation graph. Each vertex is [PITH_FULL_IMAGE:figures/full_fig_p007_3.png] view at source ↗
read the original abstract

Researchers often begin new projects by conducting a broad State-of-the-Art review before they are ready to define the narrow protocol required by a systematic review. This is especially common in multidisciplinary areas where terminology is unstable, communities are weakly connected, and relevant work is dispersed across technical and application domains. This paper presents SotA Lens, a network-augmented methodology and lightweight software toolkit for exploratory State-of-the-Art reviews. The approach combines documented seed search, DOI-level metadata resolution, bounded citation expansion, directed graph construction, community detection, ranking of authors and subject terms, and human labelling of research communities. It is designed to complement, not replace, established review protocols such as PRISMA, PRISMA-ScR, systematic mapping studies, and bibliometric science mapping. The method is demonstrated through a proof-of-concept review of Dynamic Projection-Mapping and Spatial Augmented Reality. Starting from approximately 200 seed search results, the workflow produced a citation graph with 2,198 DOI-level vertices and 8,249 reference edges; a filtered largest component for 2010-2023 contained 986 vertices, 2,693 edges, and sixteen labelled communities. The contribution is both methodological and practical: SotA Lens helps researchers map broad fields, identify clusters and gaps, and produce auditable review artifacts before committing to a narrower systematic review protocol. This paper is not intended as a domain survey of Dynamic Projection-Mapping or Spatial Augmented Reality; rather, it introduces and demonstrates an original review-support methodology and software artifact using that domain as a proof-of-concept case study.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces SotA Lens, a network-augmented methodology and lightweight software toolkit for exploratory state-of-the-art reviews in broad, multidisciplinary fields with unstable terminology. It combines documented seed searches, DOI metadata resolution, bounded citation expansion to construct directed citation graphs, community detection, ranking of authors and subject terms, and human labelling of communities. Positioned as a complement to protocols such as PRISMA, the approach is demonstrated in a proof-of-concept on Dynamic Projection-Mapping and Spatial Augmented Reality, producing a citation graph with 2,198 DOI-level vertices and 8,249 reference edges; the filtered 2010-2023 largest component contains 986 vertices, 2,693 edges, and sixteen labelled communities. The central contribution is a practical workflow for mapping fields, identifying clusters and gaps, and generating auditable review artifacts before narrowing to systematic protocols.

Significance. If the detected communities align with meaningful, human-interpretable subfields, SotA Lens could offer a useful pre-protocol tool for researchers to explore dispersed literature and produce reproducible artifacts. The provision of a documented workflow, concrete POC outputs (2,198 vertices, 16 communities), and a software toolkit are strengths that support practical adoption in digital libraries and bibliometrics research.

major comments (2)
  1. Proof-of-Concept Demonstration: The workflow identifies sixteen author-labelled communities in the 986-vertex filtered graph (2010-2023 component), yet supplies no external validation such as expert agreement scores, comparison to existing domain taxonomies, or coherence metrics to confirm that these clusters correspond to real subfield structures rather than citation-pattern artifacts. This validation is load-bearing for the claim that SotA Lens reliably helps identify meaningful clusters and gaps.
  2. Methodology section on community detection: The description does not specify the community-detection algorithm (e.g., Louvain, Leiden) or its parameters, nor does it report quantitative graph properties such as modularity scores for the sixteen communities; without these details the reproducibility of the labelled clusters cannot be assessed.
minor comments (2)
  1. The abstract and workflow description would benefit from a high-level diagram illustrating the sequence of steps (seed search through human labelling) to improve readability for readers unfamiliar with network methods.
  2. Clarify the exact criteria and thresholds used for filtering the largest connected component and for ranking authors/subject terms, as these choices affect the final community structure.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments and the opportunity to clarify our manuscript. We respond to each major comment below, indicating where revisions will be made.

read point-by-point responses
  1. Referee: Proof-of-Concept Demonstration: The workflow identifies sixteen author-labelled communities in the 986-vertex filtered graph (2010-2023 component), yet supplies no external validation such as expert agreement scores, comparison to existing domain taxonomies, or coherence metrics to confirm that these clusters correspond to real subfield structures rather than citation-pattern artifacts. This validation is load-bearing for the claim that SotA Lens reliably helps identify meaningful clusters and gaps.

    Authors: We acknowledge that the proof-of-concept relies primarily on author-driven labelling of communities without supplementary quantitative validation such as coherence scores or external expert agreement. The methodology intentionally incorporates human labelling as an integral step for exploratory reviews in fields with unstable terminology. We will revise the manuscript to provide a more detailed account of the labelling process (including how top authors, terms, and sample papers were inspected) and add an explicit limitations subsection noting the lack of external validation metrics while outlining avenues for future work, such as inter-rater studies or taxonomy comparisons. This strengthens transparency without overstating the current demonstration's scope. revision: partial

  2. Referee: Methodology section on community detection: The description does not specify the community-detection algorithm (e.g., Louvain, Leiden) or its parameters, nor does it report quantitative graph properties such as modularity scores for the sixteen communities; without these details the reproducibility of the labelled clusters cannot be assessed.

    Authors: We agree this detail is necessary for reproducibility. Community detection was performed with the Louvain algorithm (python-louvain implementation, resolution parameter 1.0) on the filtered 2010-2023 component, yielding a modularity of 0.61. We will update the Methodology section to name the algorithm, library, exact parameters, and report the modularity score. The accompanying software repository will be updated with the precise configuration and scripts used. revision: yes

Circularity Check

0 steps flagged

Methodology is self-contained with no reduction to fitted inputs or self-citations

full rationale

The paper defines SotA Lens as a sequence of documented steps (seed search, DOI resolution, bounded citation expansion, directed graph construction, community detection, author/term ranking, and human labelling) that are specified independently of any particular dataset or fitted parameter. The proof-of-concept applies this workflow to Dynamic Projection-Mapping literature to produce a 986-vertex graph and sixteen labelled communities, but makes no claim that these outputs are predictions derived from the same data or that the method's validity reduces to a tautology. No equations, uniqueness theorems, or ansatzes are invoked via self-citation; the contribution is the workflow description itself, which remains falsifiable against external benchmarks such as expert agreement or existing taxonomies. This is the normal case of a methodological paper whose derivation chain does not collapse by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on standard bibliometric assumptions rather than new free parameters or invented entities; the workflow treats citation graphs as proxies for research structure without additional fitted constants beyond the proof-of-concept example.

axioms (1)
  • domain assumption Citation networks and community detection algorithms produce clusters that align with human-interpretable research communities
    Invoked when the method applies community detection to the citation graph and proceeds to human labelling of the resulting communities.

pith-pipeline@v0.9.0 · 5820 in / 1275 out tokens · 38355 ms · 2026-05-20T23:52:51.516628+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

20 extracted references · 20 canonical work pages

  1. [1]

    Bibliometrix: An R-tool for comprehensive science mapping analysis.Journal of Informetrics, 11(4):959–975, 2017

    Massimo Aria and Corrado Cuccurullo. Bibliometrix: An R-tool for comprehensive science mapping analysis.Journal of Informetrics, 11(4):959–975, 2017. doi: 10.1016/j.joi.2017.08.007

  2. [2]

    Gephi: An open source software for exploring and manipulating networks

    Mathieu Bastian, Sebastien Heymann, and Mathieu Jacomy. Gephi: An open source software for exploring and manipulating networks. InProceedings of the International AAAI Conference on Web and Social Media, volume 3, pages 361–362, 2009

  3. [3]

    A K Peters/CRC Press, Natick, MA, USA, 2005

    Oliver Bimber and Ramesh Raskar.Spatial Augmented Reality: Merging Real and Virtual Worlds. A K Peters/CRC Press, Natick, MA, USA, 2005

  4. [4]

    : High-resolution volumetric computation of offset surfaces with feature preservation

    Oliver Bimber, Daisuke Iwai, Gordon Wetzstein, and Anselm Grundhöfer. The visual com- puting of projector-camera systems.Computer Graphics Forum, 27(8):2219–2245, 2008. doi: 10.1111/j.1467-8659.2008.01175.x

  5. [5]

    VOSviewer: Visualizing scientific landscapes.https://www.vosviewer.com/, 2026

    Centre for Science and Technology Studies, Leiden University. VOSviewer: Visualizing scientific landscapes.https://www.vosviewer.com/, 2026. Accessed 28 April 2026

  6. [6]

    Chaomei Chen. CiteSpace II: Detecting and visualizing emerging trends and transient patterns in scientific literature.Journal of the American Society for Information Science and Technology, 57(3):359–377, 2006. doi: 10.1002/asi.20317. 10

  7. [7]

    Grant and Andrew Booth

    Maria J. Grant and Andrew Booth. A typology of reviews: An analysis of 14 review types and associated methodologies.Health Information & Libraries Journal, 26(2):91–108, 2009. doi: 10.1111/j.1471-1842.2009.00848.x

  8. [8]

    Satyanarayan and J

    Anselm Grundhöfer and Daisuke Iwai. Recent advances in projection mapping algorithms, hardware and applications.Computer Graphics Forum, 37(2):653–675, 2018. doi: 10.1111/cgf. 13335

  9. [9]

    Hagberg, Daniel A

    Aric A. Hagberg, Daniel A. Schult, and Pieter J. Swart. Exploring network structure, dynamics, and function using NetworkX. InProceedings of the 7th Python in Science Conference, pages 11–15, 2008

  10. [10]

    Hendricks, D

    Ginny Hendricks, Dominika Tkaczyk, Jennifer Lin, and Patricia Feeney. Crossref: The sustainable source of community-owned scholarly metadata.Quantitative Science Studies, 1 (1):414–427, 2020. doi: 10.1162/qss_a_00022

  11. [11]

    Projectionmappingtechnologies: Areviewofcurrenttrendsandfuturedirections

    DaisukeIwai. Projectionmappingtechnologies: Areviewofcurrenttrendsandfuturedirections. Proceedings of the Japan Academy, Series B, 100(3):234–251, 2024. doi: 10.2183/pjab.100.012

  12. [12]

    Mark E. J. Newman. Modularity and community structure in networks.Proceedings of the National Academy of Sciences, 103(23):8577–8582, 2006. doi: 10.1073/pnas.0601602103

  13. [13]

    BMJ372(71), 1–9 (2021) https://doi.org/10.1136/bmj.n71

    Matthew J. Page, Joanne E. McKenzie, Patrick M. Bossuyt, Isabelle Boutron, Tammy C. Hoff- mann, Cynthia D. Mulrow, Larissa Shamseer, Jennifer M. Tetzlaff, Elie A. Akl, Sue E. Brennan, Roger Chou, Julie Glanville, Jeremy M. Grimshaw, Asbjørn Hróbjartsson, Manoj M. Lalu, Tianjing Li, Elizabeth W. Loder, Evan Mayo-Wilson, Steve McDonald, Luke A. McGuinness, ...

  14. [14]

    SotA Lens: Citation-network mapping for exploratory state-of-the-art reviews, 2026

    Diogo Peralta Cordeiro. SotA Lens: Citation-network mapping for exploratory state-of-the-art reviews, 2026. URLhttps://doi.org/10.5281/zenodo.19860899

  15. [15]

    Guidelines for conducting systematic mapping studies in software engineering: An update.Information and Software Technology, 64:1–18, 2015

    Kai Petersen, Sairam Vakkalanka, and Ludwik Kuzniarz. Guidelines for conducting systematic mapping studies in software engineering: An update.Information and Software Technology, 64:1–18, 2015. doi: 10.1016/j.infsof.2015.03.007

  16. [16]

    PRISMA 2020.https://www.prisma-statement.org/prisma-2020,

    PRISMA Statement. PRISMA 2020.https://www.prisma-statement.org/prisma-2020,

  17. [17]

    Accessed 28 April 2026

  18. [18]

    PRISMA for scoping reviews.https://www.prisma-statement.org/ scoping, 2026

    PRISMA Statement. PRISMA for scoping reviews.https://www.prisma-statement.org/ scoping, 2026. Accessed 28 April 2026

  19. [19]

    Tricco, Erin Lillie, Wasifa Zarin, Kelly K

    Andrea C. Tricco, Erin Lillie, Wasifa Zarin, Kelly K. O’Brien, Heather Colquhoun, Danielle Levac, David Moher, Micah D. J. Peters, Tanya Horsley, Laura Weeks, Susanne Hempel, Elie A. Akl, Christine Chang, Jessie McGowan, Lesley Stewart, Lisa Hartling, Adrian Aldcroft, Michael G. Wilson, Chantelle Garritty, Simon Lewin, Christina M. Godfrey, Marilyn T. Mac...

  20. [20]

    Scientometrics , author =

    Nees Jan van Eck and Ludo Waltman. Software survey: VOSviewer, a computer program for bibliometric mapping.Scientometrics, 84(2):523–538, 2010. doi: 10.1007/s11192-009-0146-3. 11