pith. sign in

arxiv: 1907.02790 · v1 · pith:AK4AHEWEnew · submitted 2019-07-05 · 💻 cs.DB

Interlinking Heterogeneous Data for Smart Energy Systems

Pith reviewed 2026-05-25 01:50 UTC · model grok-4.3

classification 💻 cs.DB
keywords linked dataRDFsmart energy systemsphotovoltaic datadata integrationheterogeneous dataweather records
0
0 comments X

The pith

Converting PV and weather records to RDF graphs creates a uniform format for linking isolated energy datasets

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Smart energy systems depend on photovoltaic and weather data that currently sit in separate, incompatible formats across many sources. The paper proposes turning those records into an RDF graph format using established web standards so that the data becomes homogeneous and datasets from different places can be connected. A reader would care because this step would let analysis tools work across multiple datasets instead of being limited to one isolated collection at a time.

Core claim

We propose an approach based on Web (W3C) standards and Linked Data technologies for representing and converting PV and weather records into an Resource Description Framework (RDF) graph-based data format. This format, and the presented approach, is ideal in a data integration scenario where data needs to be converted into homogeneous form and different datasets could be interlinked for distributed analysis.

What carries the argument

RDF graph-based data format produced by Linked Data conversion of PV and weather records, which performs the work of turning heterogeneous inputs into a single interlinkable structure.

If this is right

  • Machine-learning frameworks designed on one dataset can be tested on data from other sites.
  • Results from separate studies become directly comparable.
  • Analysis can be reproduced using records collected at different locations.
  • Data from distributed sources moves from isolated collections to a single linked graph.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same conversion steps could be applied to other energy data types such as consumption or grid measurements.
  • Semantic web query tools could then search across weather conditions and power output in one step.
  • Long-term maintenance would require checking that new data sources still map cleanly into the same graph structure.

Load-bearing premise

Converting the original records to RDF keeps every necessary detail and allows interlinking to happen without new inconsistencies or unreasonable effort.

What would settle it

Take two existing PV and weather datasets, apply the conversion to RDF, attempt to interlink them, and check whether combined queries return the original values without loss or added contradictions.

Figures

Figures reproduced from arXiv: 1907.02790 by Alan Meehan, Declan O'Sullivan, Fabrizio Orlandi, Murhaf Hossari, Soumyabrata Dev, Tarek AlSkaif.

Figure 1
Figure 1. Figure 1: The Photovoltaic and Weather Analysis (PWA) Ontology [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Data Conversion Process 09 pwa:hasObservation ?PVObservation . 10 11 ?WeatherRec pwa:hasWeatherMeasure ?WMeasure . 12 13 ?WMeasure pwa:name "cloud cover" ; 14 pwa:value ?CloudCoverValue . 15 FILTER (?CloudCoverValue > 0.05) . 16 17 ?PVObservation a pwa:PhotovoltaicObservation ; 18 pwa:value ?PVValue . 19 FILTER (?PVValue > 0.02) . 20 } 21 ORDER BY ?CloudCoverValue V. CONCLUSION AND FUTURE WORK In this pape… view at source ↗
read the original abstract

Smart energy systems in general, and solar energy analysis in particular, have recently gained increasing interest. This is mainly due to stronger focus on smart energy saving solutions and recent developments in photovoltaic (PV) cells. Various data-driven and machine-learning frameworks are being proposed by the research community. However, these frameworks perform their analysis - and are designed on - specific, heterogeneous and isolated datasets, distributed across different sites and sources, making it hard to compare results and reproduce the analysis on similar data. We propose an approach based on Web (W3C) standards and Linked Data technologies for representing and converting PV and weather records into an Resource Description Framework (RDF) graph-based data format. This format, and the presented approach, is ideal in a data integration scenario where data needs to be converted into homogeneous form and different datasets could be interlinked for distributed analysis.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 0 minor

Summary. The paper claims that an approach based on W3C standards and Linked Data technologies can represent and convert PV and weather records into an RDF graph-based data format. This format is presented as ideal for data integration scenarios, enabling conversion to a homogeneous form and interlinking of different datasets for distributed analysis in smart energy systems.

Significance. If validated, the approach could help overcome heterogeneity in isolated datasets used by machine-learning frameworks for solar energy analysis, supporting better comparison and reproducibility of results. The manuscript provides no implementation details, concrete mappings, or validation results to support the claim.

major comments (1)
  1. [Abstract, final paragraph] Abstract, final paragraph: The central claim that the RDF format 'is ideal in a data integration scenario where data needs to be converted into homogeneous form and different datasets could be interlinked' is unsupported. No concrete mapping rules, worked examples on real records, round-tripping checks, or consistency analysis are provided to demonstrate that all fields, units, timestamps, and provenance are preserved without loss or new inconsistencies.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their detailed review and constructive feedback. We address the major comment below and will revise the manuscript accordingly.

read point-by-point responses
  1. Referee: [Abstract, final paragraph] Abstract, final paragraph: The central claim that the RDF format 'is ideal in a data integration scenario where data needs to be converted into homogeneous form and different datasets could be interlinked' is unsupported. No concrete mapping rules, worked examples on real records, round-tripping checks, or consistency analysis are provided to demonstrate that all fields, units, timestamps, and provenance are preserved without loss or new inconsistencies.

    Authors: We agree that the abstract claim would be strengthened by concrete evidence. The manuscript currently focuses on a high-level proposal for applying W3C Linked Data standards to PV and weather data. In the revised version we will add a dedicated section containing explicit mapping rules, worked examples drawn from real PV and weather records, and a discussion of how fields, units, timestamps and provenance are represented so that readers can verify preservation and absence of new inconsistencies. These additions will directly support the integration scenario described. revision: yes

Circularity Check

0 steps flagged

No circularity: methodological proposal with no derivations or self-referential reductions

full rationale

The paper proposes converting PV/weather records to RDF using W3C/Linked Data standards for homogeneous integration and interlinking. No equations, fitted parameters, predictions, or uniqueness theorems appear. The central claim is a high-level methodological suggestion rather than a derived result; it does not reduce to its own inputs by construction, self-citation chains, or renaming. The manuscript supplies no internal validation steps that could create circularity, making the finding self-contained as a standards-based recommendation.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

No free parameters, invented entities, or novel axioms introduced; the work rests on the domain assumption that W3C Linked Data standards are suitable for energy data integration.

axioms (1)
  • domain assumption W3C Linked Data and RDF standards provide a suitable homogeneous representation for heterogeneous PV and weather records
    Invoked in the proposal without further justification or evidence of information preservation.

pith-pipeline@v0.9.0 · 5687 in / 1021 out tokens · 30142 ms · 2026-05-25T01:50:10.785272+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

22 extracted references · 22 canonical work pages

  1. [1]

    Analyzing solar irradiance variation from GPS and cameras,

    S. Manandhar, S. Dev, Y . H. Lee, and Y . S. Meng, “Analyzing solar irradiance variation from GPS and cameras,” in 2018 USNC-URSI Radio Science Meeting (Joint with AP-S Symposium) . IEEE, 2018, pp. 93–94

  2. [2]

    A data- driven approach to detect precipitation from meteorological sensor data,

    S. Manandhar, S. Dev, Y . H. Lee, Y . S. Meng, and S. Winkler, “A data- driven approach to detect precipitation from meteorological sensor data,” in IEEE International Geoscience and Remote Sensing Symposium . IEEE, 2018, pp. 3872–3875

  3. [3]

    Systematic study of weather variables for rainfall detection,

    S. Manandhar, S. Dev, Y . H. Lee, S. Winkler, and Y . S. Meng, “Systematic study of weather variables for rainfall detection,” in IEEE International Geoscience and Remote Sensing Symposium . IEEE, 2018, pp. 3027–3030

  4. [4]

    Linked data: The story so far,

    C. Bizer, T. Heath, and T. Berners-Lee, “Linked data: The story so far,” in Semantic services, interoperability and web applications: emerging concepts, pp. 205–227. IGI Global, 2011

  5. [5]

    The semantic web,

    T. Berners-Lee, J. Hendler, O. Lassila, et al., “The semantic web,” Scientific American, vol. 284, no. 5, pp. 28–37, 2001

  6. [6]

    Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web (Dagstuhl Seminar 18371),

    P. A. Bonatti, S. Decker, A. Polleres, and V . Presutti, “Knowledge Graphs: New Directions for Knowledge Representation on the Semantic Web (Dagstuhl Seminar 18371),” Dagstuhl Reports , vol. 8, no. 9, pp. 29–111, 2019

  7. [7]

    Vendor landscape: Graph databases,

    “Vendor landscape: Graph databases,” https://go.neo4j.com/rs/ 710-RRC-335/images/Forrester-Research-Neo4j-Graph-Databases.pdf, Accessed: 2018-12-08

  8. [8]

    Semantic integration: a survey of ontology-based ap- proaches,

    N. F. Noy, “Semantic integration: a survey of ontology-based ap- proaches,” ACM Sigmod Record , vol. 33, no. 4, pp. 65–70, 2004

  9. [9]

    Domingue, D

    J. Domingue, D. Fensel, and J. A. Hendler, Handbook of semantic web technologies, Springer Science & Business Media, 2011

  10. [10]

    SPARQL 1.1 query language,

    S. Harris, A. Seaborne, and E. Prudhommeaux, “SPARQL 1.1 query language,” W3C recommendation, 2013

  11. [11]

    RDF primer,

    F. Manola, E. Miller, B. McBride, et al., “RDF primer,” W3C recommendation, 2004

  12. [12]

    Linked open vocabularies (LOV): a gateway to reusable semantic vocabularies on the web,

    P-Y . Vandenbussche, G. A. Atemezing, M. Poveda-Villal ´on, and B. Vatant, “Linked open vocabularies (LOV): a gateway to reusable semantic vocabularies on the web,” Semantic Web, vol. 8, no. 3, pp. 437–452, 2017

  13. [13]

    Transforming meteorological data into linked data,

    G. Atemezing, O. Corcho, D. Garijo, J. Mora, M. Poveda-Villal ´on, P. Rozas, D. Vila-Suero, and B. Villaz ´on-Terrazas, “Transforming meteorological data into linked data,” Semantic Web , vol. 4, no. 3, pp. 285–290, 2013

  14. [14]

    The SSN ontology of the W3C semantic sensor network incubator group,

    M. Compton, P. Barnaghi, L. Bermudez, R. Garc ´ıA-Castro, O. Corcho, S. Cox, J. Graybeal, M. Hauswirth, C. Henson, A. Herzog, et al., “The SSN ontology of the W3C semantic sensor network incubator group,” Web semantics: science, services and agents on the World Wide Web , vol. 17, pp. 25–32, 2012

  15. [15]

    Data linkage in medical science using the resource description framework: the A VERT model,

    B. P. Reddy, B. Houlding, L. Hederman, M. Canney, C. Debruyne, C. O’Brien, A. Meehan, D. O’Sullivan, and M. A. Little, “Data linkage in medical science using the resource description framework: the A VERT model,” HRB Open Research , vol. 1, pp. 20, mar 2019

  16. [16]

    Guidelines for linked data gen- eration and publication: An example in building energy consumption,

    F. Radulovic, M. Poveda-Villal ´on, D. Vila-Suero, V . Rodr´ıguez-Doncel, R. Garc´ıa-Castro, and A. G ´omez-P´erez, “Guidelines for linked data gen- eration and publication: An example in building energy consumption,” Automation in Construction , vol. 57, pp. 178–187, 2015

  17. [17]

    Deriving semantic sensor metadata from raw measurements,

    J-P. Calbimonte, O. Corcho, Z. Yan, H. Jeung, and K. Aberer, “Deriving semantic sensor metadata from raw measurements,” 2012

  18. [18]

    An evaluation of uplift mapping languages,

    A. Crotti Jr., C. Debruyne, R. Brennan, and D. OSullivan, “An evaluation of uplift mapping languages,” International Journal of Web Information Systems, vol. 13, no. 4, pp. 405–424, 2017

  19. [19]

    Ontology mapping: the state of the art,

    Y . Kalfoglou and M. Schorlemmer, “Ontology mapping: the state of the art,” The knowledge engineering review , vol. 18, no. 1, pp. 1–31, 2003

  20. [20]

    R2RML: RDB to RDF mapping language. W3C recommendation (2012),

    S. Das, S. Sundara, and R. Cyganiak, “R2RML: RDB to RDF mapping language. W3C recommendation (2012),” 2016

  21. [21]

    RML: A generic language for integrated RDF mappings of heterogeneous data.,

    A. Dimou, M. Vander Sande, P. Colpaert, R. Verborgh, E. Mannens, and R. Van de Walle, “RML: A generic language for integrated RDF mappings of heterogeneous data.,” in LDOW, 2014

  22. [22]

    A SPARQL extension for generating RDF from heterogeneous formats,

    M. Lefranc ¸ois, A. Zimmermann, and N. Bakerally, “A SPARQL extension for generating RDF from heterogeneous formats,” in European Semantic Web Conference. Springer, 2017, pp. 35–50