Optimizing Navigational Graph Queries
Pith reviewed 2026-05-24 00:26 UTC · model grok-4.3
The pith
Novel optimization techniques for navigational graph queries constrain intermediate results to achieve orders of magnitude faster evaluation on real workloads.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We present a number of novel powerful optimization techniques which aim to constrain the intermediate results during query evaluation. We show how these techniques can be planned effectively and executed efficiently towards the first practical evaluation solution for complex navigational queries on real-world workloads. Indeed, our experimental results show several orders of magnitude improvement in query evaluation performance over state-of-the-art techniques on a wide range of queries on diverse datasets.
What carries the argument
Optimization techniques that constrain intermediate results during evaluation of queries combining recursive and pattern-matching fragments.
If this is right
- Complex navigational queries become practically evaluable on real-world graph data without excessive resource use.
- Query engines can handle wider ranges of recursive and pattern-matching combinations at scale.
- Performance advantages appear consistently across diverse datasets and query workloads.
- The approach reduces reliance on ad-hoc tuning for graph query performance.
Where Pith is reading between the lines
- The same constraint-based approach might apply to optimizing other graph query languages that mix recursion with joins.
- Database systems could incorporate these planners to lower overall memory demands during query execution.
- Future work might test whether the techniques scale to streaming or dynamic graph settings.
Load-bearing premise
The proposed optimization techniques can be planned effectively and executed efficiently to constrain intermediate results for complex navigational queries on real-world workloads.
What would settle it
A test on real-world datasets where the techniques produce no reduction in intermediate result sizes or yield no measurable speedup over existing methods on navigational queries would disprove the central claim.
Figures
read the original abstract
We study the optimization of navigational graph queries, i.e., queries which combine recursive and pattern-matching fragments. Current approaches to their evaluation are not effective in practice. Towards addressing this, we present a number of novel powerful optimization techniques which aim to constrain the intermediate results during query evaluation. We show how these techniques can be planned effectively and executed efficiently towards the first practical evaluation solution for complex navigational queries on real-world workloads. Indeed, our experimental results show several orders of magnitude improvement in query evaluation performance over state-of-the-art techniques on a wide range of queries on diverse datasets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper studies the optimization of navigational graph queries combining recursive and pattern-matching fragments. It presents novel optimization techniques to constrain intermediate results during evaluation, claims these can be planned effectively and executed efficiently to yield the first practical evaluation solution for complex navigational queries on real-world workloads, and reports experimental results showing several orders of magnitude improvement over state-of-the-art techniques across a wide range of queries on diverse datasets.
Significance. If the techniques and results hold, the work would address a practical gap in graph database query evaluation where current approaches are ineffective, potentially enabling efficient handling of complex queries in real-world settings.
major comments (1)
- [Abstract] Abstract: the central claim of 'several orders of magnitude improvement' and 'first practical evaluation solution' rests entirely on experimental results, yet the provided manuscript text supplies no details on the optimization techniques, planning/execution methods, experimental setup, datasets, or baselines, rendering the claim impossible to assess for soundness.
Simulated Author's Rebuttal
We thank the referee for their review. We address the single major comment below.
read point-by-point responses
-
Referee: [Abstract] Abstract: the central claim of 'several orders of magnitude improvement' and 'first practical evaluation solution' rests entirely on experimental results, yet the provided manuscript text supplies no details on the optimization techniques, planning/execution methods, experimental setup, datasets, or baselines, rendering the claim impossible to assess for soundness.
Authors: The abstract is a concise summary. The full manuscript details the optimization techniques (Sections 3–4), planning and execution methods (Section 4), and experimental setup including datasets and baselines (Section 5). These sections support assessment of the claims. revision: no
Circularity Check
No significant circularity
full rationale
The paper describes novel optimization techniques for navigational graph queries and validates them via experimental results showing performance gains on real-world workloads. No equations, derivations, fitted parameters presented as predictions, or self-citation chains appear in the provided abstract or described content. The central claims rest on empirical evaluation of planning and execution efficiency rather than any self-definitional or constructionally equivalent steps. This is a standard experimental systems paper whose derivation chain is self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Angles, R., Aranda, C.B., Hogan, A., Ro- jas, C., Vrgoˇ c, D.: Wdbench: A wikidata graph query benchmark. In: The Seman- tic Web – ISWC 2022: 21st International Semantic Web Conference, Virtual Event, Oc- tober 23–27, 2022, Proceedings, p. 714–731. Springer-Verlag, Berlin, Heidelberg (2022). DOI 10.1007/978-3-031-19433-7 {\ }41. URL https: //doi.org/10.10...
-
[2]
Arroyuelo, D., G´ omez-Brand´ on, A., Hogan, A., Navarro, G., Rojas-Ledesma, J.: Optimizing rpqs over a compact graph representation. The VLDB Journal pp. 1–26 (2023)
work page 2023
-
[3]
Arroyuelo, D., Hogan, A., Navarro, G., Rojas- Ledesma, J.: Time- and space-efficient regular path queries. In: 38th IEEE International Con- ference on Data Engineering, ICDE 2022, Kuala Lumpur, Malaysia, May 9-12, 2022, pp. 3091–
work page 2022
-
[4]
IEEE (2022). DOI 10.1109/ICDE53745. 2022.00277. URL https://doi.org/10.1109/ ICDE53745.2022.00277
-
[5]
Barcel´ o, P., P´ erez, J., Reutter, J.L.: Relative ex- pressiveness of nested regular expressions. In: J. Freire, D. Suciu (eds.) Proceedings of the 6th Al- berto Mendelzon International Workshop on Foun- dations of Data Management, Ouro Preto, Brazil, June 27-30, 2012, CEUR Workshop Proceedings , vol. 866, pp. 180–195. CEUR-WS.org (2012). URL https://ce...
work page 2012
-
[6]
Synthesis Lectures on Data Management
Bonifati, A., Fletcher, G., Voigt, H., Yakovets, N.: Querying Graphs. Synthesis Lectures on Data Management. Morgan & Clay- pool Publishers (2018). DOI 10.2200/ S00873ED1V01Y201808DTM051. URL https:// doi.org/10.2200/S00873ED1V01Y201808DTM051
-
[7]
Bonifati, A., Martens, W., Timm, T.: An an- alytical study of large SPARQL query logs. VLDB J. 29(2-3), 655–679 (2020). DOI 10.1007/ s00778-019-00558-9. URL https://doi.org/10. 1007/s00778-019-00558-9
work page 2020
-
[8]
Calvanese, D., Giacomo, G.D., Lenzerini, M., Vardi, M.Y.: Containment of conjunctive regu- lar path queries with inverse. In: A.G. Cohn, F. Giunchiglia, B. Selman (eds.) KR 2000, Prin- ciples of Knowledge Representation and Reasoning Proceedings of the Seventh International Confer- ence, Breckenridge, Colorado, USA, April 11-15, 2000, pp. 176–185. Morgan ...
work page 2000
-
[9]
Codd, E.F.: A relational model of data for large shared data banks. Commun. ACM 13(6), 377–387 (1970). DOI 10.1145/362384.362685. URL https: //doi.org/10.1145/362384.362685
-
[10]
URL https://string-db.org/ cgi/about?footer{_}active{_}subpage= statistics
CONSORTIUM, S.: String database — statis- tics (2023). URL https://string-db.org/ cgi/about?footer{_}active{_}subpage= statistics. Accessed: 13 Oct 2023 16:00
work page 2023
-
[11]
DeHaan, D., Tompa, F.W.: Optimal top-down join enumeration. In: C.Y. Chan, B.C. Ooi, A. Zhou (eds.) Proceedings of the ACM SIGMOD Interna- tional Conference on Management of Data, Beijing, China, June 12-14, 2007, pp. 785–796. ACM (2007). 22 Thomas Mulder et al. DOI 10.1145/1247480.1247567. URL https:// doi.org/10.1145/1247480.1247567
-
[12]
TigerGraph: A Native MPP Graph Database
Deutsch, A., Xu, Y., Wu, M., Lee, V.E.: Tiger- graph: A native MPP graph database. CoRR abs/1901.08248 (2019). URL http://arxiv. org/abs/1901.08248
work page internal anchor Pith review Pith/arXiv arXiv 1901
-
[13]
Fender, P., Moerkotte, G.: A new, highly effi- cient, and easy to implement top-down join enu- meration algorithm. In: S. Abiteboul, K. B¨ ohm, C. Koch, K. Tan (eds.) Proceedings of the 27th In- ternational Conference on Data Engineering, ICDE 2011, April 11-16, 2011, Hannover, Germany, pp. 864–875. IEEE Computer Society (2011). DOI 10.1109/ICDE.2011.5767...
-
[14]
Fender, P., Moerkotte, G.: Reassessing top-down join enumeration. IEEE Trans. Knowl. Data Eng. 24(10), 1803–1818 (2012). DOI 10.1109/TKDE. 2011.235. URL https://doi.org/10.1109/TKDE. 2011.235
- [15]
-
[16]
https://github.com/kuzudb/kuzu (2022)
Feng, X., Jin, G., Chen, Z., Liu, C., Saliho˘ glu, S.: K` uzu Database Management System Source Code. https://github.com/kuzudb/kuzu (2022)
work page 2022
-
[17]
Feng, X., Jin, G., Chen, Z., Liu, C., Saliho˘ glu, S.: K` uzu graph database management system. In: CIDR (2023)
work page 2023
-
[18]
Francis, N., Gheerbrant, A., Guagliardo, P., Libkin, L., Marsault, V., Martens, W., Murlak, F., Peter- freund, L., Rogova, A., Vrgoc, D.: A researcher’s digest of GQL (invited talk). In: F. Geerts, B. Van- devoort (eds.) 26th International Conference on Database Theory, ICDT 2023, March 28-31, 2023, Ioannina, Greece, LIPIcs, vol. 255, pp. 1:1–1:22 (2023)
work page 2023
-
[19]
Han, Y., Wu, Z., Wu, P., Zhu, R., Yang, J., Tan, L.W., Zeng, K., Cong, G., Qin, Y., Pfadler, A., Qian, Z., Zhou, J., Li, J., Cui, B.: Cardinality estimation in dbms: a comprehensive benchmark evaluation. Proc. VLDB Endow. 15(4), 752–765 (2021). DOI 10.14778/3503585.3503586. URL https://doi.org/10.14778/3503585.3503586
-
[20]
Hofer, M., Hellmann, S., Dojchinovski, M., Frey, J.: The new dbpedia release cycle: Increasing agility and efficiency in knowledge extraction work- flows. In: E. Blomqvist, P. Groth, V. de Boer, T. Pellegrini, M. Alam, T. K¨ afer, P. Kiese- berg, S. Kirrane, A. Mero˜ no-Pe˜ nuela, H.J. Pan- dit (eds.) Semantic Systems. In the Era of Knowl- edge Graphs - 1...
-
[21]
Jachiet, L., Genev` es, P., Gesbert, N., Laya¨ ıda, N.: On the optimization of recursive relational queries: Application to graph queries. In: D. Maier, R. Pot- tinger, A. Doan, W. Tan, A. Alawini, H.Q. Ngo (eds.) Proceedings of the 2020 International Confer- ence on Management of Data, SIGMOD Conference 2020, online conference [Portland, OR, USA], June 1...
-
[22]
van Leeuwen, W., Fletcher, G., Yakovets, N.: A general cardinality estimation framework for sub- graph matching in property graphs. IEEE Trans. Knowl. Data Eng. 35(6), 5485–5505 (2023). DOI 10.1109/TKDE.2022.3161328. URL https://doi. org/10.1109/TKDE.2022.3161328
-
[23]
Semantic Web 6(2), 167–195 (2015)
Lehmann, J., Isele, R., Jakob, M., Jentzsch, A., Kontokostas, D., Mendes, P.N., Hellmann, S., Morsey, M., van Kleef, P., Auer, S., Bizer, C.: Db- pedia - A large-scale, multilingual knowledge base extracted from wikipedia. Semantic Web 6(2), 167–195 (2015). DOI 10.3233/SW-140134. URL https://doi.org/10.3233/SW-140134
-
[24]
Leis, V., Radke, B., Gubichev, A., Mirchev, A., Boncz, P.A., Kemper, A., Neumann, T.: Query optimization through the looking glass, and what we found running the join order benchmark. VLDB J. 27(5), 643–668 (2018). DOI 10.1007/ S00778-017-0480-7. URL https://doi.org/10. 1007/s00778-017-0480-7
work page 2018
-
[25]
Moerkotte, G., Neumann, T.: Dynamic program- ming strikes back. In: J.T. Wang (ed.) Proceed- ings of the ACM SIGMOD International Confer- ence on Management of Data, SIGMOD 2008, Van- couver, BC, Canada, June 10-12, 2008, pp. 539–552. ACM (2008). DOI 10.1145/1376616.1376672. URL https://doi.org/10.1145/1376616.1376672
-
[26]
Neumann, T.: Efficient generation and execution of dag-structured query graphs. Ph.D. thesis, University of Mannheim, Germany (2005). URL http://bibserv7.bib.uni-mannheim.de/madoc/ volltexte/2005/1089/index.html
work page 2005
-
[27]
Neumann, T., Freitag, M.J.: Umbra: A disk-based system with in-memory performance. In: 10th Conference on Innovative Data Systems Research, CIDR 2020, Amsterdam, The Netherlands, January Optimizing Navigational Graph Queries 23 12-15, 2020, Online Proceedings. www.cidrdb.org (2020). URL http://cidrdb.org/cidr2020/ papers/p29-neumann-cidr20.pdf
work page 2020
-
[28]
In: Proceedings of the Sixteenth International Confer- ence on Very Large Databases, p
Ono, K., Lohman, G.M.: Measuring the complex- ity of join enumeration in query optimization. In: Proceedings of the Sixteenth International Confer- ence on Very Large Databases, p. 314–325. Mor- gan Kaufmann Publishers Inc., San Francisco, CA, USA (1990)
work page 1990
-
[29]
URL https://github.com/duckdb/duckdb
Raasveldt, M., Muehleisen, H.: DuckDB (2023). URL https://github.com/duckdb/duckdb
work page 2023
-
[30]
Reutter, J.L., Romero, M., Vardi, M.Y.: Regu- lar queries on graph databases. Theory Com- put. Syst. 61(1), 31–83 (2017). DOI 10.1007/ s00224-016-9676-2. URL https://doi.org/10. 1007/s00224-016-9676-2
work page 2017
-
[31]
Nucleic Acids Research51(D1), D638–D646 (2022)
Szklarczyk, D., Kirsch, R., Koutrouli, M., Nas- tou, K., Mehryary, F., Hachilif, R., Gable, A.L., Fang, T., Doncheva, N., Pyysalo, S., Bork, P., Jensen, L., von Mering, C.: The STRING database in 2023: protein–protein association networks and functional enrichment analyses for any sequenced genome of interest. Nucleic Acids Research51(D1), D638–D646 (2022...
-
[32]
Willsey, M., Nandi, C., Wang, Y.R., Flatt, O., Tat- lock, Z., Panchekha, P.: Egg: Fast and extensible equality saturation. Proc. ACM Program. Lang. 5(POPL) (2021). DOI 10.1145/3434304. URL https://doi.org/10.1145/3434304
-
[33]
ten Wolde, D., Singh, T., Sz´ arnyas, G., Boncz, P.A.: Duckpgq: Efficient property graph queries in an analytical RDBMS. In: 13th Confer- ence on Innovative Data Systems Research, CIDR 2023, Amsterdam, The Netherlands, January 8-11,
work page 2023
-
[34]
www.cidrdb.org (2023). URL https://www. cidrdb.org/cidr2023/papers/p66-wolde.pdf
work page 2023
-
[35]
Yakovets, N., Godfrey, P., Gryz, J.: Query plan- ning for evaluating SPARQL property paths. In: F. ¨Ozcan, G. Koutrika, S. Madden (eds.) Pro- ceedings of the 2016 International Conference on Management of Data, SIGMOD Conference 2016, San Francisco, CA, USA, June 26 - July 01, 2016, pp. 1875–1889. ACM (2016). DOI 10. 1145/2882903.2882944. URL https://doi....
-
[36]
URL https://github.com/ ymous5380/ongq
Ymous, A.: Optimizing Navigational Queries On- line Appendix (2023). URL https://github.com/ ymous5380/ongq
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.