A Differentially Private Algorithm for Range Queries on Trajectories
Pith reviewed 2026-05-24 19:23 UTC · model grok-4.3
The pith
An algorithm for range queries on trajectories adds noise adaptively based on data partitions and query set to achieve lower error while guaranteeing differential privacy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The algorithm guarantees epsilon-differential privacy for range queries on trajectories by privately partitioning the data space into uniform regions, computing traffic density per region, and then estimating the distribution of trajectories over the queried space from the regions, densities, and given query set, which yields significantly lower error than prior methods.
What carries the argument
The adaptive noise mechanism that partitions space privately into regions, computes densities, and estimates trajectory distribution from the query set to guide noise placement.
If this is right
- Range queries on trajectory data can be answered with higher utility at the same privacy level.
- The method preserves sequential consistency of trajectories better than treating them as unordered sets of points.
- Privacy mechanisms can improve by incorporating both data distribution and query workload instead of adding noise uniformly.
Where Pith is reading between the lines
- The approach could extend to other sequential spatial data such as movement paths in indoor environments if the partitioning step generalizes.
- Accuracy gains may shrink if the query set changes frequently after the initial partitioning, requiring re-estimation.
- The private partitioning step itself may introduce error that limits gains on very sparse datasets.
Load-bearing premise
That estimating trajectory distribution from the privately computed regions, their densities, and the query set will produce high accuracy for the given queries.
What would settle it
Running the algorithm and a uniform-noise baseline on the same real trajectory dataset and query set at identical epsilon and measuring whether the new method's query error is not lower.
Figures
read the original abstract
We propose a novel algorithm to ensure $\epsilon$-differential privacy for answering range queries on trajectory data. In order to guarantee privacy, differential privacy mechanisms add noise to either data or query, thus introducing errors to queries made and potentially decreasing the utility of information. In contrast to the state-of-the-art, our method achieves significantly lower error as it is the first data- and query-aware approach for such queries. The key challenge for answering range queries on trajectory data privately is to ensure an accurate count. Simply representing a trajectory as a set instead of \emph{sequence} of points will generally lead to highly inaccurate query answers as it ignores the sequential dependency of location points in trajectories, i.e., will violate the consistency of trajectory data. Furthermore, trajectories are generally unevenly distributed across a city and adding noise uniformly will generally lead to a poor utility. To achieve differential privacy, our algorithm adaptively adds noise to the input data according to the given query set. It first privately partitions the data space into uniform regions and computes the traffic density of each region. The regions and their densities, in addition to the given query set, are then used to estimate the distribution of trajectories over the queried space, which ensures high accuracy for the given query set. We show the accuracy and efficiency of our algorithm using extensive empirical evaluations on real and synthetic data sets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a novel ε-differentially private algorithm for range queries on trajectory data. It privately partitions the space into uniform regions, computes per-region traffic densities, and then uses these densities together with the given query set to estimate the distribution of trajectories over the queried space. The central claim is that this data- and query-aware approach yields significantly lower error than prior methods while preserving privacy, by avoiding both uniform noise addition and the consistency violations that arise from treating trajectories as unordered sets.
Significance. If the estimation procedure correctly incorporates sequential dependencies, the work would constitute a meaningful advance in utility for private trajectory range queries on unevenly distributed data. The empirical evaluations on real and synthetic datasets are presented as evidence of accuracy and efficiency gains.
major comments (1)
- [Abstract (algorithm description paragraph)] Abstract, paragraph on algorithm description: the estimation step that 'uses the regions and their densities, in addition to the given query set, to estimate the distribution of trajectories' is described only at a high level. No mechanism is indicated for enforcing sequential consistency or modeling transitions across regions, which the paper itself identifies as the source of inaccurate counts when trajectories are treated as sets. This modeling choice is load-bearing for the 'significantly lower error' claim.
Simulated Author's Rebuttal
We thank the referee for their review and recommendation. We address the single major comment below, clarifying the role of the abstract versus the full technical description.
read point-by-point responses
-
Referee: [Abstract (algorithm description paragraph)] Abstract, paragraph on algorithm description: the estimation step that 'uses the regions and their densities, in addition to the given query set, to estimate the distribution of trajectories' is described only at a high level. No mechanism is indicated for enforcing sequential consistency or modeling transitions across regions, which the paper itself identifies as the source of inaccurate counts when trajectories are treated as sets. This modeling choice is load-bearing for the 'significantly lower error' claim.
Authors: The abstract is written at the conventional high level of detail for that section. The concrete mechanism that enforces sequential consistency—by using the privately computed region densities together with the query set to model transitions and estimate a consistent trajectory distribution rather than an unordered set—is fully specified in the algorithm sections of the manuscript (including the estimation procedure and its analysis). This is the same modeling choice the paper contrasts with set-based approaches, and the empirical results are obtained from the implemented procedure that incorporates these transitions. revision: no
Circularity Check
No circularity; accuracy claim rests on empirical evaluation of new procedure
full rationale
The paper introduces a procedural algorithm (private partitioning into uniform regions, per-region density computation, then estimation of trajectory distribution from those densities plus the query set) and supports its accuracy claim solely via 'extensive empirical evaluations on real and synthetic data sets.' No equations, fitted parameters renamed as predictions, or self-citations appear in the provided text. The estimation step is described as an input to the procedure rather than derived from the output by construction, so the derivation chain does not reduce to its own inputs.
Axiom & Free-Parameter Ledger
axioms (2)
- standard math ε-differential privacy definition
- domain assumption sequential dependency of location points must be preserved for accurate counts
Reference graph
Works this paper leans on
-
[1]
L. Bonomi and L. Xiong. A two-phase algorithm for mining sequential patterns with differential privacy. In Proceedings of the 22nd ACM international conference on Information & Knowledge Management , pages 269–278. ACM, 2013
work page 2013
-
[2]
R. Chen, B. Fung, B. C. Desai, and N. M. Sossou. Differentially private transit data publication: a case study on the montreal transportation system. In Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining , pages 213–221. ACM, 2012
work page 2012
-
[3]
G. Cormode, C. Procopiuc, D. Srivastava, E. Shen, and T. Yu. Differen- tially private spatial decompositions. In Data engineering (ICDE), 2012 IEEE 28th international conference on , pages 20–31. IEEE, 2012
work page 2012
-
[4]
Y .-A. De Montjoye, C. A. Hidalgo, M. Verleysen, and V . D. Blondel. Unique in the crowd: The privacy bounds of human mobility. Scientific reports, 3:1376, 2013
work page 2013
- [5]
- [6]
- [7]
-
[8]
M. Hardt and G. N. Rothblum. A multiplicative weights mechanism for privacy-preserving data analysis. In Foundations of Computer Science, 2010 51st Annual IEEE Symposium on , pages 61–70. IEEE, 2010
work page 2010
-
[9]
M. Hay, A. Machanavajjhala, G. Miklau, Y . Chen, and D. Zhang. Principled evaluation of differentially private algorithms using dpbench. In Proceedings of the 2016 International Conference on Management of Data, pages 139–154. ACM, 2016
work page 2016
-
[10]
X. He, G. Cormode, A. Machanavajjhala, C. M. Procopiuc, and D. Sri- vastava. Dpt: differentially private trajectory synthesis using hierarchical reference systems. Proceedings of the VLDB Endowment , 8(11):1154– 1165, 2015
work page 2015
- [11]
-
[12]
L. Leonardi, S. Orlando, A. Raffaet `a, A. Roncato, C. Silvestri, G. An- drienko, and N. Andrienko. A general framework for trajectory data warehousing and visual olap. GeoInformatica, 18(2):273–312, 2014
work page 2014
-
[13]
C. Li, M. Hay, G. Miklau, and Y . Wang. A data-and workload-aware algorithm for range queries under differential privacy. Proceedings of the VLDB Endowment , 7(5):341–352, 2014
work page 2014
-
[14]
I. V . Lopez, R. T. Snodgrass, and B. Moon. Spatiotemporal aggregate computation: A survey. IEEE Transactions on Knowledge and Data Engineering, 17(2):271–286, 2005
work page 2005
-
[15]
F. McSherry and K. Talwar. Mechanism design via differential privacy. In Foundations of Computer Science, 2007. FOCS’07. 48th Annual IEEE Symposium on, pages 94–103. IEEE, 2007
work page 2007
-
[16]
F. D. McSherry. Privacy integrated queries: an extensible platform for privacy-preserving data analysis. In Proceedings of the ACM SIGMOD International Conference on Management of data , pages 19–30. ACM, 2009
work page 2009
-
[17]
M. F. Mokbel, L. Alarabi, J. Bao, A. Eldawy, A. Magdy, M. Sarwat, E. Waytas, and S. Yackel. Mntg: an extensible web-based traffic gener- ator. In International Symposium on Spatial and Temporal Databases , pages 38–55. Springer, 2013
work page 2013
-
[18]
A. Monreale, W. H. Wang, F. Pratesi, S. Rinzivillo, D. Pedreschi, G. An- drienko, and N. Andrienko. Privacy-preserving distributed movement data aggregation. In Geographic Information Science at the Heart of Europe, pages 225–245. Springer, 2013
work page 2013
-
[19]
L. Moreira-Matias, J. Gama, M. Ferreira, J. Mendes-Moreira, and L. Damas. Predicting taxi–passenger demand using streaming data. IEEE Transactions on Intelligent Transportation Systems , 14(3):1393– 1402, 2013
work page 2013
-
[20]
E. Naghi Zadeh Kakhki. Utility-aware protection of trajectory privacy . PhD thesis, The University of Melbourne, 2016
work page 2016
-
[21]
W. Qardaji, W. Yang, and N. Li. Differentially private grids for geospa- tial data. In Data Engineering (ICDE), 2013 IEEE 29th International Conference on, pages 757–768. IEEE, 2013
work page 2013
-
[22]
H. Xie, L. Kulik, and E. Tanin. Privacy-aware traffic monitoring. IEEE Transactions on Intelligent Transportation Systems , 11(1):61–70, 2010
work page 2010
-
[23]
H. Xie, E. Tanin, and L. Kulik. Distributed histograms for processing aggregate data from moving objects. In Mobile Data Management, 2007 International Conference on , pages 152–157. IEEE, 2007
work page 2007
-
[24]
H. Xie, E. Tanin, L. Kulik, P. Scheuermann, G. Trajcevski, and M. Fanaeepour. Euler histogram tree: A spatial data structure for aggregate range queries on vehicle trajectories. In Proceedings of the 7th ACM SIGSPATIAL International Workshop on Computational Transportation Science, pages 18–24. ACM, 2014
work page 2014
-
[25]
F. Xu, Z. Tu, Y . Li, P. Zhang, X. Fu, and D. Jin. Trajectory recovery from ash: User privacy is not preserved in aggregated mobility data. In Proceedings of the 26th International Conference on World Wide Web , pages 1241–1250. International World Wide Web Conferences Steering Committee, 2017. APPENDIX A PROOF OF THEOREM III.2 Let T RU be the cost of tru...
work page 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.