pith. sign in

arxiv: 1907.03827 · v1 · pith:QVPWET75new · submitted 2019-06-21 · 💻 cs.CY · cs.AI· cs.LG· stat.AP

FairST: Equitable Spatial and Temporal Demand Prediction for New Mobility Systems

Pith reviewed 2026-05-25 18:17 UTC · model grok-4.3

classification 💻 cs.CY cs.AIcs.LGstat.AP
keywords fairnessspatiotemporal predictionmobility systemsdemand forecastingequityregularizationconvolutional networksbike sharing
0
0 comments X

The pith

FairST adds fairness regularization to spatiotemporal demand models, cutting demographic gaps by more than 80 percent while matching or exceeding accuracy of standard LSTMs and CNNs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper seeks to demonstrate that demand forecasting for bike-sharing and ride-hailing can treat equity across demographic groups as a first-class constraint rather than an afterthought. It does so by embedding two new spatiotemporal fairness measures directly into model training as a regularization penalty. The resulting FairST architecture combines 1D, 2D, and 3D convolutions to capture urban features and mobility dynamics. On real-world datasets the method shrinks measured fairness gaps dramatically and, surprisingly, often improves raw prediction accuracy over fairness-oblivious baselines. A sympathetic reader would care because current mobility systems have been shown to widen access disparities; a method that narrows those gaps without paying an accuracy price could change how operators and cities allocate vehicles and plan infrastructure.

Core claim

FairST is a convolutional model that learns spatial-temporal demand patterns while minimizing two novel fairness metrics—region-based fairness gap (RFG) when demographics are known at the zone level and individual-based fairness gap (IFG) when population distributions are available—by treating them as regularization terms; on bike-share and ride-share traces this produces more than an 80 percent reduction in the fairness gap and higher accuracy than LSTMs, ConvLSTMs, or 3D CNNs trained without fairness constraints.

What carries the argument

RFG and IFG fairness-gap metrics inserted as regularization terms inside a 1D/2D/3D convolutional network that fuses urban features with mobility time series.

If this is right

  • Demand forecasts can be made substantially more balanced across demographic groups without loss of predictive power.
  • The same regularization approach can be layered onto other spatiotemporal forecasting tasks that affect resource allocation.
  • Operators could use the metrics to set explicit equity targets during model deployment rather than auditing after the fact.
  • Standard accuracy-only models may systematically under-serve certain groups even when overall error looks low.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Fairness regularization may act as an implicit form of domain adaptation that improves generalization on heterogeneous urban data.
  • If the metrics prove robust, cities could require fairness-gap reporting as a condition for approving new mobility permits.
  • The approach invites analogous fairness constraints in related prediction problems such as ambulance or delivery routing.

Load-bearing premise

The proposed RFG and IFG metrics correctly quantify real equity in access, and the available region-level or population-distribution data contain no systematic labeling errors.

What would settle it

An audit that finds large differences in actual trip completion rates or user satisfaction between demographic groups inside regions the model labels as low-RFG or low-IFG would refute the claim that the metrics produce equitable outcomes.

Figures

Figures reproduced from arXiv: 1907.03827 by An Yan, Bill Howe.

Figure 1
Figure 1. Figure 1: FairST is a deep learning based demand predic [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Data preprocessing. (a) We partition a city into [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: A three-stream network architecture. The network input contains three streams, including 1D time series features, [PITH_FULL_IMAGE:figures/full_fig_p005_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Accuracy vs. fairness metrics (single attibute). (a), [PITH_FULL_IMAGE:figures/full_fig_p007_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Ground truth vs. predictions heat maps for September 27, 2018 16:00 pm - 17:00 pm. (d), (e), (f) are the predictions [PITH_FULL_IMAGE:figures/full_fig_p009_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: shows the results of FairST with RF ((a) and (c)) and IF regularizer ((b) and (d)) evaluated using RFG and IFG. Overall, as λ increases, accuracy decreases and fairness increases, indicating that both regularizers consistently help the model to approach equity on multiple sensitive attributes without sacrificing too much accuracy. 30 0 30 60 90 120 150 RFG (a) FairST with RF loss 30 0 30 60 90 120 150 RFG … view at source ↗
read the original abstract

Emerging transportation modes, including car-sharing, bike-sharing, and ride-hailing, are transforming urban mobility but have been shown to reinforce socioeconomic inequities. Spatiotemporal demand prediction models for these new mobility regimes must therefore consider fairness as a first-class design requirement. We present FairST, a fairness-aware model for predicting demand for new mobility systems. Our approach utilizes 1D, 2D and 3D convolutions to integrate various urban features and learn the spatial-temporal dynamics of a mobility system, but we include fairness metrics as a form of regularization to make the predictions more equitable across demographic groups. We propose two novel spatiotemporal fairness metrics, a region-based fairness gap (RFG) and an individual-based fairness gap (IFG). Both quantify equity in a spatiotemporal context, but vary by whether demographics are labeled at the region level (RFG) or whether population distribution information is available (IFG). Experimental results on real bike share and ride share datasets demonstrate the effectiveness of the proposed model: FairST not only reduces the fairness gap by more than 80%, but can surprisingly achieve better accuracy than state-of-the-art yet fairness-oblivious methods including LSTMs, ConvLSTMs, and 3D CNN.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper introduces FairST, a spatiotemporal demand prediction model for new mobility systems (bike/ride share) that augments 1D/2D/3D convolutions with two novel fairness regularizers: region-based fairness gap (RFG) and individual-based fairness gap (IFG). The central claim is that FairST reduces the fairness gap by more than 80% while simultaneously improving predictive accuracy over fairness-oblivious baselines (LSTMs, ConvLSTMs, 3D CNNs) on real datasets.

Significance. If the RFG/IFG metrics validly quantify equity and the reported gains prove robust, the work would be significant for fairness-aware spatiotemporal modeling in urban computing. The integration of demographic regularization into convolutional architectures and the application to real mobility datasets are concrete contributions; however, the absence of external validation for the metrics limits the strength of the significance assessment.

major comments (3)
  1. [§5] §5 (Experimental results): The manuscript reports a >80% fairness-gap reduction and accuracy gains over LSTMs/ConvLSTMs/3D CNNs but supplies no information on data splits, hyperparameter selection, statistical significance testing, or controls for post-hoc choices. This directly undermines evaluation of the central experimental claim.
  2. [§4] §4 (Fairness metrics): RFG and IFG are introduced as novel regularizers and then used to quantify the reported 80% reduction, yet the paper provides no external validation, sensitivity analysis, or comparison against alternative equity definitions. Because the metrics are load-bearing for both training and the headline result, their validity must be substantiated.
  3. [§5] §5 and data description: The evaluation assumes region-level and population-distribution demographic labels in the bike-share/ride-share datasets are accurate and free of systematic labeling errors, but no discussion or robustness checks address potential mismatches between these proxies and actual equity.
minor comments (2)
  1. [Abstract] The phrase 'can surprisingly achieve better accuracy' in the abstract is informal; replace with a neutral statement of the observed result.
  2. [§4] Notation for RFG and IFG should be defined with explicit equations before their use as regularization terms.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback on experimental reproducibility, metric validation, and data assumptions. We address each major comment below and will incorporate revisions to strengthen the manuscript.

read point-by-point responses
  1. Referee: [§5] §5 (Experimental results): The manuscript reports a >80% fairness-gap reduction and accuracy gains over LSTMs/ConvLSTMs/3D CNNs but supplies no information on data splits, hyperparameter selection, statistical significance testing, or controls for post-hoc choices. This directly undermines evaluation of the central experimental claim.

    Authors: We agree that additional experimental details are required for reproducibility and to support the central claims. In the revised manuscript we will add a dedicated experimental setup subsection that specifies: (i) the temporal data split (e.g., first 70% for training, next 15% validation, last 15% test) chosen to prevent leakage; (ii) the hyperparameter search procedure (grid search over learning rate, regularization weight, and convolution kernel sizes, selected on validation fairness-accuracy trade-off); (iii) statistical significance via paired t-tests over five random seeds; and (iv) confirmation that no post-hoc selection of results favoring FairST occurred. These additions will directly address the concern. revision: yes

  2. Referee: [§4] §4 (Fairness metrics): RFG and IFG are introduced as novel regularizers and then used to quantify the reported 80% reduction, yet the paper provides no external validation, sensitivity analysis, or comparison against alternative equity definitions. Because the metrics are load-bearing for both training and the headline result, their validity must be substantiated.

    Authors: RFG and IFG are constructed by extending standard group fairness notions (demographic parity and equalized odds) to spatiotemporal demand settings, using region-level or population-weighted individual labels. We acknowledge the absence of external validation or direct comparison to other equity measures. The revision will include: (a) a sensitivity analysis sweeping the fairness regularization coefficient and reporting resulting accuracy-fairness curves, and (b) a brief comparison of RFG/IFG against a simple demographic-parity baseline computed on the same data. Full external validation against alternative equity definitions or domain-expert judgment lies outside the scope of the current study. revision: partial

  3. Referee: [§5] §5 and data description: The evaluation assumes region-level and population-distribution demographic labels in the bike-share/ride-share datasets are accurate and free of systematic labeling errors, but no discussion or robustness checks address potential mismatches between these proxies and actual equity.

    Authors: We will expand the data-description section to explicitly state that demographic attributes are derived from U.S. Census tract data and therefore constitute proxies that may contain labeling noise or aggregation bias. In addition, we will add a robustness experiment that perturbs the demographic labels by ±10% and re-evaluates both accuracy and fairness-gap reduction, thereby quantifying sensitivity to label error. revision: yes

Circularity Check

0 steps flagged

No significant circularity; fairness metrics evaluated independently on held-out data

full rationale

The paper defines novel RFG and IFG metrics, incorporates them as regularization terms during training, and reports reductions in those metrics plus accuracy gains on real bike/ride-share datasets. No quoted step shows a claimed prediction or result reducing by construction to the same fitted quantities (e.g., no evidence that test-set fairness gaps equal training regularization terms). No self-citation chain, uniqueness theorem, or ansatz smuggling is present or load-bearing. The central claims rest on external dataset evaluations and baseline comparisons rather than self-referential definitions, qualifying as self-contained against benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no concrete information on free parameters, background axioms, or invented entities; the model is described at the level of standard convolutions plus regularization terms whose exact formulation is not given.

pith-pipeline@v0.9.0 · 5752 in / 1171 out tokens · 29341 ms · 2026-05-25T18:17:10.311870+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

50 extracted references · 50 canonical work pages · 2 internal anchors

  1. [1]

    Martín Abadi, Paul Barham, Jianmin Chen, Zhifeng Chen, Andy Davis, Jeffrey Dean, Matthieu Devin, Sanjay Ghemawat, Geoffrey Irving, Michael Isard, et al

  2. [2]

    In OSDI, Vol

    Tensorflow: a system for large-scale machine learning.. In OSDI, Vol. 16. 265–283

  3. [3]

    Franziska Bell and Slawek Smyl. 2018. Forecasting at Uber: An Introduction. Uber Engineering (2018). https://eng.uber.com/forecasting-introduction/

  4. [4]

    A Convex Framework for Fair Regression

    Richard Berk, Hoda Heidari, Shahin Jabbari, Matthew Joseph, Michael J. Kearns, Jamie Morgenstern, Seth Neel, and Aaron Roth. 2017. A Convex Framework for Fair Regression. CoRR abs/1706.02409 (2017)

  5. [5]

    Anne Elizabeth Brown. 2018. Ridehail revolution: Ridehail travel and equity in Los Angeles. Ph.D. Dissertation. UCLA

  6. [6]

    Toon Calders, Asim Karim, Faisal Kamiran, Wasif Ali, and Xiangliang Zhang

  7. [7]

    In Data Mining (ICDM), 2013 IEEE 13th International Conference on

    Controlling attribute effect in linear regression. In Data Mining (ICDM), 2013 IEEE 13th International Conference on . IEEE, 71–80

  8. [8]

    Biz Carson. 2018. Lyft doubled rides in 2017 as its rival Uber stumbled. Forbes

  9. [9]

    Alexa Delbosc and Graham Currie. 2011. Using Lorenz curves to assess public transport equity. Journal of Transport Geography 19, 6 (2011), 1252–1259

  10. [10]

    Cynthia Dwork, Moritz Hardt, Toniann Pitassi, Omer Reingold, and Richard Zemel. 2012. Fairness Through Awareness. In Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (ITCS ’12) . 214–226

  11. [11]

    Wafic El-Assi, Mohamed Salah Mahmoud, and Khandker Nurul Habib. 2017. Effects of built environment and weather on bike sharing demand: a station level analysis of commercial bike sharing in Toronto. Transportation 44, 3 (2017), 589–613

  12. [12]

    Friedler, John Moeller, Carlos Scheidegger, and Suresh Venkatasubramanian

    Michael Feldman, Sorelle A. Friedler, John Moeller, Carlos Scheidegger, and Suresh Venkatasubramanian. 2015. Certifying and Removing Disparate Impact. In Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ’15). 259–268

  13. [13]

    Inês Frade and Anabela Ribeiro. 2014. Bicycle sharing systems demand. Procedia- Social and Behavioral Sciences 111 (2014), 518–527

  14. [14]

    2016.Racial and gender discrimination in transportation network companies

    Yanbo Ge, Christopher R Knittel, Don MacKenzie, and Stephen Zoepf. 2016.Racial and gender discrimination in transportation network companies . Technical Report. National Bureau of Economic Research

  15. [15]

    Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. 1999. Learning to forget: Continual prediction with LSTM. (1999)

  16. [16]

    Bruce Glymour and Jonathan Herington. 2019. Measuring the Biases That Matter: The Ethical and Casual Foundations for Measures of Fairness in Algorithms. In Proceedings of the Conference on Fairness, Accountability, and Transparency (FAT* ’19). 269–278

  17. [17]

    Moritz Hardt, Eric Price, and Nathan Srebro. 2016. Equality of Opportunity in Supervised Learning. In Proceedings of the 30th International Conference on Neural Information Processing Systems (NIPS’16). 3323–3331

  18. [18]

    Jan Hauke and Tomasz Kossowski. 2011. Comparison of values of Pearson’s and Spearman’s correlation coefficients on the same sets of data. Quaestiones geographicae 30, 2 (2011), 87–93

  19. [19]

    Kate Hosford and Meghan Winters. 2018. Who are public bicycle share programs serving? An evaluation of the equity of spatial access to bicycle share service areas in Canadian cities.Transportation research record (2018), 0361198118783107

  20. [20]

    Ryan Hughes and Don MacKenzie. 2016. Transportation network company wait times in Greater Seattle, and relationship to socioeconomic indicators. Journal of Transport Geography 56 (2016), 36–44

  21. [21]

    Ben Hutchinson and Margaret Mitchell. 2019. 50 Years of Test (Un)Fairness: Lessons for Machine Learning. In Proceedings of the Conference on Fairness, Ac- countability, and Transparency (FAT* ’19). 49–58

  22. [22]

    Shuiwang Ji, Wei Xu, Ming Yang, and Kai Yu. 2013. 3D convolutional neural networks for human action recognition. IEEE transactions on pattern analysis and machine intelligence 35, 1 (2013), 221–231

  23. [23]

    Faisal Kamiran and Toon Calders. 2009. Classifying without discriminating. In Computer, Control and Communication, 2009. IC4 2009. 2nd International Confer- ence on. IEEE, 1–6

  24. [24]

    Junpei Komiyama, Akiko Takeda, Junya Honda, and Hajime Shimao. 2018. Non- convex Optimization for Regression with Fairness Constraints. In Proceedings of the 35th International Conference on Machine Learning (Proceedings of Ma- chine Learning Research), Jennifer Dy and Andreas Krause (Eds.), Vol. 80. PMLR, StockholmsmÃďssan, Stockholm Sweden, 2737–2746

  25. [25]

    Xuefeng Li, Yong Zhang, Li Sun, and Qiyang Liu. 2018. Free-Floating Bike Sharing in Jiangsu: Users’ Behaviors and Influencing Factors. Energies 11, 7 (2018), 1664

  26. [26]

    Yexin Li, Yu Zheng, and Qiang Yang. 2018. Dynamic Bike Reposition: A Spatio- Temporal Reinforcement Learning Approach. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ’18). 1724–1733

  27. [27]

    Yexin Li, Yu Zheng, Huichu Zhang, and Lei Chen. 2015. Traffic Prediction in a Bike-sharing System. In Proceedings of the 23rd SIGSPATIAL International Conference on Advances in Geographic Information Systems (SIGSPATIAL ’15) . Article 33, 10 pages

  28. [28]

    Qiang Liu, Shu Wu, Liang Wang, and Tieniu Tan. 2016. Predicting the Next Location: A Recurrent Model with Spatial and Temporal Contexts.. In AAAI. 194–200

  29. [29]

    Andrew L Maas, Awni Y Hannun, and Andrew Y Ng. 2013. Rectifier nonlinearities improve neural network acoustic models. In Proc. icml, Vol. 30. 3

  30. [30]

    Nathan McNeil, Jennifer Dill, John MacArthur, Joseph Broach, and Steven How- land. 2017. Breaking Barriers to Bike Share: Insights from Residents of Tradition- ally Underserved Neighborhoods. (2017)

  31. [31]

    Stephen J Mooney, Kate Hosford, Bill Howe, An Yan, Meghan Winters, Alon Bassok, and Jana A Hirsch. 2019. Freedom from the station: Spatial equity in access to dockless bike share. Journal of Transport Geography 74 (2019), 91–96

  32. [32]

    Bilong Shen, Xiaodan Liang, Yufeng Ouyang, Miaofeng Liu, Weimin Zheng, and Kathleen M Carley. 2018. StepDeep: A Novel Spatial-temporal Mobility Event Prediction Framework based on Deep Neural Network. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining . ACM, 724–733

  33. [33]

    SimplyAnalytics. 2018. EASI/MRI Census US. Retrieved November 2, 2018 from SimplyAnalyticsdatabase

  34. [34]

    Jennifer Stark and Nicholas Diakopoulos. 2016. Uber seems to offer better ser- vice in areas with more white people. That raises some tough questions. The Washington Post (2016)

  35. [35]

    Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, and Andrew Rabinovich. 2015. Going deeper with convolutions. InProceedings of the IEEE conference on computer vision and pattern recognition . 1–9

  36. [36]

    Du Tran, Lubomir Bourdev, Rob Fergus, Lorenzo Torresani, and Manohar Paluri

  37. [37]

    In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV) (ICCV ’15)

    Learning Spatiotemporal Features with 3D Convolutional Networks. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV) (ICCV ’15). 4489–4497

  38. [38]

    Julia Ursaki and Lisa Aultman-Hall. 2016. Quantifying the equity of bikeshare access in US cities. In 95th Annual Meeting of the Transportation Research Board, Washington, DC

  39. [39]

    Patrick Vogel, Torsten Greiser, and Dirk Christian Mattfeld. 2011. Understanding bike-sharing systems using data mining: Exploring activity patterns. Procedia- Social and Behavioral Sciences 20 (2011), 514–523

  40. [40]

    Dong Wang, Wei Cao, Jian Li, and Jieping Ye. 2017. DeepSD: supply-demand prediction for online car-hailing services using deep neural networks. In 2017 IEEE 33rd International Conference on Data Engineering (ICDE) . IEEE, 243–254

  41. [41]

    Mingshu Wang and Lan Mu. 2018. Spatial disparities of Uber accessibility: An exploratory analysis in Atlanta, USA. Computers, Environment and Urban Systems 67 (2018), 169–175

  42. [42]

    Shi Xingjian, Zhourong Chen, Hao Wang, Dit-Yan Yeung, Wai-Kin Wong, and Wang-chun Woo. 2015. Convolutional LSTM network: A machine learning ap- proach for precipitation nowcasting. In Advances in neural information processing systems. 802–810

  43. [43]

    Chengcheng Xu, Junyi Ji, and Pan Liu. 2018. The station-free sharing bike demand forecasting with a deep learning approach and large-scale datasets.Transportation Research Part C: Emerging Technologies 95 (2018), 47–60

  44. [44]

    Huaxiu Yao, Fei Wu, Jintao Ke, Xianfeng Tang, Yitian Jia, Siyu Lu, Pinghua Gong, Jieping Ye, and Zhenhui Li. 2018. Deep Multi-View Spatial-Temporal Network for Taxi Demand Prediction. In AAAI

  45. [45]

    Ji Won Yoon, Fabio Pinelli, and Francesco Calabrese. 2012. Cityride: a predictive bike sharing journey advisor. In Mobile Data Management (MDM), 2012 IEEE 13th International Conference on. IEEE, 306–311

  46. [46]

    Zhuoning Yuan, Xun Zhou, and Tianbao Yang. 2018. Hetero-ConvLSTM: A Deep Learning Approach to Traffic Accident Prediction on Heterogeneous Spatio- Temporal Data. In Proceedings of the 24th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining . ACM, 984–992

  47. [47]

    Muhammad Bilal Zafar, Isabel Valera, Manuel Gomez Rodriguez, and Krishna P Gummadi. 2015. Fairness constraints: Mechanisms for fair classification. arXiv preprint arXiv:1507.05259 (2015)

  48. [48]

    Richard Zemel, Yu Wu, Kevin Swersky, Toniann Pitassi, and Cynthia Dwork. 2013. Learning Fair Representations. In Proceedings of the 30th International Conference on International Conference on Machine Learning - Volume 28 (ICML’13) . III–325– III–333

  49. [49]

    Junbo Zhang, Yu Zheng, Dekang Qi, Ruiyuan Li, Xiuwen Yi, and Tianrui Li. 2018. Predicting citywide crowd flows using deep spatio-temporal residual networks. Artificial Intelligence 259 (2018), 147 – 166. https://doi.org/10.1016/j.artint.2018. 03.002

  50. [50]

    Yu Zheng, Licia Capra, Ouri Wolfson, and Hai Yang. 2014. Urban computing: concepts, methodologies, and applications. ACM Transactions on Intelligent Systems and Technology (TIST) 5, 3 (2014), 38