pith. sign in

arxiv: 2605.00083 · v1 · submitted 2026-04-30 · 💻 cs.LG

Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction

Pith reviewed 2026-05-09 20:26 UTC · model grok-4.3

classification 💻 cs.LG
keywords bus occupancy predictionspatial clusteringmachine learning modelspublic transport forecastinglocal vs global modelspolygon-based analysisridership prediction
0
0 comments X

The pith

Clustering bus stops into spatial polygons and training local models for each yields bus occupancy forecasts as accurate as one city-wide global model.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper tests whether splitting a city into polygon regions of nearby bus stops, then training a separate machine learning model on each region, can predict passenger numbers as well as a single model trained on all stops together. The local models use the same mix of features: time of day, day of week, weather, and nearby attractions. If the local versions perform equally well, transit planners could build forecasts tuned to specific neighborhoods without losing overall reliability. The work compares the two strategies on real bus data and finds comparable accuracy between them.

Core claim

By grouping bus stops into polygons on the principle that nearby stops share similar ridership patterns, and training a dedicated forecasting model for each polygon using temporal, meteorological, and spatial features, the localized approach achieves predictive accuracy comparable to that of a single global model applied across the entire urban area.

What carries the argument

Polygon-based spatial clustering of bus stops, which groups proximate stops assumed to have similar ridership characteristics so that a separate machine learning model can be trained per cluster.

If this is right

  • Transit agencies could run neighborhood-specific forecasts while maintaining city-level reliability.
  • Local models allow service adjustments targeted to individual polygons rather than uniform city rules.
  • The same multi-source feature set (time, weather, attractions) supports both local and global training without modification.
  • Spatially aware clustering offers a practical alternative to treating the whole city as one homogeneous area.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same polygon clustering idea could be tested on other transport modes such as trains or shared bikes where stop proximity also matters.
  • Agencies might lower training costs by maintaining many small local models instead of one large global model.
  • Dynamic polygons that shift with seasons or events could be explored as an extension beyond fixed boundaries.

Load-bearing premise

Bus stops that are close together share similar enough ridership characteristics that they can be grouped into polygons and modeled separately without reducing overall forecast accuracy.

What would settle it

If, on the same held-out bus ridership dataset, the polygon-specific models produce measurably lower accuracy than the single global model across standard error metrics, the claim of comparable performance would not hold.

Figures

Figures reproduced from arXiv: 2605.00083 by Daniel Azenkot, Eran Ben Elia, Michael Fire.

Figure 1
Figure 1. Figure 1: Methodology overview. Since PT demand is strongly influenced by stop location, many researchers have developed spatially aware models that incorporate geographic information into their features. These models are based on Tobler’s first law (see [17]), which state that geographically closer or more similar areas tend to exhibit similar patterns. Wang et al. [26] used Geographically Weighted Regression (GWR)… view at source ↗
Figure 2
Figure 2. Figure 2: Max-p regions for March (training on all days except the last 7). Bus stops are grouped into spatially contiguous regions, shown in different colors, as determined by the optimal threshold τ selected using the Calinski–Harabasz (CH) index. effect size analysis. For the high-performing tree-based mod￾els, the effect size remains in the “small” to “negligible” range (e.g., δ = 0.194 for LightGBM and δ = 0.13… view at source ↗
Figure 3
Figure 3. Figure 3: sMPAE of LightGBM across ridership buckets (0–10 to 41–50) over all experiments, comparing global and polygon strategies. Both approaches show similar performance, with the highest errors in the lowest ridership bucket and a monotone increase in error from 11–20 up to 41–50 ridership. Figure S8 shows that extending the LightGBM training set to include the second-to-last week did not yield meaningful improv… view at source ↗
Figure 4
Figure 4. Figure 4: Distribution of mean absolute error (MAE) for four tree-based models (XGBoost, LightGBM, CatBoost, and Random Forest) under the global and polygon-based strategies, for all the test sets used in the experiment. The boxplots show that both approaches yield comparable error distributions, with the polygon-based models generally achieving slightly lower median MAE values. that both modeling strategies maintai… view at source ↗
Figure 5
Figure 5. Figure 5: SHAP values for January (polygon-level LightGBM). sengers. Although the polygon-wise model achieved slightly lower median errors in both weeks, the differences were mi￾nor. The stability of MAE across weeks suggests that the temporal variability of ridership patterns limits the benefit of simple training set extensions, highlighting the need for more adaptive temporal features or dynamic learning mechanism… view at source ↗
read the original abstract

Accurate forecasting of bus ridership (passengers numbers) is crucial for efficient management and optimization of public transport systems. Traditional forecasting models often fail to capture the unique and localized dynamics of different urban areas by treating the entire city as a single, homogeneous region. This paper introduces a novel framework that enhances bus ridership prediction by integrating a spatial clustering methodology with multi-dimensional feature analysis. The proposed framework utilizes a diverse set of data, including bus ridership data (by route number, time, and bus stop) complemented by a variety of open source data, such as spatial features (e.g., attractive destinations), meteorological conditions (e.g., temperature, rainfall), and temporal patterns (e.g., time of day, day of week). By clustering the urban area into distinct regions, based on the principle that bus stops in close proximity share similar ridership characteristics, a separate local forecasting model is trained for each of these clusters. This localized approach demonstrates an accuracy comparable to that of global models. The findings suggest that a spatially-aware, localized modeling strategy is effective for public transport prediction, paving the way for more targeted and efficient service improvements.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a framework for bus occupancy prediction that integrates spatial clustering of bus stops into polygons (based on proximity) with per-cluster machine learning models trained on multi-feature data including ridership, spatial attributes of destinations, meteorological variables, and temporal patterns. It claims that these localized polygon-based models achieve accuracy comparable to a single global model trained on the full dataset.

Significance. If the empirical comparison holds under proper validation, the work could support more spatially targeted forecasting in public transit, potentially improving operational efficiency. The reliance on open-source auxiliary data is a methodological strength that enhances reproducibility and generalizability.

major comments (2)
  1. Abstract: The central claim that 'this localized approach demonstrates an accuracy comparable to that of global models' is stated without any quantitative metrics (e.g., MAE, RMSE, R²), error bars, baseline details, or statistical tests. This absence makes the primary result unverifiable from the provided text and is load-bearing for the paper's contribution.
  2. Methodology (clustering and evaluation procedure): No description is given of the specific clustering algorithm, how the number of polygons/clusters is determined, the criteria for assigning stops to polygons, the train/validation/test split strategy, or the cross-validation method used to compare local versus global models. These details are required to assess whether the 'comparable accuracy' result is robust or an artifact of the experimental design.
minor comments (1)
  1. The abstract and introduction would benefit from a concise statement of the exact performance metrics and the magnitude of any observed differences between local and global models.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments on our manuscript. We address each major point below and will revise the paper to improve the verifiability and reproducibility of our results.

read point-by-point responses
  1. Referee: Abstract: The central claim that 'this localized approach demonstrates an accuracy comparable to that of global models' is stated without any quantitative metrics (e.g., MAE, RMSE, R²), error bars, baseline details, or statistical tests. This absence makes the primary result unverifiable from the provided text and is load-bearing for the paper's contribution.

    Authors: We agree that the abstract should contain quantitative support for the comparability claim to allow immediate verification. The results section of the manuscript reports MAE, RMSE, and R² values for the polygon-based models versus the single global model (using the same feature set), along with the baselines and a note that performance differences fall within statistical noise. We will revise the abstract to include the key aggregate metrics and a brief reference to the statistical comparison, ensuring the central result is verifiable from the abstract. revision: yes

  2. Referee: Methodology (clustering and evaluation procedure): No description is given of the specific clustering algorithm, how the number of polygons/clusters is determined, the criteria for assigning stops to polygons, the train/validation/test split strategy, or the cross-validation method used to compare local versus global models. These details are required to assess whether the 'comparable accuracy' result is robust or an artifact of the experimental design.

    Authors: We acknowledge that the current description of the clustering and evaluation procedure is insufficiently detailed. We will expand the Methodology section to specify the clustering algorithm, the criterion or method used to select the number of polygons, the precise assignment rule for bus stops, the train/validation/test partitioning approach (including how temporal ordering is respected), and the cross-validation procedure employed for the local-versus-global comparison. These additions will enable readers to assess the robustness of the reported accuracy comparability. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper is an empirical comparative study that clusters bus stops into polygons by spatial proximity, extracts features from external open-source spatial/meteorological/temporal data, trains separate supervised models per cluster, and reports that their accuracy is comparable to a single global model. This workflow contains no mathematical derivation chain; the central claim is a direct outcome of standard clustering plus ML training/evaluation on held-out data. No step defines a quantity in terms of itself, renames a fitted parameter as a prediction, or relies on a self-citation whose content is itself unverified or tautological. The methodology is self-contained against external benchmarks and does not reduce to its inputs by construction.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claim rests on the domain assumption of spatial similarity in ridership and on standard machine-learning modeling assumptions. No new physical entities are postulated. Several free parameters are implicit in any clustering-plus-modeling pipeline.

free parameters (2)
  • number of clusters / polygon count
    The number of regions is chosen or optimized from data and directly determines the localization granularity.
  • model hyperparameters
    Typical supervised learning parameters (learning rate, tree depth, regularization, etc.) are tuned to the ridership dataset.
axioms (2)
  • domain assumption Bus stops in close proximity share similar ridership characteristics
    Explicitly invoked in the abstract as the principle justifying the spatial clustering step.
  • domain assumption Integration of open-source spatial, meteorological, and temporal features improves predictive accuracy
    Assumed when the framework is described as using these complementary data sources.

pith-pipeline@v0.9.0 · 5502 in / 1553 out tokens · 38022 ms · 2026-05-09T20:26:04.880697+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

80 extracted references · 80 canonical work pages

  1. [1]

    Appraisal of urbanization and traffic on environmental quality.Journal of CO2 Utilization, 16:428–430, 2016

    Nikola Petrovi´c, Nebojsa Bojovi´c, and Jelena Petrovi ´c. Appraisal of urbanization and traffic on environmental quality.Journal of CO2 Utilization, 16:428–430, 2016

  2. [2]

    Island Press, 2024

    Jarrett Walker.Human transit, revised edition: how clearer thinking about public transit can enrich our com- munities and our lives. Island Press, 2024

  3. [3]

    A composite index of public transit accessibility.Journal of Public Transportation, 14(2):69–87, 2011

    Md Sha Al Mamun and Nicholas E Lownes. A composite index of public transit accessibility.Journal of Public Transportation, 14(2):69–87, 2011

  4. [4]

    Efficient timetabling and vehicle scheduling for public transport

    Avi Ceder. Efficient timetabling and vehicle scheduling for public transport. InComputer-aided scheduling of public transport, pages 37–52. Springer, 2001

  5. [5]

    Workshop synthesis: Representative- ness in surveys: challenges and solutions.Transportation Research Procedia, 32:224–228, 2018

    Jimmy Armoogum, Adrian B Ellison, and Marie- José Olde Kalter. Workshop synthesis: Representative- ness in surveys: challenges and solutions.Transportation Research Procedia, 32:224–228, 2018

  6. [6]

    Public-transit frequency setting using minimum-cost approach with stochastic demand and travel time.Transportation Re- search Part B: Methodological, 46(8):1068–1084, 2012

    Yuval Hadas and Matan Shnaiderman. Public-transit frequency setting using minimum-cost approach with stochastic demand and travel time.Transportation Re- search Part B: Methodological, 46(8):1068–1084, 2012

  7. [7]

    CRC press, 2016

    Avishai Ceder.Public transit planning and operation: Modeling, practice and behavior. CRC press, 2016

  8. [8]

    Smart cities, big data and urban policy: Towards urban analytics for the long run

    Jens Kandt and Michael Batty. Smart cities, big data and urban policy: Towards urban analytics for the long run. Cities, 109:102992, 2021

  9. [9]

    Towards smart card based mutual authentication schemes in cloud computing.KSII Transactions on In- ternet and Information Systems (TIIS), 9(7):2719–2735, 2015

    Haoxing Li, Fenghua Li, Chenggen Song, and Yalong Yan. Towards smart card based mutual authentication schemes in cloud computing.KSII Transactions on In- ternet and Information Systems (TIIS), 9(7):2719–2735, 2015

  10. [10]

    Smart card data mining of public transport destination: A litera- ture review.Information, 9(1):18, 2018

    Tian Li, Dazhi Sun, Peng Jing, and Kaixi Yang. Smart card data mining of public transport destination: A litera- ture review.Information, 9(1):18, 2018

  11. [11]

    Understanding commuting patterns using transit smart card data.Journal of Transport Geog- raphy, 58:135–145, 2017

    Xiaolei Ma, Congcong Liu, Huimin Wen, Yunpeng Wang, and Yao-Jan Wu. Understanding commuting patterns using transit smart card data.Journal of Transport Geog- raphy, 58:135–145, 2017

  12. [12]

    Mining smart card data for transit riders’ travel patterns.Transportation Research Part C: Emerg- ing Technologies, 36:1–12, 2013

    Xiaolei Ma, Yao-Jan Wu, Yinhai Wang, Feng Chen, and Jianfeng Liu. Mining smart card data for transit riders’ travel patterns.Transportation Research Part C: Emerg- ing Technologies, 36:1–12, 2013

  13. [13]

    Urban transportation data research overview: A bibliometric analysis based on citespace.Sustainability, 16(22):9615, 2024

    Yanni Liang, Jianxin You, Ran Wang, Bo Qin, and Shuo Han. Urban transportation data research overview: A bibliometric analysis based on citespace.Sustainability, 16(22):9615, 2024

  14. [14]

    Khatun E Zannat and Charisma F Choudhury. Emerging big data sources for public transport planning: A sys- tematic review on current state of art and future research directions.Journal of the Indian Institute of Science, 99 (4):601–619, 2019

  15. [15]

    Pioneering open data standards: The gtfs story

    B McHugh. Pioneering open data standards: The gtfs story. beyond transparency: open data and the future of civic innovation.Beyond transparency: open data and the future of civic innovation, pages 123–135, 2013

  16. [16]

    Behavioural data mining of transit smart card data: A data fusion approach

    Takahiko Kusakabe and Yasuo Asakura. Behavioural data mining of transit smart card data: A data fusion approach. Transportation Research Part C: Emerging Technologies, 46:179–191, 2014

  17. [17]

    Tobler’s first law and spatial analysis

    Harvey J Miller. Tobler’s first law and spatial analysis. Annals of the association of American geographers, 94 (2):284–289, 2004. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 20/34

  18. [18]

    The max- p-regions problem.Journal of Regional Science, 52(3): 397–419, 2012

    Juan C Duque, Luc Anselin, and Sergio J Rey. The max- p-regions problem.Journal of Regional Science, 52(3): 397–419, 2012

  19. [19]

    Explanation of machine learning models using improved shapley additive expla- nation

    Yasunobu Nohara, Koutarou Matsumoto, Hidehisa Soe- jima, and Naoki Nakashima. Explanation of machine learning models using improved shapley additive expla- nation. InProceedings of the 10th ACM international conference on bioinformatics, computational biology and health informatics, pages 546–546, 2019

  20. [20]

    De- velopment and evaluation of frameworks for real-time bus passenger occupancy prediction.International Journal of Transportation Science and Technology, 12(2):399–413, 2023

    Jonathan Wood, Zhengyao Yu, and Vikash V Gayah. De- velopment and evaluation of frameworks for real-time bus passenger occupancy prediction.International Journal of Transportation Science and Technology, 12(2):399–413, 2023

  21. [21]

    Framework for onboard bus comfort level predictions using the markov chain concept

    Paweł Wi˛ ecek, Daniel Kubek, Jan Hipolit Aleksandrow- icz, and Aleksandra Stró˙zek. Framework for onboard bus comfort level predictions using the markov chain concept. Symmetry, 11(6):755, 2019

  22. [22]

    Research on forecast of rail traffic flow based on arima model

    Shu Ying Liu, Shuo Liu, Ye Tian, Quan Long Sun, and Yu Yang Tang. Research on forecast of rail traffic flow based on arima model. InJournal of Physics: Conference Series, volume 1792, page 012065. IOP Publishing, 2021

  23. [23]

    Short-term passenger flow prediction in urban public transport: Kalman filtering combined k-nearest neighbor approach.Ieee Access, 7:120937–120949, 2019

    Shidong Liang, Minghui Ma, Shengxue He, and Hu Zhang. Short-term passenger flow prediction in urban public transport: Kalman filtering combined k-nearest neighbor approach.Ieee Access, 7:120937–120949, 2019

  24. [24]

    Passenger flow prediction using smart card data from connected bus system based on interpretable xgboost.Wireless Communications and Mobile Computing, 2022(1):5872225, 2022

    Liang Zou, Sisi Shu, Xiang Lin, Kaisheng Lin, Jiasong Zhu, and Linchao Li. Passenger flow prediction using smart card data from connected bus system based on interpretable xgboost.Wireless Communications and Mobile Computing, 2022(1):5872225, 2022

  25. [25]

    Designing on- board explainable passenger flow prediction.Engineering Applications of Artificial Intelligence, 139:109648, 2025

    Mario Barbareschi, Antonio Emmanuele, Nicola Maz- zocca, and Franca Rocco di Torrepadula. Designing on- board explainable passenger flow prediction.Engineering Applications of Artificial Intelligence, 139:109648, 2025

  26. [26]

    Bus ridership and its determinants in beijing: A spatial econometric perspective.Transportation, 50(2): 383–406, 2023

    Jiaoe Wang, Yanan Li, Jingjuan Jiao, Haitao Jin, and Fangye Du. Bus ridership and its determinants in beijing: A spatial econometric perspective.Transportation, 50(2): 383–406, 2023

  27. [27]

    An adapted geographically weighted lasso (ada-gwl) model for pre- dicting subway ridership.Transportation, 48(3):1185– 1216, 2021

    Yuxin He, Yang Zhao, and Kwok Leung Tsui. An adapted geographically weighted lasso (ada-gwl) model for pre- dicting subway ridership.Transportation, 48(3):1185– 1216, 2021

  28. [28]

    ‘centrality measures’ as a tool to identify the transit demand at public transit stops; a case of ahmedabad city, india.International Journal, 2 (7):1063–1074, 2014

    TalatMunshi AmilaJayasinghe. ‘centrality measures’ as a tool to identify the transit demand at public transit stops; a case of ahmedabad city, india.International Journal, 2 (7):1063–1074, 2014

  29. [29]

    Exploring the nonlinear effects of built environment on bus-transfer ridership: take shanghai as an example.Ap- plied Sciences, 12(11):5755, 2022

    Ding Liu, Wuyue Rong, Jin Zhang, and Ying-En Ge. Exploring the nonlinear effects of built environment on bus-transfer ridership: take shanghai as an example.Ap- plied Sciences, 12(11):5755, 2022

  30. [30]

    Exploring the association be- tween network centralities and passenger flows in metro systems.Applied Network Science, 8(1):69, 2023

    Athanasios Kopsidas, Aristeides Douvaras, and Kon- stantinos Kepaptsoglou. Exploring the association be- tween network centralities and passenger flows in metro systems.Applied Network Science, 8(1):69, 2023

  31. [31]

    Predicting bus ridership based on the weather conditions using deep learning algorithms.Trans- portation Research Interdisciplinary Perspectives, 19: 100833, 2023

    Zakir H Farahmand, Konstantinos Gkiotsalitis, and Karst T Geurs. Predicting bus ridership based on the weather conditions using deep learning algorithms.Trans- portation Research Interdisciplinary Perspectives, 19: 100833, 2023

  32. [32]

    Artificial neural networks for fore- casting passenger flows on metro lines.Sensors, 19(15): 3424, 2019

    Mariano Gallo, Giuseppina De Luca, Luca D’Acierno, and Marilisa Botte. Artificial neural networks for fore- casting passenger flows on metro lines.Sensors, 19(15): 3424, 2019

  33. [33]

    Learning to forget: Continual prediction with lstm.Neu- ral computation, 12(10):2451–2471, 2000

    Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. Learning to forget: Continual prediction with lstm.Neu- ral computation, 12(10):2451–2471, 2000

  34. [34]

    Deep learning based lstm model for predicting the number of passengers for public transport bus operators.Jurnal Online Informatika, 9(1):18–28, 2024

    Joko Siswanto, Danny Manongga, Irwan Sembiring, and Sutarto Wijono. Deep learning based lstm model for predicting the number of passengers for public transport bus operators.Jurnal Online Informatika, 9(1):18–28, 2024

  35. [35]

    Fore- casting the short-term metro ridership with seasonal and trend decomposition using loess and lstm neural networks

    Dewang Chen, Jianhua Zhang, and Shixiong Jiang. Fore- casting the short-term metro ridership with seasonal and trend decomposition using loess and lstm neural networks. Ieee Access, 8:91181–91187, 2020

  36. [36]

    Deeppf: A deep learning based architecture for metro passenger flow pre- diction.Transportation Research Part C: Emerging Tech- nologies, 101:18–34, 2019

    Yang Liu, Zhiyuan Liu, and Ruo Jia. Deeppf: A deep learning based architecture for metro passenger flow pre- diction.Transportation Research Part C: Emerging Tech- nologies, 101:18–34, 2019

  37. [37]

    Ai-based neural network models for bus passenger demand forecasting using smart card data

    Sohani Liyanage, Rusul Abduljabbar, Hussein Dia, and Pei-Wei Tsai. Ai-based neural network models for bus passenger demand forecasting using smart card data. Journal of Urban Management, 11(3):365–380, 2022

  38. [38]

    Short-term bus passenger flow forecast based on cnn-bilstm.Advances in Engineer- ing Technology Research, 5(1):448–448, 2023

    Chaohua Wu and Xingzu Qi. Short-term bus passenger flow forecast based on cnn-bilstm.Advances in Engineer- ing Technology Research, 5(1):448–448, 2023

  39. [39]

    Comparative analysis of deep-learning-based models for hourly bus passenger flow forecasting.Transportation, 51(5):1759–1784, 2024

    Yu Zhang, Xiaodan Wang, Jingjing Xie, and Yun Bai. Comparative analysis of deep-learning-based models for hourly bus passenger flow forecasting.Transportation, 51(5):1759–1784, 2024

  40. [40]

    Transparency and the black box problem: Why we do not trust ai.Philosophy & Technology, 34(4):1607–1622, 2021

    Warren J V on Eschenbach. Transparency and the black box problem: Why we do not trust ai.Philosophy & Technology, 34(4):1607–1622, 2021. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 21/34

  41. [41]

    Deep learning xai for bus passenger forecasting: A use case in spain.Mathematics, 10(9):1428, 2022

    Leticia Monje, Ramón A Carrasco, Carlos Rosado, and Manuel Sánchez-Montañés. Deep learning xai for bus passenger forecasting: A use case in spain.Mathematics, 10(9):1428, 2022

  42. [42]

    A novel passenger flow prediction model using deep learning methods.Trans- portation Research Part C: Emerging Technologies, 84: 74–91, 2017

    Lijuan Liu and Rung-Ching Chen. A novel passenger flow prediction model using deep learning methods.Trans- portation Research Part C: Emerging Technologies, 84: 74–91, 2017

  43. [43]

    Prediction of public bus passenger flow using spatial–temporal hybrid model of deep learning

    Tao Chen, Jie Fang, Mengyun Xu, Yingfang Tong, and Wentian Chen. Prediction of public bus passenger flow using spatial–temporal hybrid model of deep learning. Journal of Transportation Engineering, Part A: Systems, 148(4):04022007, 2022

  44. [44]

    Short-term abnormal passenger flow pre- diction based on the fusion of svr and lstm.Ieee Access, 7:42946–42955, 2019

    Jianyuan Guo, Zhen Xie, Yong Qin, Limin Jia, and Yaguan Wang. Short-term abnormal passenger flow pre- diction based on the fusion of svr and lstm.Ieee Access, 7:42946–42955, 2019

  45. [45]

    Short-term passenger flow prediction for multi-traffic modes: A transformer and residual network based multi-task learning method.In- formation Sciences, 642:119144, 2023

    Yongjie Yang, Jinlei Zhang, Lixing Yang, Yang Yang, Xiaohong Li, and Ziyou Gao. Short-term passenger flow prediction for multi-traffic modes: A transformer and residual network based multi-task learning method.In- formation Sciences, 642:119144, 2023

  46. [46]

    Hy- brid hidden markov lstm for short-term traffic flow pre- diction.arXiv preprint arXiv:2307.04954, 2023

    Agnimitra Sengupta, Adway Das, and S Ilgin Guler. Hy- brid hidden markov lstm for short-term traffic flow pre- diction.arXiv preprint arXiv:2307.04954, 2023

  47. [47]

    A novel cnn-gru-lstm based deep learning model for accurate traffic prediction.Discover Comput- ing, 28(1):38, 2025

    Vandana Singh, Sudip Kumar Sahana, and Vandana Bhat- tacharjee. A novel cnn-gru-lstm based deep learning model for accurate traffic prediction.Discover Comput- ing, 28(1):38, 2025

  48. [48]

    Short-term trafficflow- forecasting modelbasedonga-tcn.JournalofAdvanced- Transportation, 1338607:13, 2021

    SUNF Zhangrj, W SONGZ, et al. Short-term trafficflow- forecasting modelbasedonga-tcn.JournalofAdvanced- Transportation, 1338607:13, 2021

  49. [49]

    Passenger flow prediction of scenic spot using a gcn–rnn model.Sustainability, 14(6):3295, 2022

    Zhijie Xu, Liyan Hou, Yueying Zhang, and Jianqin Zhang. Passenger flow prediction of scenic spot using a gcn–rnn model.Sustainability, 14(6):3295, 2022

  50. [50]

    Lightgbm-based model for metro passenger volume fore- casting.IET Intelligent Transport Systems, 14(13):1815– 1823, 2020

    Youyang Zhang, Changfeng Zhu, and Qingrong Wang. Lightgbm-based model for metro passenger volume fore- casting.IET Intelligent Transport Systems, 14(13):1815– 1823, 2020

  51. [51]

    A novel wavelet- svm short-time passenger flow prediction in beijing sub- way system.Neurocomputing, 166:109–121, 2015

    Yuxing Sun, Biao Leng, and Wei Guan. A novel wavelet- svm short-time passenger flow prediction in beijing sub- way system.Neurocomputing, 166:109–121, 2015

  52. [52]

    Clustering and forecast- ing urban bus passenger demand with a combination of time series models.Mathematics, 10(15):2670, 2022

    Irene Mariñas-Collado, Ana E Sipols, M Teresa Santos- Martín, and Elisa Frutos-Bernal. Clustering and forecast- ing urban bus passenger demand with a combination of time series models.Mathematics, 10(15):2670, 2022

  53. [53]

    Relative neighborhood graphs and their relatives.Proceedings of the IEEE, 80(9):1502–1517, 2002

    Jerzy W Jaromczyk and Godfried T Toussaint. Relative neighborhood graphs and their relatives.Proceedings of the IEEE, 80(9):1502–1517, 2002

  54. [54]

    A., Hekker, S., Stello, D., Guti ´errez-Soto, J., Handberg, R., Huber, D., et al

    David W. Matula and Robert R. Sokal. Properties of gabriel graphs relevant to geographic variation research and the clustering of points in the plane.Geograph- ical Analysis, 12(3):205–222, 1980. doi: 10.1111/j. 1538-4632.1980.tb00031.x

  55. [55]

    Degree centrality, between- ness centrality, and closeness centrality in social network

    Junlong Zhang and Yu Luo. Degree centrality, between- ness centrality, and closeness centrality in social network. In2017 2nd international conference on modelling, sim- ulation and applied mathematics (MSAM2017), pages 300–303. Atlantis press, 2017

  56. [56]

    Some unique properties of eigenvector centrality.Social networks, 29(4):555–564, 2007

    Phillip Bonacich. Some unique properties of eigenvector centrality.Social networks, 29(4):555–564, 2007

  57. [57]

    The interaction of size and density with graph-level indices.Social networks, 21(3):239–267, 1999

    Brigham S Anderson, Carter Butts, and Kathleen Car- ley. The interaction of size and density with graph-level indices.Social networks, 21(3):239–267, 1999

  58. [58]

    Collective dy- namics of ‘small-world’networks.nature, 393(6684): 440–442, 1998

    Duncan J Watts and Steven H Strogatz. Collective dy- namics of ‘small-world’networks.nature, 393(6684): 440–442, 1998

  59. [59]

    A dendrite method for cluster analysis.Communications in Statistics-theory and Methods, 3(1):1–27, 1974

    Tadeusz Cali´nski and Jerzy Harabasz. A dendrite method for cluster analysis.Communications in Statistics-theory and Methods, 3(1):1–27, 1974

  60. [60]

    Interactions between bus, metro, and taxi use before and after the chinese spring festival.ISPRS International Journal of Geo-Information, 8(10):445, 2019

    Jianwei Huang, Xintao Liu, Pengxiang Zhao, Junwei Zhang, and Mei-Po Kwan. Interactions between bus, metro, and taxi use before and after the chinese spring festival.ISPRS International Journal of Geo-Information, 8(10):445, 2019

  61. [61]

    Using fuzzy clustering of user perception to determine the number of level-of-service categories for bus rapid transit.Journal of Public Transportation, 24:100017, 2022

    Yueying Huo, Jinhua Zhao, Xiaojuan Li, and Chen Guo. Using fuzzy clustering of user perception to determine the number of level-of-service categories for bus rapid transit.Journal of Public Transportation, 24:100017, 2022

  62. [62]

    Lightgbm: A highly efficient gradient boosting decision tree.Advances in neural information processing systems, 30, 2017

    Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. Lightgbm: A highly efficient gradient boosting decision tree.Advances in neural information processing systems, 30, 2017

  63. [63]

    Random forests.Machine learning, 45: 5–32, 2001

    Leo Breiman. Random forests.Machine learning, 45: 5–32, 2001

  64. [64]

    Xgboost: A scalable tree boosting system

    Tianqi Chen and Carlos Guestrin. Xgboost: A scalable tree boosting system. InProceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pages 785–794, 2016

  65. [65]

    Catboost: unbiased boosting with categorical features

    Liudmila Prokhorenkova, Gleb Gusev, Aleksandr V orobev, Anna Veronika Dorogush, and Andrey Gulin. Catboost: unbiased boosting with categorical features. Advances in neural information processing systems, 31, 2018. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 22/34

  66. [66]

    Forecasting of short-term metro ridership with support vector machine online model.Journal of Ad- vanced Transportation, 2018(1):3189238, 2018

    Xuemei Wang, Ning Zhang, Yunlong Zhang, and Zhuang- bin Shi. Forecasting of short-term metro ridership with support vector machine online model.Journal of Ad- vanced Transportation, 2018(1):3189238, 2018

  67. [67]

    Statistical comparisons of classifiers over multiple data sets.Journal of Machine learning research, 7(Jan):1–30, 2006

    Janez Demšar. Statistical comparisons of classifiers over multiple data sets.Journal of Machine learning research, 7(Jan):1–30, 2006

  68. [68]

    New evidence on walking distances to transit stops: Identifying redun- dancies and gaps using variable service areas.Transporta- tion, 41(1):193–210, 2014

    Ahmed El-Geneidy, Michael Grimsrud, Rania Wasfi, Paul Tétreault, and Julien Surprenant-Legault. New evidence on walking distances to transit stops: Identifying redun- dancies and gaps using variable service areas.Transporta- tion, 41(1):193–210, 2014

  69. [69]

    Explaining walking distance to public transport: The dominance of public transport supply.Journal of Transport and Land Use, 6 (2):5–20, 2013

    Rhonda Daniels and Corinne Mulley. Explaining walking distance to public transport: The dominance of public transport supply.Journal of Transport and Land Use, 6 (2):5–20, 2013

  70. [70]

    Yinghui Wu, Jiafu Tang, Yang Yu, and Zhendong Pan. A stochastic optimization model for transit network timetable design to mitigate the randomness of travel- ing time by adding slack time.Transportation Research Part C: Emerging Technologies, 52:15–31, 2015

  71. [71]

    Stochastic bus schedule coordination considering demand assignment and rerouting of passengers.Transportation Research Part B: Methodological, 121:275–303, 2019

    Weitiao Wu, Ronghui Liu, Wenzhou Jin, and Changxi Ma. Stochastic bus schedule coordination considering demand assignment and rerouting of passengers.Transportation Research Part B: Methodological, 121:275–303, 2019

  72. [72]

    A matheuristic for transfer synchronization through integrated timetabling and vehi- cle scheduling.Transportation Research Part B: Method- ological, 109:128–149, 2018

    João Paiva Fonseca, Evelien van der Hurk, Roberto Roberti, and Allan Larsen. A matheuristic for transfer synchronization through integrated timetabling and vehi- cle scheduling.Transportation Research Part B: Method- ological, 109:128–149, 2018

  73. [73]

    Single bus line timetable optimization with big data: A case study in beijing.Information Sciences, 536:53–66, 2020

    Hongguang Ma, Xiang Li, and Haitao Yu. Single bus line timetable optimization with big data: A case study in beijing.Information Sciences, 536:53–66, 2020

  74. [74]

    Research on bus scheduling optimization considering exhaust emission based on genetic algorithm: Taking a route in nanjing city as an example.Applied Sciences, 14(10):4126, 2024

    Meixia Wang, Baohua Guo, Zhezhe Zhang, and Yan- shuang Zhang. Research on bus scheduling optimization considering exhaust emission based on genetic algorithm: Taking a route in nanjing city as an example.Applied Sciences, 14(10):4126, 2024

  75. [75]

    Robust optimization model of schedule design for a fixed bus route.Transportation Research Part C: Emerging Technologies, 25:113–121, 2012

    Yadan Yan, Qiang Meng, Shuaian Wang, and Xiucheng Guo. Robust optimization model of schedule design for a fixed bus route.Transportation Research Part C: Emerging Technologies, 25:113–121, 2012

  76. [76]

    Estimating the robustness of public transport schedules using machine learning

    Matthias Müller-Hannemann, Ralf Rückert, Alexander Schiewe, and Anita Schöbel. Estimating the robustness of public transport schedules using machine learning. Transportation Research Part C: Emerging Technologies, 137:103566, 2022

  77. [77]

    Validation of automatic passenger counting: introducing the t-test- induced equivalence test.Transportation, 47(6):3031– 3045, 2020

    Michael Siebert and David Ellenberger. Validation of automatic passenger counting: introducing the t-test- induced equivalence test.Transportation, 47(6):3031– 3045, 2020

  78. [78]

    Cross-checking automated passenger counts for ridership analysis.Journal of Public Transportation, 24:100008, 2022

    Simon J Berrebi, Sanskruti Joshi, and Kari E Watkins. Cross-checking automated passenger counts for ridership analysis.Journal of Public Transportation, 24:100008, 2022. Appendices

  79. [79]

    This underscores the need for improved data collection and validation methods in subsequent research

    APC data limitation Although cleaning and filtering procedures were implemented to mitigate these issues, the persistence of such errors reduces the reliability of the dataset and could compromise the accu- racy of the results. This underscores the need for improved data collection and validation methods in subsequent research. Figure S2 shows the distrib...

  80. [80]

    Global MAE Differences Across 14 Matched Test Sets Note: Negative differences indicate lower polygon MAE

    Supplementary Figures and Tables Table S4.Wilcoxon Signed-Rank Test Results: Polygon vs. Global MAE Differences Across 14 Matched Test Sets Note: Negative differences indicate lower polygon MAE. Statistically significantp-values (<0.05) are bolded. Cliff’s Delta (δ) indicates the effect size. Model Mean Diff (MAE) Median Diff (MAE) Wilcoxon Stat p- Value ...