Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction

Daniel Azenkot; Eran Ben Elia; Michael Fire

arxiv: 2605.00083 · v1 · submitted 2026-04-30 · 💻 cs.LG

Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction

Daniel Azenkot , Michael Fire , Eran Ben Elia This is my paper

Pith reviewed 2026-05-09 20:26 UTC · model grok-4.3

classification 💻 cs.LG

keywords bus occupancy predictionspatial clusteringmachine learning modelspublic transport forecastinglocal vs global modelspolygon-based analysisridership prediction

0 comments

The pith

Clustering bus stops into spatial polygons and training local models for each yields bus occupancy forecasts as accurate as one city-wide global model.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper tests whether splitting a city into polygon regions of nearby bus stops, then training a separate machine learning model on each region, can predict passenger numbers as well as a single model trained on all stops together. The local models use the same mix of features: time of day, day of week, weather, and nearby attractions. If the local versions perform equally well, transit planners could build forecasts tuned to specific neighborhoods without losing overall reliability. The work compares the two strategies on real bus data and finds comparable accuracy between them.

Core claim

By grouping bus stops into polygons on the principle that nearby stops share similar ridership patterns, and training a dedicated forecasting model for each polygon using temporal, meteorological, and spatial features, the localized approach achieves predictive accuracy comparable to that of a single global model applied across the entire urban area.

What carries the argument

Polygon-based spatial clustering of bus stops, which groups proximate stops assumed to have similar ridership characteristics so that a separate machine learning model can be trained per cluster.

If this is right

Transit agencies could run neighborhood-specific forecasts while maintaining city-level reliability.
Local models allow service adjustments targeted to individual polygons rather than uniform city rules.
The same multi-source feature set (time, weather, attractions) supports both local and global training without modification.
Spatially aware clustering offers a practical alternative to treating the whole city as one homogeneous area.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same polygon clustering idea could be tested on other transport modes such as trains or shared bikes where stop proximity also matters.
Agencies might lower training costs by maintaining many small local models instead of one large global model.
Dynamic polygons that shift with seasons or events could be explored as an extension beyond fixed boundaries.

Load-bearing premise

Bus stops that are close together share similar enough ridership characteristics that they can be grouped into polygons and modeled separately without reducing overall forecast accuracy.

What would settle it

If, on the same held-out bus ridership dataset, the polygon-specific models produce measurably lower accuracy than the single global model across standard error metrics, the claim of comparable performance would not hold.

Figures

Figures reproduced from arXiv: 2605.00083 by Daniel Azenkot, Eran Ben Elia, Michael Fire.

**Figure 1.** Figure 1: Methodology overview. Since PT demand is strongly influenced by stop location, many researchers have developed spatially aware models that incorporate geographic information into their features. These models are based on Tobler’s first law (see [17]), which state that geographically closer or more similar areas tend to exhibit similar patterns. Wang et al. [26] used Geographically Weighted Regression (GWR)… view at source ↗

**Figure 2.** Figure 2: Max-p regions for March (training on all days except the last 7). Bus stops are grouped into spatially contiguous regions, shown in different colors, as determined by the optimal threshold τ selected using the Calinski–Harabasz (CH) index. effect size analysis. For the high-performing tree-based models, the effect size remains in the “small” to “negligible” range (e.g., δ = 0.194 for LightGBM and δ = 0.13… view at source ↗

**Figure 3.** Figure 3: sMPAE of LightGBM across ridership buckets (0–10 to 41–50) over all experiments, comparing global and polygon strategies. Both approaches show similar performance, with the highest errors in the lowest ridership bucket and a monotone increase in error from 11–20 up to 41–50 ridership. Figure S8 shows that extending the LightGBM training set to include the second-to-last week did not yield meaningful improv… view at source ↗

**Figure 4.** Figure 4: Distribution of mean absolute error (MAE) for four tree-based models (XGBoost, LightGBM, CatBoost, and Random Forest) under the global and polygon-based strategies, for all the test sets used in the experiment. The boxplots show that both approaches yield comparable error distributions, with the polygon-based models generally achieving slightly lower median MAE values. that both modeling strategies maintai… view at source ↗

**Figure 5.** Figure 5: SHAP values for January (polygon-level LightGBM). sengers. Although the polygon-wise model achieved slightly lower median errors in both weeks, the differences were minor. The stability of MAE across weeks suggests that the temporal variability of ridership patterns limits the benefit of simple training set extensions, highlighting the need for more adaptive temporal features or dynamic learning mechanism… view at source ↗

read the original abstract

Accurate forecasting of bus ridership (passengers numbers) is crucial for efficient management and optimization of public transport systems. Traditional forecasting models often fail to capture the unique and localized dynamics of different urban areas by treating the entire city as a single, homogeneous region. This paper introduces a novel framework that enhances bus ridership prediction by integrating a spatial clustering methodology with multi-dimensional feature analysis. The proposed framework utilizes a diverse set of data, including bus ridership data (by route number, time, and bus stop) complemented by a variety of open source data, such as spatial features (e.g., attractive destinations), meteorological conditions (e.g., temperature, rainfall), and temporal patterns (e.g., time of day, day of week). By clustering the urban area into distinct regions, based on the principle that bus stops in close proximity share similar ridership characteristics, a separate local forecasting model is trained for each of these clusters. This localized approach demonstrates an accuracy comparable to that of global models. The findings suggest that a spatially-aware, localized modeling strategy is effective for public transport prediction, paving the way for more targeted and efficient service improvements.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper applies spatial clustering of bus stops into polygons for local ML models on occupancy prediction and finds they match global model performance, though the abstract provides no supporting metrics.

read the letter

What stands out is that the paper clusters bus stops spatially into polygons and trains separate machine learning models for each, claiming this localized setup achieves accuracy comparable to a single global model across the city. They incorporate ridership records along with spatial features like nearby destinations, weather data, and temporal patterns. The work does a good job of describing a practical framework that pulls together diverse open data sources for the prediction task. The clustering principle based on proximity makes intuitive sense for capturing neighborhood-specific ridership behaviors, and training per-cluster models is a direct way to address variation without needing more complex architectures. It is an honest applied study that focuses on real-world transport optimization rather than theoretical advances. The soft spots are in the presentation of results. The abstract states the comparable accuracy without providing any specific metrics, error bars, baseline models, or details on the clustering process and validation. This leaves the central empirical claim unverified from the text alone. The number of clusters is a free parameter that could affect outcomes, yet no sensitivity analysis is mentioned in the summary. This paper would be useful for readers in urban planning or public transit operations who are looking to adapt standard ML techniques to their forecasting needs. Someone already working on geospatial prediction might see it as a routine extension rather than a breakthrough. It deserves a serious referee because the question is well-posed and the approach is reproducible in principle; a review could clarify the experimental details and assess the strength of the comparison. I recommend putting it through peer review to get input on whether the full results support the claims.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a framework for bus occupancy prediction that integrates spatial clustering of bus stops into polygons (based on proximity) with per-cluster machine learning models trained on multi-feature data including ridership, spatial attributes of destinations, meteorological variables, and temporal patterns. It claims that these localized polygon-based models achieve accuracy comparable to a single global model trained on the full dataset.

Significance. If the empirical comparison holds under proper validation, the work could support more spatially targeted forecasting in public transit, potentially improving operational efficiency. The reliance on open-source auxiliary data is a methodological strength that enhances reproducibility and generalizability.

major comments (2)

Abstract: The central claim that 'this localized approach demonstrates an accuracy comparable to that of global models' is stated without any quantitative metrics (e.g., MAE, RMSE, R²), error bars, baseline details, or statistical tests. This absence makes the primary result unverifiable from the provided text and is load-bearing for the paper's contribution.
Methodology (clustering and evaluation procedure): No description is given of the specific clustering algorithm, how the number of polygons/clusters is determined, the criteria for assigning stops to polygons, the train/validation/test split strategy, or the cross-validation method used to compare local versus global models. These details are required to assess whether the 'comparable accuracy' result is robust or an artifact of the experimental design.

minor comments (1)

The abstract and introduction would benefit from a concise statement of the exact performance metrics and the magnitude of any observed differences between local and global models.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive and detailed comments on our manuscript. We address each major point below and will revise the paper to improve the verifiability and reproducibility of our results.

read point-by-point responses

Referee: Abstract: The central claim that 'this localized approach demonstrates an accuracy comparable to that of global models' is stated without any quantitative metrics (e.g., MAE, RMSE, R²), error bars, baseline details, or statistical tests. This absence makes the primary result unverifiable from the provided text and is load-bearing for the paper's contribution.

Authors: We agree that the abstract should contain quantitative support for the comparability claim to allow immediate verification. The results section of the manuscript reports MAE, RMSE, and R² values for the polygon-based models versus the single global model (using the same feature set), along with the baselines and a note that performance differences fall within statistical noise. We will revise the abstract to include the key aggregate metrics and a brief reference to the statistical comparison, ensuring the central result is verifiable from the abstract. revision: yes
Referee: Methodology (clustering and evaluation procedure): No description is given of the specific clustering algorithm, how the number of polygons/clusters is determined, the criteria for assigning stops to polygons, the train/validation/test split strategy, or the cross-validation method used to compare local versus global models. These details are required to assess whether the 'comparable accuracy' result is robust or an artifact of the experimental design.

Authors: We acknowledge that the current description of the clustering and evaluation procedure is insufficiently detailed. We will expand the Methodology section to specify the clustering algorithm, the criterion or method used to select the number of polygons, the precise assignment rule for bus stops, the train/validation/test partitioning approach (including how temporal ordering is respected), and the cross-validation procedure employed for the local-versus-global comparison. These additions will enable readers to assess the robustness of the reported accuracy comparability. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The paper is an empirical comparative study that clusters bus stops into polygons by spatial proximity, extracts features from external open-source spatial/meteorological/temporal data, trains separate supervised models per cluster, and reports that their accuracy is comparable to a single global model. This workflow contains no mathematical derivation chain; the central claim is a direct outcome of standard clustering plus ML training/evaluation on held-out data. No step defines a quantity in terms of itself, renames a fitted parameter as a prediction, or relies on a self-citation whose content is itself unverified or tautological. The methodology is self-contained against external benchmarks and does not reduce to its inputs by construction.

Axiom & Free-Parameter Ledger

2 free parameters · 2 axioms · 0 invented entities

The central claim rests on the domain assumption of spatial similarity in ridership and on standard machine-learning modeling assumptions. No new physical entities are postulated. Several free parameters are implicit in any clustering-plus-modeling pipeline.

free parameters (2)

number of clusters / polygon count
The number of regions is chosen or optimized from data and directly determines the localization granularity.
model hyperparameters
Typical supervised learning parameters (learning rate, tree depth, regularization, etc.) are tuned to the ridership dataset.

axioms (2)

domain assumption Bus stops in close proximity share similar ridership characteristics
Explicitly invoked in the abstract as the principle justifying the spatial clustering step.
domain assumption Integration of open-source spatial, meteorological, and temporal features improves predictive accuracy
Assumed when the framework is described as using these complementary data sources.

pith-pipeline@v0.9.0 · 5502 in / 1553 out tokens · 38022 ms · 2026-05-09T20:26:04.880697+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

80 extracted references · 80 canonical work pages

[1]

Appraisal of urbanization and traffic on environmental quality.Journal of CO2 Utilization, 16:428–430, 2016

Nikola Petrovi´c, Nebojsa Bojovi´c, and Jelena Petrovi ´c. Appraisal of urbanization and traffic on environmental quality.Journal of CO2 Utilization, 16:428–430, 2016

work page 2016
[2]

Island Press, 2024

Jarrett Walker.Human transit, revised edition: how clearer thinking about public transit can enrich our com- munities and our lives. Island Press, 2024

work page 2024
[3]

A composite index of public transit accessibility.Journal of Public Transportation, 14(2):69–87, 2011

Md Sha Al Mamun and Nicholas E Lownes. A composite index of public transit accessibility.Journal of Public Transportation, 14(2):69–87, 2011

work page 2011
[4]

Efficient timetabling and vehicle scheduling for public transport

Avi Ceder. Efficient timetabling and vehicle scheduling for public transport. InComputer-aided scheduling of public transport, pages 37–52. Springer, 2001

work page 2001
[5]

Workshop synthesis: Representative- ness in surveys: challenges and solutions.Transportation Research Procedia, 32:224–228, 2018

Jimmy Armoogum, Adrian B Ellison, and Marie- José Olde Kalter. Workshop synthesis: Representative- ness in surveys: challenges and solutions.Transportation Research Procedia, 32:224–228, 2018

work page 2018
[6]

Public-transit frequency setting using minimum-cost approach with stochastic demand and travel time.Transportation Re- search Part B: Methodological, 46(8):1068–1084, 2012

Yuval Hadas and Matan Shnaiderman. Public-transit frequency setting using minimum-cost approach with stochastic demand and travel time.Transportation Re- search Part B: Methodological, 46(8):1068–1084, 2012

work page 2012
[7]

CRC press, 2016

Avishai Ceder.Public transit planning and operation: Modeling, practice and behavior. CRC press, 2016

work page 2016
[8]

Smart cities, big data and urban policy: Towards urban analytics for the long run

Jens Kandt and Michael Batty. Smart cities, big data and urban policy: Towards urban analytics for the long run. Cities, 109:102992, 2021

work page 2021
[9]

Towards smart card based mutual authentication schemes in cloud computing.KSII Transactions on In- ternet and Information Systems (TIIS), 9(7):2719–2735, 2015

Haoxing Li, Fenghua Li, Chenggen Song, and Yalong Yan. Towards smart card based mutual authentication schemes in cloud computing.KSII Transactions on In- ternet and Information Systems (TIIS), 9(7):2719–2735, 2015

work page 2015
[10]

Smart card data mining of public transport destination: A litera- ture review.Information, 9(1):18, 2018

Tian Li, Dazhi Sun, Peng Jing, and Kaixi Yang. Smart card data mining of public transport destination: A litera- ture review.Information, 9(1):18, 2018

work page 2018
[11]

Understanding commuting patterns using transit smart card data.Journal of Transport Geog- raphy, 58:135–145, 2017

Xiaolei Ma, Congcong Liu, Huimin Wen, Yunpeng Wang, and Yao-Jan Wu. Understanding commuting patterns using transit smart card data.Journal of Transport Geog- raphy, 58:135–145, 2017

work page 2017
[12]

Mining smart card data for transit riders’ travel patterns.Transportation Research Part C: Emerg- ing Technologies, 36:1–12, 2013

Xiaolei Ma, Yao-Jan Wu, Yinhai Wang, Feng Chen, and Jianfeng Liu. Mining smart card data for transit riders’ travel patterns.Transportation Research Part C: Emerg- ing Technologies, 36:1–12, 2013

work page 2013
[13]

Urban transportation data research overview: A bibliometric analysis based on citespace.Sustainability, 16(22):9615, 2024

Yanni Liang, Jianxin You, Ran Wang, Bo Qin, and Shuo Han. Urban transportation data research overview: A bibliometric analysis based on citespace.Sustainability, 16(22):9615, 2024

work page 2024
[14]

Khatun E Zannat and Charisma F Choudhury. Emerging big data sources for public transport planning: A sys- tematic review on current state of art and future research directions.Journal of the Indian Institute of Science, 99 (4):601–619, 2019

work page 2019
[15]

Pioneering open data standards: The gtfs story

B McHugh. Pioneering open data standards: The gtfs story. beyond transparency: open data and the future of civic innovation.Beyond transparency: open data and the future of civic innovation, pages 123–135, 2013

work page 2013
[16]

Behavioural data mining of transit smart card data: A data fusion approach

Takahiko Kusakabe and Yasuo Asakura. Behavioural data mining of transit smart card data: A data fusion approach. Transportation Research Part C: Emerging Technologies, 46:179–191, 2014

work page 2014
[17]

Tobler’s first law and spatial analysis

Harvey J Miller. Tobler’s first law and spatial analysis. Annals of the association of American geographers, 94 (2):284–289, 2004. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 20/34

work page 2004
[18]

The max- p-regions problem.Journal of Regional Science, 52(3): 397–419, 2012

Juan C Duque, Luc Anselin, and Sergio J Rey. The max- p-regions problem.Journal of Regional Science, 52(3): 397–419, 2012

work page 2012
[19]

Explanation of machine learning models using improved shapley additive expla- nation

Yasunobu Nohara, Koutarou Matsumoto, Hidehisa Soe- jima, and Naoki Nakashima. Explanation of machine learning models using improved shapley additive expla- nation. InProceedings of the 10th ACM international conference on bioinformatics, computational biology and health informatics, pages 546–546, 2019

work page 2019
[20]

De- velopment and evaluation of frameworks for real-time bus passenger occupancy prediction.International Journal of Transportation Science and Technology, 12(2):399–413, 2023

Jonathan Wood, Zhengyao Yu, and Vikash V Gayah. De- velopment and evaluation of frameworks for real-time bus passenger occupancy prediction.International Journal of Transportation Science and Technology, 12(2):399–413, 2023

work page 2023
[21]

Framework for onboard bus comfort level predictions using the markov chain concept

Paweł Wi˛ ecek, Daniel Kubek, Jan Hipolit Aleksandrow- icz, and Aleksandra Stró˙zek. Framework for onboard bus comfort level predictions using the markov chain concept. Symmetry, 11(6):755, 2019

work page 2019
[22]

Research on forecast of rail traffic flow based on arima model

Shu Ying Liu, Shuo Liu, Ye Tian, Quan Long Sun, and Yu Yang Tang. Research on forecast of rail traffic flow based on arima model. InJournal of Physics: Conference Series, volume 1792, page 012065. IOP Publishing, 2021

work page 2021
[23]

Short-term passenger flow prediction in urban public transport: Kalman filtering combined k-nearest neighbor approach.Ieee Access, 7:120937–120949, 2019

Shidong Liang, Minghui Ma, Shengxue He, and Hu Zhang. Short-term passenger flow prediction in urban public transport: Kalman filtering combined k-nearest neighbor approach.Ieee Access, 7:120937–120949, 2019

work page 2019
[24]

Passenger flow prediction using smart card data from connected bus system based on interpretable xgboost.Wireless Communications and Mobile Computing, 2022(1):5872225, 2022

Liang Zou, Sisi Shu, Xiang Lin, Kaisheng Lin, Jiasong Zhu, and Linchao Li. Passenger flow prediction using smart card data from connected bus system based on interpretable xgboost.Wireless Communications and Mobile Computing, 2022(1):5872225, 2022

work page 2022
[25]

Designing on- board explainable passenger flow prediction.Engineering Applications of Artificial Intelligence, 139:109648, 2025

Mario Barbareschi, Antonio Emmanuele, Nicola Maz- zocca, and Franca Rocco di Torrepadula. Designing on- board explainable passenger flow prediction.Engineering Applications of Artificial Intelligence, 139:109648, 2025

work page 2025
[26]

Bus ridership and its determinants in beijing: A spatial econometric perspective.Transportation, 50(2): 383–406, 2023

Jiaoe Wang, Yanan Li, Jingjuan Jiao, Haitao Jin, and Fangye Du. Bus ridership and its determinants in beijing: A spatial econometric perspective.Transportation, 50(2): 383–406, 2023

work page 2023
[27]

An adapted geographically weighted lasso (ada-gwl) model for pre- dicting subway ridership.Transportation, 48(3):1185– 1216, 2021

Yuxin He, Yang Zhao, and Kwok Leung Tsui. An adapted geographically weighted lasso (ada-gwl) model for pre- dicting subway ridership.Transportation, 48(3):1185– 1216, 2021

work page 2021
[28]

‘centrality measures’ as a tool to identify the transit demand at public transit stops; a case of ahmedabad city, india.International Journal, 2 (7):1063–1074, 2014

TalatMunshi AmilaJayasinghe. ‘centrality measures’ as a tool to identify the transit demand at public transit stops; a case of ahmedabad city, india.International Journal, 2 (7):1063–1074, 2014

work page 2014
[29]

Exploring the nonlinear effects of built environment on bus-transfer ridership: take shanghai as an example.Ap- plied Sciences, 12(11):5755, 2022

Ding Liu, Wuyue Rong, Jin Zhang, and Ying-En Ge. Exploring the nonlinear effects of built environment on bus-transfer ridership: take shanghai as an example.Ap- plied Sciences, 12(11):5755, 2022

work page 2022
[30]

Exploring the association be- tween network centralities and passenger flows in metro systems.Applied Network Science, 8(1):69, 2023

Athanasios Kopsidas, Aristeides Douvaras, and Kon- stantinos Kepaptsoglou. Exploring the association be- tween network centralities and passenger flows in metro systems.Applied Network Science, 8(1):69, 2023

work page 2023
[31]

Predicting bus ridership based on the weather conditions using deep learning algorithms.Trans- portation Research Interdisciplinary Perspectives, 19: 100833, 2023

Zakir H Farahmand, Konstantinos Gkiotsalitis, and Karst T Geurs. Predicting bus ridership based on the weather conditions using deep learning algorithms.Trans- portation Research Interdisciplinary Perspectives, 19: 100833, 2023

work page 2023
[32]

Artificial neural networks for fore- casting passenger flows on metro lines.Sensors, 19(15): 3424, 2019

Mariano Gallo, Giuseppina De Luca, Luca D’Acierno, and Marilisa Botte. Artificial neural networks for fore- casting passenger flows on metro lines.Sensors, 19(15): 3424, 2019

work page 2019
[33]

Learning to forget: Continual prediction with lstm.Neu- ral computation, 12(10):2451–2471, 2000

Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. Learning to forget: Continual prediction with lstm.Neu- ral computation, 12(10):2451–2471, 2000

work page 2000
[34]

Deep learning based lstm model for predicting the number of passengers for public transport bus operators.Jurnal Online Informatika, 9(1):18–28, 2024

Joko Siswanto, Danny Manongga, Irwan Sembiring, and Sutarto Wijono. Deep learning based lstm model for predicting the number of passengers for public transport bus operators.Jurnal Online Informatika, 9(1):18–28, 2024

work page 2024
[35]

Fore- casting the short-term metro ridership with seasonal and trend decomposition using loess and lstm neural networks

Dewang Chen, Jianhua Zhang, and Shixiong Jiang. Fore- casting the short-term metro ridership with seasonal and trend decomposition using loess and lstm neural networks. Ieee Access, 8:91181–91187, 2020

work page 2020
[36]

Deeppf: A deep learning based architecture for metro passenger flow pre- diction.Transportation Research Part C: Emerging Tech- nologies, 101:18–34, 2019

Yang Liu, Zhiyuan Liu, and Ruo Jia. Deeppf: A deep learning based architecture for metro passenger flow pre- diction.Transportation Research Part C: Emerging Tech- nologies, 101:18–34, 2019

work page 2019
[37]

Ai-based neural network models for bus passenger demand forecasting using smart card data

Sohani Liyanage, Rusul Abduljabbar, Hussein Dia, and Pei-Wei Tsai. Ai-based neural network models for bus passenger demand forecasting using smart card data. Journal of Urban Management, 11(3):365–380, 2022

work page 2022
[38]

Short-term bus passenger flow forecast based on cnn-bilstm.Advances in Engineer- ing Technology Research, 5(1):448–448, 2023

Chaohua Wu and Xingzu Qi. Short-term bus passenger flow forecast based on cnn-bilstm.Advances in Engineer- ing Technology Research, 5(1):448–448, 2023

work page 2023
[39]

Comparative analysis of deep-learning-based models for hourly bus passenger flow forecasting.Transportation, 51(5):1759–1784, 2024

Yu Zhang, Xiaodan Wang, Jingjing Xie, and Yun Bai. Comparative analysis of deep-learning-based models for hourly bus passenger flow forecasting.Transportation, 51(5):1759–1784, 2024

work page 2024
[40]

Transparency and the black box problem: Why we do not trust ai.Philosophy & Technology, 34(4):1607–1622, 2021

Warren J V on Eschenbach. Transparency and the black box problem: Why we do not trust ai.Philosophy & Technology, 34(4):1607–1622, 2021. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 21/34

work page 2021
[41]

Deep learning xai for bus passenger forecasting: A use case in spain.Mathematics, 10(9):1428, 2022

Leticia Monje, Ramón A Carrasco, Carlos Rosado, and Manuel Sánchez-Montañés. Deep learning xai for bus passenger forecasting: A use case in spain.Mathematics, 10(9):1428, 2022

work page 2022
[42]

A novel passenger flow prediction model using deep learning methods.Trans- portation Research Part C: Emerging Technologies, 84: 74–91, 2017

Lijuan Liu and Rung-Ching Chen. A novel passenger flow prediction model using deep learning methods.Trans- portation Research Part C: Emerging Technologies, 84: 74–91, 2017

work page 2017
[43]

Prediction of public bus passenger flow using spatial–temporal hybrid model of deep learning

Tao Chen, Jie Fang, Mengyun Xu, Yingfang Tong, and Wentian Chen. Prediction of public bus passenger flow using spatial–temporal hybrid model of deep learning. Journal of Transportation Engineering, Part A: Systems, 148(4):04022007, 2022

work page 2022
[44]

Short-term abnormal passenger flow pre- diction based on the fusion of svr and lstm.Ieee Access, 7:42946–42955, 2019

Jianyuan Guo, Zhen Xie, Yong Qin, Limin Jia, and Yaguan Wang. Short-term abnormal passenger flow pre- diction based on the fusion of svr and lstm.Ieee Access, 7:42946–42955, 2019

work page 2019
[45]

Short-term passenger flow prediction for multi-traffic modes: A transformer and residual network based multi-task learning method.In- formation Sciences, 642:119144, 2023

Yongjie Yang, Jinlei Zhang, Lixing Yang, Yang Yang, Xiaohong Li, and Ziyou Gao. Short-term passenger flow prediction for multi-traffic modes: A transformer and residual network based multi-task learning method.In- formation Sciences, 642:119144, 2023

work page 2023
[46]

Hy- brid hidden markov lstm for short-term traffic flow pre- diction.arXiv preprint arXiv:2307.04954, 2023

Agnimitra Sengupta, Adway Das, and S Ilgin Guler. Hy- brid hidden markov lstm for short-term traffic flow pre- diction.arXiv preprint arXiv:2307.04954, 2023

work page arXiv 2023
[47]

A novel cnn-gru-lstm based deep learning model for accurate traffic prediction.Discover Comput- ing, 28(1):38, 2025

Vandana Singh, Sudip Kumar Sahana, and Vandana Bhat- tacharjee. A novel cnn-gru-lstm based deep learning model for accurate traffic prediction.Discover Comput- ing, 28(1):38, 2025

work page 2025
[48]

Short-term trafficflow- forecasting modelbasedonga-tcn.JournalofAdvanced- Transportation, 1338607:13, 2021

SUNF Zhangrj, W SONGZ, et al. Short-term trafficflow- forecasting modelbasedonga-tcn.JournalofAdvanced- Transportation, 1338607:13, 2021

work page 2021
[49]

Passenger flow prediction of scenic spot using a gcn–rnn model.Sustainability, 14(6):3295, 2022

Zhijie Xu, Liyan Hou, Yueying Zhang, and Jianqin Zhang. Passenger flow prediction of scenic spot using a gcn–rnn model.Sustainability, 14(6):3295, 2022

work page 2022
[50]

Lightgbm-based model for metro passenger volume fore- casting.IET Intelligent Transport Systems, 14(13):1815– 1823, 2020

Youyang Zhang, Changfeng Zhu, and Qingrong Wang. Lightgbm-based model for metro passenger volume fore- casting.IET Intelligent Transport Systems, 14(13):1815– 1823, 2020

work page 2020
[51]

A novel wavelet- svm short-time passenger flow prediction in beijing sub- way system.Neurocomputing, 166:109–121, 2015

Yuxing Sun, Biao Leng, and Wei Guan. A novel wavelet- svm short-time passenger flow prediction in beijing sub- way system.Neurocomputing, 166:109–121, 2015

work page 2015
[52]

Clustering and forecast- ing urban bus passenger demand with a combination of time series models.Mathematics, 10(15):2670, 2022

Irene Mariñas-Collado, Ana E Sipols, M Teresa Santos- Martín, and Elisa Frutos-Bernal. Clustering and forecast- ing urban bus passenger demand with a combination of time series models.Mathematics, 10(15):2670, 2022

work page 2022
[53]

Relative neighborhood graphs and their relatives.Proceedings of the IEEE, 80(9):1502–1517, 2002

Jerzy W Jaromczyk and Godfried T Toussaint. Relative neighborhood graphs and their relatives.Proceedings of the IEEE, 80(9):1502–1517, 2002

work page 2002
[54]

A., Hekker, S., Stello, D., Guti ´errez-Soto, J., Handberg, R., Huber, D., et al

David W. Matula and Robert R. Sokal. Properties of gabriel graphs relevant to geographic variation research and the clustering of points in the plane.Geograph- ical Analysis, 12(3):205–222, 1980. doi: 10.1111/j. 1538-4632.1980.tb00031.x

work page doi:10.1111/j 1980
[55]

Degree centrality, between- ness centrality, and closeness centrality in social network

Junlong Zhang and Yu Luo. Degree centrality, between- ness centrality, and closeness centrality in social network. In2017 2nd international conference on modelling, sim- ulation and applied mathematics (MSAM2017), pages 300–303. Atlantis press, 2017

work page 2017
[56]

Some unique properties of eigenvector centrality.Social networks, 29(4):555–564, 2007

Phillip Bonacich. Some unique properties of eigenvector centrality.Social networks, 29(4):555–564, 2007

work page 2007
[57]

The interaction of size and density with graph-level indices.Social networks, 21(3):239–267, 1999

Brigham S Anderson, Carter Butts, and Kathleen Car- ley. The interaction of size and density with graph-level indices.Social networks, 21(3):239–267, 1999

work page 1999
[58]

Collective dy- namics of ‘small-world’networks.nature, 393(6684): 440–442, 1998

Duncan J Watts and Steven H Strogatz. Collective dy- namics of ‘small-world’networks.nature, 393(6684): 440–442, 1998

work page 1998
[59]

A dendrite method for cluster analysis.Communications in Statistics-theory and Methods, 3(1):1–27, 1974

Tadeusz Cali´nski and Jerzy Harabasz. A dendrite method for cluster analysis.Communications in Statistics-theory and Methods, 3(1):1–27, 1974

work page 1974
[60]

Interactions between bus, metro, and taxi use before and after the chinese spring festival.ISPRS International Journal of Geo-Information, 8(10):445, 2019

Jianwei Huang, Xintao Liu, Pengxiang Zhao, Junwei Zhang, and Mei-Po Kwan. Interactions between bus, metro, and taxi use before and after the chinese spring festival.ISPRS International Journal of Geo-Information, 8(10):445, 2019

work page 2019
[61]

Using fuzzy clustering of user perception to determine the number of level-of-service categories for bus rapid transit.Journal of Public Transportation, 24:100017, 2022

Yueying Huo, Jinhua Zhao, Xiaojuan Li, and Chen Guo. Using fuzzy clustering of user perception to determine the number of level-of-service categories for bus rapid transit.Journal of Public Transportation, 24:100017, 2022

work page 2022
[62]

Lightgbm: A highly efficient gradient boosting decision tree.Advances in neural information processing systems, 30, 2017

Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. Lightgbm: A highly efficient gradient boosting decision tree.Advances in neural information processing systems, 30, 2017

work page 2017
[63]

Random forests.Machine learning, 45: 5–32, 2001

Leo Breiman. Random forests.Machine learning, 45: 5–32, 2001

work page 2001
[64]

Xgboost: A scalable tree boosting system

Tianqi Chen and Carlos Guestrin. Xgboost: A scalable tree boosting system. InProceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pages 785–794, 2016

work page 2016
[65]

Catboost: unbiased boosting with categorical features

Liudmila Prokhorenkova, Gleb Gusev, Aleksandr V orobev, Anna Veronika Dorogush, and Andrey Gulin. Catboost: unbiased boosting with categorical features. Advances in neural information processing systems, 31, 2018. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 22/34

work page 2018
[66]

Forecasting of short-term metro ridership with support vector machine online model.Journal of Ad- vanced Transportation, 2018(1):3189238, 2018

Xuemei Wang, Ning Zhang, Yunlong Zhang, and Zhuang- bin Shi. Forecasting of short-term metro ridership with support vector machine online model.Journal of Ad- vanced Transportation, 2018(1):3189238, 2018

work page 2018
[67]

Statistical comparisons of classifiers over multiple data sets.Journal of Machine learning research, 7(Jan):1–30, 2006

Janez Demšar. Statistical comparisons of classifiers over multiple data sets.Journal of Machine learning research, 7(Jan):1–30, 2006

work page 2006
[68]

New evidence on walking distances to transit stops: Identifying redun- dancies and gaps using variable service areas.Transporta- tion, 41(1):193–210, 2014

Ahmed El-Geneidy, Michael Grimsrud, Rania Wasfi, Paul Tétreault, and Julien Surprenant-Legault. New evidence on walking distances to transit stops: Identifying redun- dancies and gaps using variable service areas.Transporta- tion, 41(1):193–210, 2014

work page 2014
[69]

Explaining walking distance to public transport: The dominance of public transport supply.Journal of Transport and Land Use, 6 (2):5–20, 2013

Rhonda Daniels and Corinne Mulley. Explaining walking distance to public transport: The dominance of public transport supply.Journal of Transport and Land Use, 6 (2):5–20, 2013

work page 2013
[70]

Yinghui Wu, Jiafu Tang, Yang Yu, and Zhendong Pan. A stochastic optimization model for transit network timetable design to mitigate the randomness of travel- ing time by adding slack time.Transportation Research Part C: Emerging Technologies, 52:15–31, 2015

work page 2015
[71]

Stochastic bus schedule coordination considering demand assignment and rerouting of passengers.Transportation Research Part B: Methodological, 121:275–303, 2019

Weitiao Wu, Ronghui Liu, Wenzhou Jin, and Changxi Ma. Stochastic bus schedule coordination considering demand assignment and rerouting of passengers.Transportation Research Part B: Methodological, 121:275–303, 2019

work page 2019
[72]

A matheuristic for transfer synchronization through integrated timetabling and vehi- cle scheduling.Transportation Research Part B: Method- ological, 109:128–149, 2018

João Paiva Fonseca, Evelien van der Hurk, Roberto Roberti, and Allan Larsen. A matheuristic for transfer synchronization through integrated timetabling and vehi- cle scheduling.Transportation Research Part B: Method- ological, 109:128–149, 2018

work page 2018
[73]

Single bus line timetable optimization with big data: A case study in beijing.Information Sciences, 536:53–66, 2020

Hongguang Ma, Xiang Li, and Haitao Yu. Single bus line timetable optimization with big data: A case study in beijing.Information Sciences, 536:53–66, 2020

work page 2020
[74]

Research on bus scheduling optimization considering exhaust emission based on genetic algorithm: Taking a route in nanjing city as an example.Applied Sciences, 14(10):4126, 2024

Meixia Wang, Baohua Guo, Zhezhe Zhang, and Yan- shuang Zhang. Research on bus scheduling optimization considering exhaust emission based on genetic algorithm: Taking a route in nanjing city as an example.Applied Sciences, 14(10):4126, 2024

work page 2024
[75]

Robust optimization model of schedule design for a fixed bus route.Transportation Research Part C: Emerging Technologies, 25:113–121, 2012

Yadan Yan, Qiang Meng, Shuaian Wang, and Xiucheng Guo. Robust optimization model of schedule design for a fixed bus route.Transportation Research Part C: Emerging Technologies, 25:113–121, 2012

work page 2012
[76]

Estimating the robustness of public transport schedules using machine learning

Matthias Müller-Hannemann, Ralf Rückert, Alexander Schiewe, and Anita Schöbel. Estimating the robustness of public transport schedules using machine learning. Transportation Research Part C: Emerging Technologies, 137:103566, 2022

work page 2022
[77]

Validation of automatic passenger counting: introducing the t-test- induced equivalence test.Transportation, 47(6):3031– 3045, 2020

Michael Siebert and David Ellenberger. Validation of automatic passenger counting: introducing the t-test- induced equivalence test.Transportation, 47(6):3031– 3045, 2020

work page 2020
[78]

Cross-checking automated passenger counts for ridership analysis.Journal of Public Transportation, 24:100008, 2022

Simon J Berrebi, Sanskruti Joshi, and Kari E Watkins. Cross-checking automated passenger counts for ridership analysis.Journal of Public Transportation, 24:100008, 2022. Appendices

work page 2022
[79]

This underscores the need for improved data collection and validation methods in subsequent research

APC data limitation Although cleaning and filtering procedures were implemented to mitigate these issues, the persistence of such errors reduces the reliability of the dataset and could compromise the accu- racy of the results. This underscores the need for improved data collection and validation methods in subsequent research. Figure S2 shows the distrib...

work page
[80]

Global MAE Differences Across 14 Matched Test Sets Note: Negative differences indicate lower polygon MAE

Supplementary Figures and Tables Table S4.Wilcoxon Signed-Rank Test Results: Polygon vs. Global MAE Differences Across 14 Matched Test Sets Note: Negative differences indicate lower polygon MAE. Statistically significantp-values (<0.05) are bolded. Cliff’s Delta (δ) indicates the effect size. Model Mean Diff (MAE) Median Diff (MAE) Wilcoxon Stat p- Value ...

work page

[1] [1]

Appraisal of urbanization and traffic on environmental quality.Journal of CO2 Utilization, 16:428–430, 2016

Nikola Petrovi´c, Nebojsa Bojovi´c, and Jelena Petrovi ´c. Appraisal of urbanization and traffic on environmental quality.Journal of CO2 Utilization, 16:428–430, 2016

work page 2016

[2] [2]

Island Press, 2024

Jarrett Walker.Human transit, revised edition: how clearer thinking about public transit can enrich our com- munities and our lives. Island Press, 2024

work page 2024

[3] [3]

A composite index of public transit accessibility.Journal of Public Transportation, 14(2):69–87, 2011

Md Sha Al Mamun and Nicholas E Lownes. A composite index of public transit accessibility.Journal of Public Transportation, 14(2):69–87, 2011

work page 2011

[4] [4]

Efficient timetabling and vehicle scheduling for public transport

Avi Ceder. Efficient timetabling and vehicle scheduling for public transport. InComputer-aided scheduling of public transport, pages 37–52. Springer, 2001

work page 2001

[5] [5]

Workshop synthesis: Representative- ness in surveys: challenges and solutions.Transportation Research Procedia, 32:224–228, 2018

Jimmy Armoogum, Adrian B Ellison, and Marie- José Olde Kalter. Workshop synthesis: Representative- ness in surveys: challenges and solutions.Transportation Research Procedia, 32:224–228, 2018

work page 2018

[6] [6]

Public-transit frequency setting using minimum-cost approach with stochastic demand and travel time.Transportation Re- search Part B: Methodological, 46(8):1068–1084, 2012

Yuval Hadas and Matan Shnaiderman. Public-transit frequency setting using minimum-cost approach with stochastic demand and travel time.Transportation Re- search Part B: Methodological, 46(8):1068–1084, 2012

work page 2012

[7] [7]

CRC press, 2016

Avishai Ceder.Public transit planning and operation: Modeling, practice and behavior. CRC press, 2016

work page 2016

[8] [8]

Smart cities, big data and urban policy: Towards urban analytics for the long run

Jens Kandt and Michael Batty. Smart cities, big data and urban policy: Towards urban analytics for the long run. Cities, 109:102992, 2021

work page 2021

[9] [9]

Towards smart card based mutual authentication schemes in cloud computing.KSII Transactions on In- ternet and Information Systems (TIIS), 9(7):2719–2735, 2015

Haoxing Li, Fenghua Li, Chenggen Song, and Yalong Yan. Towards smart card based mutual authentication schemes in cloud computing.KSII Transactions on In- ternet and Information Systems (TIIS), 9(7):2719–2735, 2015

work page 2015

[10] [10]

Smart card data mining of public transport destination: A litera- ture review.Information, 9(1):18, 2018

Tian Li, Dazhi Sun, Peng Jing, and Kaixi Yang. Smart card data mining of public transport destination: A litera- ture review.Information, 9(1):18, 2018

work page 2018

[11] [11]

Understanding commuting patterns using transit smart card data.Journal of Transport Geog- raphy, 58:135–145, 2017

Xiaolei Ma, Congcong Liu, Huimin Wen, Yunpeng Wang, and Yao-Jan Wu. Understanding commuting patterns using transit smart card data.Journal of Transport Geog- raphy, 58:135–145, 2017

work page 2017

[12] [12]

Mining smart card data for transit riders’ travel patterns.Transportation Research Part C: Emerg- ing Technologies, 36:1–12, 2013

Xiaolei Ma, Yao-Jan Wu, Yinhai Wang, Feng Chen, and Jianfeng Liu. Mining smart card data for transit riders’ travel patterns.Transportation Research Part C: Emerg- ing Technologies, 36:1–12, 2013

work page 2013

[13] [13]

Urban transportation data research overview: A bibliometric analysis based on citespace.Sustainability, 16(22):9615, 2024

Yanni Liang, Jianxin You, Ran Wang, Bo Qin, and Shuo Han. Urban transportation data research overview: A bibliometric analysis based on citespace.Sustainability, 16(22):9615, 2024

work page 2024

[14] [14]

Khatun E Zannat and Charisma F Choudhury. Emerging big data sources for public transport planning: A sys- tematic review on current state of art and future research directions.Journal of the Indian Institute of Science, 99 (4):601–619, 2019

work page 2019

[15] [15]

Pioneering open data standards: The gtfs story

B McHugh. Pioneering open data standards: The gtfs story. beyond transparency: open data and the future of civic innovation.Beyond transparency: open data and the future of civic innovation, pages 123–135, 2013

work page 2013

[16] [16]

Behavioural data mining of transit smart card data: A data fusion approach

Takahiko Kusakabe and Yasuo Asakura. Behavioural data mining of transit smart card data: A data fusion approach. Transportation Research Part C: Emerging Technologies, 46:179–191, 2014

work page 2014

[17] [17]

Tobler’s first law and spatial analysis

Harvey J Miller. Tobler’s first law and spatial analysis. Annals of the association of American geographers, 94 (2):284–289, 2004. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 20/34

work page 2004

[18] [18]

The max- p-regions problem.Journal of Regional Science, 52(3): 397–419, 2012

Juan C Duque, Luc Anselin, and Sergio J Rey. The max- p-regions problem.Journal of Regional Science, 52(3): 397–419, 2012

work page 2012

[19] [19]

Explanation of machine learning models using improved shapley additive expla- nation

Yasunobu Nohara, Koutarou Matsumoto, Hidehisa Soe- jima, and Naoki Nakashima. Explanation of machine learning models using improved shapley additive expla- nation. InProceedings of the 10th ACM international conference on bioinformatics, computational biology and health informatics, pages 546–546, 2019

work page 2019

[20] [20]

De- velopment and evaluation of frameworks for real-time bus passenger occupancy prediction.International Journal of Transportation Science and Technology, 12(2):399–413, 2023

Jonathan Wood, Zhengyao Yu, and Vikash V Gayah. De- velopment and evaluation of frameworks for real-time bus passenger occupancy prediction.International Journal of Transportation Science and Technology, 12(2):399–413, 2023

work page 2023

[21] [21]

Framework for onboard bus comfort level predictions using the markov chain concept

Paweł Wi˛ ecek, Daniel Kubek, Jan Hipolit Aleksandrow- icz, and Aleksandra Stró˙zek. Framework for onboard bus comfort level predictions using the markov chain concept. Symmetry, 11(6):755, 2019

work page 2019

[22] [22]

Research on forecast of rail traffic flow based on arima model

Shu Ying Liu, Shuo Liu, Ye Tian, Quan Long Sun, and Yu Yang Tang. Research on forecast of rail traffic flow based on arima model. InJournal of Physics: Conference Series, volume 1792, page 012065. IOP Publishing, 2021

work page 2021

[23] [23]

Short-term passenger flow prediction in urban public transport: Kalman filtering combined k-nearest neighbor approach.Ieee Access, 7:120937–120949, 2019

Shidong Liang, Minghui Ma, Shengxue He, and Hu Zhang. Short-term passenger flow prediction in urban public transport: Kalman filtering combined k-nearest neighbor approach.Ieee Access, 7:120937–120949, 2019

work page 2019

[24] [24]

Passenger flow prediction using smart card data from connected bus system based on interpretable xgboost.Wireless Communications and Mobile Computing, 2022(1):5872225, 2022

Liang Zou, Sisi Shu, Xiang Lin, Kaisheng Lin, Jiasong Zhu, and Linchao Li. Passenger flow prediction using smart card data from connected bus system based on interpretable xgboost.Wireless Communications and Mobile Computing, 2022(1):5872225, 2022

work page 2022

[25] [25]

Designing on- board explainable passenger flow prediction.Engineering Applications of Artificial Intelligence, 139:109648, 2025

Mario Barbareschi, Antonio Emmanuele, Nicola Maz- zocca, and Franca Rocco di Torrepadula. Designing on- board explainable passenger flow prediction.Engineering Applications of Artificial Intelligence, 139:109648, 2025

work page 2025

[26] [26]

Bus ridership and its determinants in beijing: A spatial econometric perspective.Transportation, 50(2): 383–406, 2023

Jiaoe Wang, Yanan Li, Jingjuan Jiao, Haitao Jin, and Fangye Du. Bus ridership and its determinants in beijing: A spatial econometric perspective.Transportation, 50(2): 383–406, 2023

work page 2023

[27] [27]

An adapted geographically weighted lasso (ada-gwl) model for pre- dicting subway ridership.Transportation, 48(3):1185– 1216, 2021

Yuxin He, Yang Zhao, and Kwok Leung Tsui. An adapted geographically weighted lasso (ada-gwl) model for pre- dicting subway ridership.Transportation, 48(3):1185– 1216, 2021

work page 2021

[28] [28]

‘centrality measures’ as a tool to identify the transit demand at public transit stops; a case of ahmedabad city, india.International Journal, 2 (7):1063–1074, 2014

TalatMunshi AmilaJayasinghe. ‘centrality measures’ as a tool to identify the transit demand at public transit stops; a case of ahmedabad city, india.International Journal, 2 (7):1063–1074, 2014

work page 2014

[29] [29]

Exploring the nonlinear effects of built environment on bus-transfer ridership: take shanghai as an example.Ap- plied Sciences, 12(11):5755, 2022

Ding Liu, Wuyue Rong, Jin Zhang, and Ying-En Ge. Exploring the nonlinear effects of built environment on bus-transfer ridership: take shanghai as an example.Ap- plied Sciences, 12(11):5755, 2022

work page 2022

[30] [30]

Exploring the association be- tween network centralities and passenger flows in metro systems.Applied Network Science, 8(1):69, 2023

Athanasios Kopsidas, Aristeides Douvaras, and Kon- stantinos Kepaptsoglou. Exploring the association be- tween network centralities and passenger flows in metro systems.Applied Network Science, 8(1):69, 2023

work page 2023

[31] [31]

Predicting bus ridership based on the weather conditions using deep learning algorithms.Trans- portation Research Interdisciplinary Perspectives, 19: 100833, 2023

Zakir H Farahmand, Konstantinos Gkiotsalitis, and Karst T Geurs. Predicting bus ridership based on the weather conditions using deep learning algorithms.Trans- portation Research Interdisciplinary Perspectives, 19: 100833, 2023

work page 2023

[32] [32]

Artificial neural networks for fore- casting passenger flows on metro lines.Sensors, 19(15): 3424, 2019

Mariano Gallo, Giuseppina De Luca, Luca D’Acierno, and Marilisa Botte. Artificial neural networks for fore- casting passenger flows on metro lines.Sensors, 19(15): 3424, 2019

work page 2019

[33] [33]

Learning to forget: Continual prediction with lstm.Neu- ral computation, 12(10):2451–2471, 2000

Felix A Gers, Jürgen Schmidhuber, and Fred Cummins. Learning to forget: Continual prediction with lstm.Neu- ral computation, 12(10):2451–2471, 2000

work page 2000

[34] [34]

Deep learning based lstm model for predicting the number of passengers for public transport bus operators.Jurnal Online Informatika, 9(1):18–28, 2024

Joko Siswanto, Danny Manongga, Irwan Sembiring, and Sutarto Wijono. Deep learning based lstm model for predicting the number of passengers for public transport bus operators.Jurnal Online Informatika, 9(1):18–28, 2024

work page 2024

[35] [35]

Fore- casting the short-term metro ridership with seasonal and trend decomposition using loess and lstm neural networks

Dewang Chen, Jianhua Zhang, and Shixiong Jiang. Fore- casting the short-term metro ridership with seasonal and trend decomposition using loess and lstm neural networks. Ieee Access, 8:91181–91187, 2020

work page 2020

[36] [36]

Deeppf: A deep learning based architecture for metro passenger flow pre- diction.Transportation Research Part C: Emerging Tech- nologies, 101:18–34, 2019

Yang Liu, Zhiyuan Liu, and Ruo Jia. Deeppf: A deep learning based architecture for metro passenger flow pre- diction.Transportation Research Part C: Emerging Tech- nologies, 101:18–34, 2019

work page 2019

[37] [37]

Ai-based neural network models for bus passenger demand forecasting using smart card data

Sohani Liyanage, Rusul Abduljabbar, Hussein Dia, and Pei-Wei Tsai. Ai-based neural network models for bus passenger demand forecasting using smart card data. Journal of Urban Management, 11(3):365–380, 2022

work page 2022

[38] [38]

Short-term bus passenger flow forecast based on cnn-bilstm.Advances in Engineer- ing Technology Research, 5(1):448–448, 2023

Chaohua Wu and Xingzu Qi. Short-term bus passenger flow forecast based on cnn-bilstm.Advances in Engineer- ing Technology Research, 5(1):448–448, 2023

work page 2023

[39] [39]

Comparative analysis of deep-learning-based models for hourly bus passenger flow forecasting.Transportation, 51(5):1759–1784, 2024

Yu Zhang, Xiaodan Wang, Jingjing Xie, and Yun Bai. Comparative analysis of deep-learning-based models for hourly bus passenger flow forecasting.Transportation, 51(5):1759–1784, 2024

work page 2024

[40] [40]

Transparency and the black box problem: Why we do not trust ai.Philosophy & Technology, 34(4):1607–1622, 2021

Warren J V on Eschenbach. Transparency and the black box problem: Why we do not trust ai.Philosophy & Technology, 34(4):1607–1622, 2021. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 21/34

work page 2021

[41] [41]

Deep learning xai for bus passenger forecasting: A use case in spain.Mathematics, 10(9):1428, 2022

Leticia Monje, Ramón A Carrasco, Carlos Rosado, and Manuel Sánchez-Montañés. Deep learning xai for bus passenger forecasting: A use case in spain.Mathematics, 10(9):1428, 2022

work page 2022

[42] [42]

A novel passenger flow prediction model using deep learning methods.Trans- portation Research Part C: Emerging Technologies, 84: 74–91, 2017

Lijuan Liu and Rung-Ching Chen. A novel passenger flow prediction model using deep learning methods.Trans- portation Research Part C: Emerging Technologies, 84: 74–91, 2017

work page 2017

[43] [43]

Prediction of public bus passenger flow using spatial–temporal hybrid model of deep learning

Tao Chen, Jie Fang, Mengyun Xu, Yingfang Tong, and Wentian Chen. Prediction of public bus passenger flow using spatial–temporal hybrid model of deep learning. Journal of Transportation Engineering, Part A: Systems, 148(4):04022007, 2022

work page 2022

[44] [44]

Short-term abnormal passenger flow pre- diction based on the fusion of svr and lstm.Ieee Access, 7:42946–42955, 2019

Jianyuan Guo, Zhen Xie, Yong Qin, Limin Jia, and Yaguan Wang. Short-term abnormal passenger flow pre- diction based on the fusion of svr and lstm.Ieee Access, 7:42946–42955, 2019

work page 2019

[45] [45]

Short-term passenger flow prediction for multi-traffic modes: A transformer and residual network based multi-task learning method.In- formation Sciences, 642:119144, 2023

Yongjie Yang, Jinlei Zhang, Lixing Yang, Yang Yang, Xiaohong Li, and Ziyou Gao. Short-term passenger flow prediction for multi-traffic modes: A transformer and residual network based multi-task learning method.In- formation Sciences, 642:119144, 2023

work page 2023

[46] [46]

Hy- brid hidden markov lstm for short-term traffic flow pre- diction.arXiv preprint arXiv:2307.04954, 2023

Agnimitra Sengupta, Adway Das, and S Ilgin Guler. Hy- brid hidden markov lstm for short-term traffic flow pre- diction.arXiv preprint arXiv:2307.04954, 2023

work page arXiv 2023

[47] [47]

A novel cnn-gru-lstm based deep learning model for accurate traffic prediction.Discover Comput- ing, 28(1):38, 2025

Vandana Singh, Sudip Kumar Sahana, and Vandana Bhat- tacharjee. A novel cnn-gru-lstm based deep learning model for accurate traffic prediction.Discover Comput- ing, 28(1):38, 2025

work page 2025

[48] [48]

Short-term trafficflow- forecasting modelbasedonga-tcn.JournalofAdvanced- Transportation, 1338607:13, 2021

SUNF Zhangrj, W SONGZ, et al. Short-term trafficflow- forecasting modelbasedonga-tcn.JournalofAdvanced- Transportation, 1338607:13, 2021

work page 2021

[49] [49]

Passenger flow prediction of scenic spot using a gcn–rnn model.Sustainability, 14(6):3295, 2022

Zhijie Xu, Liyan Hou, Yueying Zhang, and Jianqin Zhang. Passenger flow prediction of scenic spot using a gcn–rnn model.Sustainability, 14(6):3295, 2022

work page 2022

[50] [50]

Lightgbm-based model for metro passenger volume fore- casting.IET Intelligent Transport Systems, 14(13):1815– 1823, 2020

Youyang Zhang, Changfeng Zhu, and Qingrong Wang. Lightgbm-based model for metro passenger volume fore- casting.IET Intelligent Transport Systems, 14(13):1815– 1823, 2020

work page 2020

[51] [51]

A novel wavelet- svm short-time passenger flow prediction in beijing sub- way system.Neurocomputing, 166:109–121, 2015

Yuxing Sun, Biao Leng, and Wei Guan. A novel wavelet- svm short-time passenger flow prediction in beijing sub- way system.Neurocomputing, 166:109–121, 2015

work page 2015

[52] [52]

Clustering and forecast- ing urban bus passenger demand with a combination of time series models.Mathematics, 10(15):2670, 2022

Irene Mariñas-Collado, Ana E Sipols, M Teresa Santos- Martín, and Elisa Frutos-Bernal. Clustering and forecast- ing urban bus passenger demand with a combination of time series models.Mathematics, 10(15):2670, 2022

work page 2022

[53] [53]

Relative neighborhood graphs and their relatives.Proceedings of the IEEE, 80(9):1502–1517, 2002

Jerzy W Jaromczyk and Godfried T Toussaint. Relative neighborhood graphs and their relatives.Proceedings of the IEEE, 80(9):1502–1517, 2002

work page 2002

[54] [54]

A., Hekker, S., Stello, D., Guti ´errez-Soto, J., Handberg, R., Huber, D., et al

David W. Matula and Robert R. Sokal. Properties of gabriel graphs relevant to geographic variation research and the clustering of points in the plane.Geograph- ical Analysis, 12(3):205–222, 1980. doi: 10.1111/j. 1538-4632.1980.tb00031.x

work page doi:10.1111/j 1980

[55] [55]

Degree centrality, between- ness centrality, and closeness centrality in social network

Junlong Zhang and Yu Luo. Degree centrality, between- ness centrality, and closeness centrality in social network. In2017 2nd international conference on modelling, sim- ulation and applied mathematics (MSAM2017), pages 300–303. Atlantis press, 2017

work page 2017

[56] [56]

Some unique properties of eigenvector centrality.Social networks, 29(4):555–564, 2007

Phillip Bonacich. Some unique properties of eigenvector centrality.Social networks, 29(4):555–564, 2007

work page 2007

[57] [57]

The interaction of size and density with graph-level indices.Social networks, 21(3):239–267, 1999

Brigham S Anderson, Carter Butts, and Kathleen Car- ley. The interaction of size and density with graph-level indices.Social networks, 21(3):239–267, 1999

work page 1999

[58] [58]

Collective dy- namics of ‘small-world’networks.nature, 393(6684): 440–442, 1998

Duncan J Watts and Steven H Strogatz. Collective dy- namics of ‘small-world’networks.nature, 393(6684): 440–442, 1998

work page 1998

[59] [59]

A dendrite method for cluster analysis.Communications in Statistics-theory and Methods, 3(1):1–27, 1974

Tadeusz Cali´nski and Jerzy Harabasz. A dendrite method for cluster analysis.Communications in Statistics-theory and Methods, 3(1):1–27, 1974

work page 1974

[60] [60]

Interactions between bus, metro, and taxi use before and after the chinese spring festival.ISPRS International Journal of Geo-Information, 8(10):445, 2019

Jianwei Huang, Xintao Liu, Pengxiang Zhao, Junwei Zhang, and Mei-Po Kwan. Interactions between bus, metro, and taxi use before and after the chinese spring festival.ISPRS International Journal of Geo-Information, 8(10):445, 2019

work page 2019

[61] [61]

Using fuzzy clustering of user perception to determine the number of level-of-service categories for bus rapid transit.Journal of Public Transportation, 24:100017, 2022

Yueying Huo, Jinhua Zhao, Xiaojuan Li, and Chen Guo. Using fuzzy clustering of user perception to determine the number of level-of-service categories for bus rapid transit.Journal of Public Transportation, 24:100017, 2022

work page 2022

[62] [62]

Lightgbm: A highly efficient gradient boosting decision tree.Advances in neural information processing systems, 30, 2017

Guolin Ke, Qi Meng, Thomas Finley, Taifeng Wang, Wei Chen, Weidong Ma, Qiwei Ye, and Tie-Yan Liu. Lightgbm: A highly efficient gradient boosting decision tree.Advances in neural information processing systems, 30, 2017

work page 2017

[63] [63]

Random forests.Machine learning, 45: 5–32, 2001

Leo Breiman. Random forests.Machine learning, 45: 5–32, 2001

work page 2001

[64] [64]

Xgboost: A scalable tree boosting system

Tianqi Chen and Carlos Guestrin. Xgboost: A scalable tree boosting system. InProceedings of the 22nd acm sigkdd international conference on knowledge discovery and data mining, pages 785–794, 2016

work page 2016

[65] [65]

Catboost: unbiased boosting with categorical features

Liudmila Prokhorenkova, Gleb Gusev, Aleksandr V orobev, Anna Veronika Dorogush, and Andrey Gulin. Catboost: unbiased boosting with categorical features. Advances in neural information processing systems, 31, 2018. Comparative Analysis of Polygon-Based and Global Machine Learning Models for Bus Occupancy Prediction — 22/34

work page 2018

[66] [66]

Forecasting of short-term metro ridership with support vector machine online model.Journal of Ad- vanced Transportation, 2018(1):3189238, 2018

Xuemei Wang, Ning Zhang, Yunlong Zhang, and Zhuang- bin Shi. Forecasting of short-term metro ridership with support vector machine online model.Journal of Ad- vanced Transportation, 2018(1):3189238, 2018

work page 2018

[67] [67]

Statistical comparisons of classifiers over multiple data sets.Journal of Machine learning research, 7(Jan):1–30, 2006

Janez Demšar. Statistical comparisons of classifiers over multiple data sets.Journal of Machine learning research, 7(Jan):1–30, 2006

work page 2006

[68] [68]

New evidence on walking distances to transit stops: Identifying redun- dancies and gaps using variable service areas.Transporta- tion, 41(1):193–210, 2014

Ahmed El-Geneidy, Michael Grimsrud, Rania Wasfi, Paul Tétreault, and Julien Surprenant-Legault. New evidence on walking distances to transit stops: Identifying redun- dancies and gaps using variable service areas.Transporta- tion, 41(1):193–210, 2014

work page 2014

[69] [69]

Explaining walking distance to public transport: The dominance of public transport supply.Journal of Transport and Land Use, 6 (2):5–20, 2013

Rhonda Daniels and Corinne Mulley. Explaining walking distance to public transport: The dominance of public transport supply.Journal of Transport and Land Use, 6 (2):5–20, 2013

work page 2013

[70] [70]

Yinghui Wu, Jiafu Tang, Yang Yu, and Zhendong Pan. A stochastic optimization model for transit network timetable design to mitigate the randomness of travel- ing time by adding slack time.Transportation Research Part C: Emerging Technologies, 52:15–31, 2015

work page 2015

[71] [71]

Stochastic bus schedule coordination considering demand assignment and rerouting of passengers.Transportation Research Part B: Methodological, 121:275–303, 2019

Weitiao Wu, Ronghui Liu, Wenzhou Jin, and Changxi Ma. Stochastic bus schedule coordination considering demand assignment and rerouting of passengers.Transportation Research Part B: Methodological, 121:275–303, 2019

work page 2019

[72] [72]

A matheuristic for transfer synchronization through integrated timetabling and vehi- cle scheduling.Transportation Research Part B: Method- ological, 109:128–149, 2018

João Paiva Fonseca, Evelien van der Hurk, Roberto Roberti, and Allan Larsen. A matheuristic for transfer synchronization through integrated timetabling and vehi- cle scheduling.Transportation Research Part B: Method- ological, 109:128–149, 2018

work page 2018

[73] [73]

Single bus line timetable optimization with big data: A case study in beijing.Information Sciences, 536:53–66, 2020

Hongguang Ma, Xiang Li, and Haitao Yu. Single bus line timetable optimization with big data: A case study in beijing.Information Sciences, 536:53–66, 2020

work page 2020

[74] [74]

Research on bus scheduling optimization considering exhaust emission based on genetic algorithm: Taking a route in nanjing city as an example.Applied Sciences, 14(10):4126, 2024

Meixia Wang, Baohua Guo, Zhezhe Zhang, and Yan- shuang Zhang. Research on bus scheduling optimization considering exhaust emission based on genetic algorithm: Taking a route in nanjing city as an example.Applied Sciences, 14(10):4126, 2024

work page 2024

[75] [75]

Robust optimization model of schedule design for a fixed bus route.Transportation Research Part C: Emerging Technologies, 25:113–121, 2012

Yadan Yan, Qiang Meng, Shuaian Wang, and Xiucheng Guo. Robust optimization model of schedule design for a fixed bus route.Transportation Research Part C: Emerging Technologies, 25:113–121, 2012

work page 2012

[76] [76]

Estimating the robustness of public transport schedules using machine learning

Matthias Müller-Hannemann, Ralf Rückert, Alexander Schiewe, and Anita Schöbel. Estimating the robustness of public transport schedules using machine learning. Transportation Research Part C: Emerging Technologies, 137:103566, 2022

work page 2022

[77] [77]

Validation of automatic passenger counting: introducing the t-test- induced equivalence test.Transportation, 47(6):3031– 3045, 2020

Michael Siebert and David Ellenberger. Validation of automatic passenger counting: introducing the t-test- induced equivalence test.Transportation, 47(6):3031– 3045, 2020

work page 2020

[78] [78]

Cross-checking automated passenger counts for ridership analysis.Journal of Public Transportation, 24:100008, 2022

Simon J Berrebi, Sanskruti Joshi, and Kari E Watkins. Cross-checking automated passenger counts for ridership analysis.Journal of Public Transportation, 24:100008, 2022. Appendices

work page 2022

[79] [79]

This underscores the need for improved data collection and validation methods in subsequent research

APC data limitation Although cleaning and filtering procedures were implemented to mitigate these issues, the persistence of such errors reduces the reliability of the dataset and could compromise the accu- racy of the results. This underscores the need for improved data collection and validation methods in subsequent research. Figure S2 shows the distrib...

work page

[80] [80]

Global MAE Differences Across 14 Matched Test Sets Note: Negative differences indicate lower polygon MAE

Supplementary Figures and Tables Table S4.Wilcoxon Signed-Rank Test Results: Polygon vs. Global MAE Differences Across 14 Matched Test Sets Note: Negative differences indicate lower polygon MAE. Statistically significantp-values (<0.05) are bolded. Cliff’s Delta (δ) indicates the effect size. Model Mean Diff (MAE) Median Diff (MAE) Wilcoxon Stat p- Value ...

work page