A comparison of apartment rent price prediction using a large dataset: Kriging versus DNN
Pith reviewed 2026-05-25 15:50 UTC · model grok-4.3
The pith
With large datasets, deep neural networks match the accuracy of scalable Kriging for apartment rent prediction.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Our analysis showed that, with an increase in sample size, the out-of-sample predictive accuracy of DNN approached that of NNGP and they were nearly equal on the order of n = 10^6. Furthermore, it is suggested that, for both higher and lower end properties whose rent price deviates from the median, DNN may have a higher predictive accuracy than that of NNGP.
What carries the argument
Nearest neighbor Gaussian processes (NNGP), a scalable approximation of Kriging that incorporates spatial dependence, versus deep neural networks (DNN) trained on the same hedonic rent data at varying sample sizes.
Load-bearing premise
Both models receive comparable optimization and the input features permit a fair test of spatial versus non-spatial methods, even though exact variable lists and training details are not specified.
What would settle it
A replication on another large Japanese or comparable real-estate dataset in which DNN out-of-sample error stays materially higher than NNGP error at one million observations, or in which DNN loses its reported advantage on extreme rent values.
Figures
read the original abstract
The hedonic approach based on a regression model has been widely adopted for the prediction of real estate property price and rent. In particular, a spatial regression technique called Kriging, a method of interpolation that was advanced in the field of spatial statistics, are known to enable high accuracy prediction in light of the spatial dependence of real estate property data. Meanwhile, there has been a rapid increase in machine learning-based prediction using a large (big) dataset and its effectiveness has been demonstrated in previous studies. However, no studies have ever shown the extent to which predictive accuracy differs for Kriging and machine learning techniques using big data. Thus, this study compares the predictive accuracy of apartment rent price in Japan between the nearest neighbor Gaussian processes (NNGP) model, which enables application of Kriging to big data, and the deep neural network (DNN), a representative machine learning technique, with a particular focus on the data sample size (n = 10^4, 10^5, 10^6) and differences in predictive performance. Our analysis showed that, with an increase in sample size, the out-of-sample predictive accuracy of DNN approached that of NNGP and they were nearly equal on the order of n = 10^6. Furthermore, it is suggested that, for both higher and lower end properties whose rent price deviates from the median, DNN may have a higher predictive accuracy than that of NNGP.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that the out-of-sample predictive accuracy of a deep neural network (DNN) for apartment rent prices approaches that of the nearest neighbor Gaussian process (NNGP) model as the sample size increases to n = 10^6, becoming nearly equal at that scale, and that the DNN may outperform NNGP for properties with rent prices at the higher and lower ends of the distribution.
Significance. If substantiated by a controlled experimental design, the finding would contribute empirical evidence on the scaling behavior of spatial statistical models versus deep learning methods for large spatial datasets in real estate valuation. The emphasis on sample-size dependence and differential performance in the tails of the response distribution adds practical value beyond a simple head-to-head comparison.
major comments (3)
- [Abstract] Abstract: The abstract provides no information on the predictors supplied to the DNN. In particular, it is not stated whether latitude/longitude or other spatial covariates were included. This detail is essential because the NNGP model is constructed to capture spatial dependence through the covariance function on coordinates; without equivalent spatial information the reported convergence cannot be attributed to model capacity rather than information disparity.
- [Methods] Experimental setup (Methods): The manuscript does not describe the DNN architecture, the procedure for hyperparameter tuning, the validation strategy, or the exact error metric(s) used to quantify predictive accuracy. These omissions prevent assessment of whether the two models received comparable optimization effort and whether the comparison is robust to implementation choices.
- [Results] Results: The suggestion that DNN may have higher accuracy for tail properties is not accompanied by a quantitative breakdown (e.g., performance stratified by rent quantiles) or statistical tests; the claim therefore rests on an unshown analysis and requires supporting tables or figures to be load-bearing.
minor comments (1)
- [Abstract] Abstract: The term “Kriging” is used for NNGP without noting that NNGP is an approximation designed for large n; a brief clarification would improve readability.
Simulated Author's Rebuttal
We thank the referee for their thoughtful and constructive comments, which highlight important areas for improving the clarity and rigor of our manuscript. We address each major comment below and have prepared revisions to strengthen the paper where appropriate.
read point-by-point responses
-
Referee: [Abstract] Abstract: The abstract provides no information on the predictors supplied to the DNN. In particular, it is not stated whether latitude/longitude or other spatial covariates were included. This detail is essential because the NNGP model is constructed to capture spatial dependence through the covariance function on coordinates; without equivalent spatial information the reported convergence cannot be attributed to model capacity rather than information disparity.
Authors: We agree that the abstract should explicitly mention the input features. Both the NNGP and DNN models were trained on identical predictors, which included latitude, longitude, and additional property attributes such as size, age, and location descriptors. This ensures the comparison reflects differences in modeling approach rather than input information. We will revise the abstract to state that the DNN received the same spatial and non-spatial covariates as the NNGP model. revision: yes
-
Referee: [Methods] Experimental setup (Methods): The manuscript does not describe the DNN architecture, the procedure for hyperparameter tuning, the validation strategy, or the exact error metric(s) used to quantify predictive accuracy. These omissions prevent assessment of whether the two models received comparable optimization effort and whether the comparison is robust to implementation choices.
Authors: The original manuscript provides a basic description of the DNN (number of hidden layers and nodes) but lacks the full details requested. We will expand the Methods section to specify the exact architecture (layer sizes, activation functions), hyperparameter search procedure (grid or random search ranges), validation approach (e.g., random hold-out or spatial cross-validation), and the primary error metrics (RMSE and MAE) used for all reported comparisons. This will allow readers to evaluate the fairness of the experimental design. revision: yes
-
Referee: [Results] Results: The suggestion that DNN may have higher accuracy for tail properties is not accompanied by a quantitative breakdown (e.g., performance stratified by rent quantiles) or statistical tests; the claim therefore rests on an unshown analysis and requires supporting tables or figures to be load-bearing.
Authors: The tail-performance observation originated from an internal stratified error analysis that was not presented in the main text. We agree this claim requires explicit supporting evidence to be credible. We will add a new table (or supplementary figure) reporting RMSE and MAE stratified by rent-price quantiles (e.g., <10th, 10-25th, 75-90th, >90th percentiles) for both models at each sample size, along with a brief note on any observed differences. If the underlying per-quantile results are statistically noisy, we will qualify the statement accordingly. revision: yes
Circularity Check
No circularity: purely empirical held-out comparison with no derivations
full rationale
The manuscript contains no mathematical derivations, uniqueness theorems, fitted-parameter predictions, or self-citation chains that reduce any claim to its own inputs. All reported results are direct out-of-sample RMSE/MAE comparisons on random train/test splits of the rent dataset at three sample sizes; the central claim (DNN accuracy converging to NNGP at n=10^6) is therefore an observable empirical fact rather than a constructed identity. No equations are presented that equate a model output to a fitted input by definition, and external benchmarks (held-out data) remain independent of any internal fitting step.
Axiom & Free-Parameter Ledger
free parameters (1)
- NNGP and DNN hyperparameters
axioms (1)
- domain assumption Real estate rent prices exhibit spatial autocorrelation that can be modeled via Gaussian processes
Reference graph
Works this paper leans on
-
[1]
Efficient algorithms for Bayesian Nearest Neighbor Gaussian Processes
Abidoye, R.B. and Chan, A.P. (2017) Artificia l neural network in property valuation: application framework and research trend, Property Management, 35(5), 554–571. Abidoye, R.B. and Chan, A.P. (2018) Improving property valuation accuracy: a comparison of hedonic pricing model and artificial neural network, Pacific Rim Property Research Journal, 24 (1...
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[2]
Raju, M.M., Srivastava, R.K., Bisht, D., Shar ma, H.C. and Kumar, A. (2011) Development of artificial neural-network-based models for the simulation of spring discharge, Advances in Artificial Intelligence, 2011, Article ID 686258, online. Rosen, S. (1974) Hedonic prices and implic it markets: Product differentiation in pure competition, Journal of Po...
work page 2011
-
[3]
Variable name Coef. t value Constant term 10.81 4505 Years built -0.001155 -444 Walk time to nearest station -0.00004840 -98.3 Floor-area ratio 0.001294 228 Number of rooms 0.1486 257 Direction_Northeast 0.08202 28.3 Direction_East -0.006518 -3.41 Direction_Southeast 0.0008989 0.454 Direction_South -0.02640 -14.7 Direction_Southwest 0.001473 0.740 Directi...
work page 2078
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.