Improved identification of breakpoints in piecewise regression and its applications

Hayoung Choi; Hyungu Lee; Myungjin Kim; Taehyeong Kim

arxiv: 2408.13751 · v3 · submitted 2024-08-25 · 📊 stat.ML · cs.LG· math.OC

Improved identification of breakpoints in piecewise regression and its applications

Taehyeong Kim , Hyungu Lee , Myungjin Kim , Hayoung Choi This is my paper

Pith reviewed 2026-05-23 22:19 UTC · model grok-4.3

classification 📊 stat.ML cs.LGmath.OC

keywords piecewise regressionbreakpointsgreedy algorithmchange point detectionpolynomial regressionoptimizationdata fittingmodel interpretability

0 comments

The pith

A greedy neighborhood exploration algorithm locates optimal breakpoints in piecewise polynomial regression more accurately than existing methods.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents new greedy algorithms for identifying breakpoints where the data switches between different polynomial regimes. These algorithms repeatedly check points near each current breakpoint and move it to the spot that most reduces the overall fitting error. The approach converges quickly, stays stable across runs, and can automatically select how many breakpoints to use. Experiments on both synthetic data with known structure and real datasets show higher accuracy than previous techniques. The identified breakpoints make the fitted models easier to interpret and reveal useful structure in the data.

Core claim

The proposed greedy algorithm updates breakpoints by exploring their local neighborhoods to minimize the regression error, achieving fast convergence and stability while also determining the optimal number of breakpoints, outperforming existing methods on synthetic and real-world data.

What carries the argument

Greedy local neighborhood search that iteratively updates each breakpoint position to reduce total piecewise polynomial error.

If this is right

Piecewise models fitted with these breakpoints achieve lower error on data containing abrupt changes.
The number of segments can be chosen automatically rather than by manual search or cross-validation alone.
Breakpoints recovered this way yield more stable and interpretable segment boundaries in applications.
The same procedure applies directly to both polynomial and other basis expansions in regression.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The local-search idea might transfer to change-point problems in time-series classification or survival models.
Combining the method with global techniques such as dynamic programming could handle very high-dimensional inputs.
Real-time monitoring systems could use the fast convergence to update breakpoint locations as new observations arrive.

Load-bearing premise

Repeated small local moves around each breakpoint will reliably reach the global best locations and the correct total number without becoming trapped in worse configurations.

What would settle it

A synthetic dataset whose true breakpoints are known in advance, on which the algorithm returns a different set whose total squared error is higher than the known optimum.

Figures

Figures reproduced from arXiv: 2408.13751 by Hayoung Choi, Hyungu Lee, Myungjin Kim, Taehyeong Kim.

**Figure 1.** Figure 1: Update breakpoints among ξ − j , ξ j , ξ+ j . where pj : Ij → R is a dth degree polynomial for all j = 1, . . . , k, given by pj(x) = θ0 j + θ1 jx + θ2 jx 2 + · · · + θd jx d . Since this optimization problem is nonconvex [17], it is not easy to solve the problem. Due to the number of too many breakpoint candidates, numerical algorithms may need an extensive computational cost. To overcome this situation,… view at source ↗

**Figure 2.** Figure 2: Comparing MSE as the number of breakpoints is reduced from 7 to 4. [PITH_FULL_IMAGE:figures/full_fig_p006_2.png] view at source ↗

**Figure 3.** Figure 3: Algorithm 4 can avoid local minimum well. [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Result of each regression models. 9 [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗

**Figure 5.** Figure 5: Comparison of ℓ1 trend filter, APLR, PELT and proposed method for S&P 500 data. 11 [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗

**Figure 6.** Figure 6: Comparison of ℓ1 trend filter, APLR, PELT and proposed method for COVID-19 cases dataset. 12 [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗

read the original abstract

Identifying breakpoints in piecewise regression is critical in enhancing the reliability and interpretability of data fitting. In this paper, we propose novel algorithms based on the greedy algorithm to accurately and efficiently identify breakpoints in piecewise polynomial regression. The algorithm updates the breakpoints to minimize the error by exploring the neighborhood of each breakpoint. It has a fast convergence rate and stability to find optimal breakpoints. Moreover, it can determine the optimal number of breakpoints. The computational results for real and synthetic data show that its accuracy is better than any existing methods. The real-world datasets demonstrate that breakpoints through the proposed algorithm provide valuable data information.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

A neighborhood-greedy local search for piecewise breakpoints that reports good empirical accuracy but carries the usual risk of missing global optima.

read the letter

The paper describes a greedy procedure for locating breakpoints in piecewise polynomial regression. It initializes some breakpoints and then repeatedly scans the immediate neighborhood of each one, shifting the breakpoint to the position that most reduces the total squared error. The method also includes a rule for choosing the number of breakpoints. Tests on synthetic data and a few real datasets show faster convergence and lower error than the baselines the authors tried. The real-data examples illustrate how the detected breakpoints can highlight meaningful changes in the underlying process. That practical demonstration is the clearest positive. The neighborhood update is a straightforward extension of standard local-search ideas for change-point problems, and the automatic count selection removes one tuning step that users often face. The experiments appear to include direct comparisons, which is the right direction for an applied methods paper. The main limitation is that the search remains strictly local. Nothing in the description prevents the procedure from settling into a configuration that is good only relative to its immediate neighbors but far from the global minimum, especially when both locations and count are being optimized together. Breakpoint placement is known to have combinatorial structure with multiple local minima, so the performance edge rests entirely on whether the reported runs happened to avoid those traps. The claim of being more accurate than any existing method is therefore only as strong as the breadth of the baselines and the number of replications. Readers working on interpretability in regression or change-point detection in applied statistics would get the most from the empirical section. The work is incremental rather than foundational, but the experiments are concrete enough that a referee could usefully check the implementation, the choice of baselines, and the stability across random starts. It should go to peer review.

Referee Report

2 major / 0 minor

Summary. The paper proposes novel greedy algorithms for breakpoint identification in piecewise polynomial regression. Breakpoints are updated by exploring the local neighborhood of each to minimize fitting error; the method is claimed to have fast convergence and stability, to automatically determine the optimal number of breakpoints, and to achieve higher accuracy than existing methods on both synthetic and real data, with the resulting breakpoints providing useful information on real-world datasets.

Significance. If the empirical claims are substantiated with reproducible experiments and the local-search procedure is shown to be reliable, the work could supply a practical, automatic tool for piecewise regression that improves interpretability without manual tuning of breakpoint count. The emphasis on real-data applications is a positive aspect.

major comments (2)

[Abstract] Abstract: the central claims (fast convergence to optimal breakpoints, automatic determination of the optimal number, and accuracy superior to all existing methods) rest entirely on computational experiments whose methodology, baselines, error bars, and data details are not supplied, so the assertions cannot be checked against the evidence.
[Abstract] Abstract: the algorithm is described as a greedy procedure that repeatedly updates each breakpoint by exploring its immediate neighborhood; no argument is given that this local operator is sufficient to escape the combinatorial local minima known to exist in breakpoint placement, leaving the headline optimality claims without theoretical support.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major comment point by point below and indicate the revisions we will make.

read point-by-point responses

Referee: [Abstract] Abstract: the central claims (fast convergence to optimal breakpoints, automatic determination of the optimal number, and accuracy superior to all existing methods) rest entirely on computational experiments whose methodology, baselines, error bars, and data details are not supplied, so the assertions cannot be checked against the evidence.

Authors: The experimental methodology, baselines, error bars, and data details are described in Section 4 (Experimental Results) and the appendix of the full manuscript. The abstract is intended as a high-level summary. We will revise the abstract to add a sentence directing readers to Section 4 for the supporting experimental details, methodology, and data descriptions. revision: yes
Referee: [Abstract] Abstract: the algorithm is described as a greedy procedure that repeatedly updates each breakpoint by exploring its immediate neighborhood; no argument is given that this local operator is sufficient to escape the combinatorial local minima known to exist in breakpoint placement, leaving the headline optimality claims without theoretical support.

Authors: We agree that the manuscript provides no theoretical argument or proof that the local neighborhood search escapes combinatorial local minima. The claims of finding optimal breakpoints are supported solely by empirical results on synthetic data (where ground truth is known) and real data (where the method outperforms baselines). We will revise the abstract and introduction to replace references to 'optimal breakpoints' with 'breakpoints that minimize fitting error in practice' and add a paragraph in the discussion section acknowledging the lack of theoretical guarantees. revision: yes

Circularity Check

0 steps flagged

No circularity; empirical validation of greedy algorithm

full rationale

The paper introduces a greedy neighborhood-exploration algorithm for breakpoint detection in piecewise polynomial regression and supports its claims of convergence, stability, and superior accuracy solely through computational experiments on synthetic and real datasets. No derivation chain exists that reduces a claimed result to its own inputs by definition, no fitted parameters are relabeled as predictions, and no load-bearing self-citations or uniqueness theorems are invoked. The central assertions rest on external empirical benchmarks rather than any self-referential construction.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review performed on abstract only; no explicit free parameters, axioms, or invented entities are stated in the provided text.

pith-pipeline@v0.9.0 · 5631 in / 1153 out tokens · 25395 ms · 2026-05-23T22:19:04.337088+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages

[1]

J. Bai. Estimation of a change point in multiple regression models.Review of Economics and Statistics , 79(4):551–563, 1997

work page 1997
[2]

Bai and P

J. Bai and P. Perron. Estimating and testing linear models with multiple structural changes. Econometrica, pages 47–78, 1998

work page 1998
[3]

Bai and P

J. Bai and P. Perron. Computation and analysis of multiple structural change models. Journal of applied econometrics , 18(1):1–22, 2003

work page 2003
[4]

Boyd and L

S. Boyd and L. Vandenberghe. Introduction to applied linear algebra: vectors, matrices, and least squares . Cambridge university press, 2018

work page 2018
[5]

L. Breiman. Random forests. Machine learning, 45:5–32, 2001

work page 2001
[6]

Chiou, Y .-T

J.-M. Chiou, Y .-T. Chen, and T. Hsing. Identifying multiple changes for a functional data sequence with application to freeway traffic segmentation. The Annals of Applied Statistics , 13(3):1430–1463, 2019

work page 2019
[7]

Dell’Accio and F

F. Dell’Accio and F. Nudo. Polynomial approximation of derivatives through a regression–interpolation method. Applied Mathematics Letters, 152:109010, 2024

work page 2024
[8]

Dell’Accio, F

F. Dell’Accio, F. Di Tommaso, and F. Nudo. Generalizations of the con- strained mock-chebyshev least squares in two variables: Tensor product vs total degree polynomial interpolation. Applied Mathematics Letters , 125:107732, 2022

work page 2022
[9]

Y . Ding. Data science for wind energy . Chapman and Hall/CRC, 2019

work page 2019
[10]

Fearnhead and Z

P. Fearnhead and Z. Liu. On-line inference for multiple changepoint prob- lems. Journal of the Royal Statistical Society Series B: Statistical Method- ology, 69(4):589–605, 2007

work page 2007
[11]

J. H. Friedman. Greedy function approximation: a gradient boosting ma- chine. Annals of statistics, pages 1189–1232, 2001

work page 2001
[12]

Fryzlewicz

P. Fryzlewicz. Wild binary segmentation for multiple change-point detec- tion. The Annals of Statistics , 42(6):2243–2281, 2014

work page 2014
[13]

Greene, O

M. Greene, O. Rolfson, G. Garellick, M. Gordon, and S. Nemes. Im- proved statistical analysis of pre-and post-treatment patient-reported out- come measures (proms): the applicability of piecewise linear regression splines. Quality of Life Research, 24:567–573, 2015

work page 2015
[14]

Y . Guan, K. Pan, and K. Zhou. Polynomial time algorithms and extended formulations for unit commitment problems. IISE transactions , 50(8): 735–751, 2018

work page 2018
[15]

Gunnerud and B

V . Gunnerud and B. Foss. Oil production optimization—a piecewise linear model, solved with two decomposition strategies. Computers & Chemical Engineering, 34(11):1803–1812, 2010

work page 2010
[16]

Hwangbo, A

H. Hwangbo, A. L. Johnson, and Y . Ding. Spline model for wake e ffect analysis: Characteristics of a single wake and its impacts on wind turbine power generation. IISE Transactions, 50(2):112–125, 2018

work page 2018
[17]

Jeong, Y

J. Jeong, Y . M. Jung, S. H. Kim, and S. Yun. Trend filtering by adap- tive piecewise polynomials. Communications in Nonlinear Science and Numerical Simulation, 116:106866, 2023

work page 2023
[18]

Killick, P

R. Killick, P. Fearnhead, and I. A. Eckley. Optimal detection of change- points with a linear computational cost. Journal of the American Statisti- cal Association, 107(500):1590–1598, 2012

work page 2012
[19]

S.-J. Kim, K. Koh, S. Boyd, and D. Gorinevsky. ℓ1 trend filtering. SIAM review, 51(2):339–360, 2009

work page 2009
[20]

H. Lee, G. Jang, and G. Cho. Forecasting covid-19 cases by assessing control-intervention effects in republic of korea: a statistical modeling approach. Alexandria Engineering Journal, 61(11):9203–9217, 2022

work page 2022
[21]

W.-Y . Loh. Classification and regression trees. Wiley interdisciplinary reviews: data mining and knowledge discovery , 1(1):14–23, 2011

work page 2011
[22]

V . M. Muggeo. Estimating regression models with unknown break-points. Statistics in medicine, 22(19):3055–3071, 2003

work page 2003
[23]

Muriel and F

A. Muriel and F. N. Munshi. Capacitated multicommodity network flow problems with piecewise linear concave costs. IIE Transactions, 36(7): 683–696, 2004

work page 2004
[24]

Quarteroni, R

A. Quarteroni, R. Sacco, and F. Saleri. Numerical mathematics , vol- ume 37. Springer Science & Business Media, 2006

work page 2006
[25]

A. J. Smola and B. Sch ¨olkopf. A tutorial on support vector regression. Statistics and computing, 14:199–222, 2004

work page 2004
[26]

S. M. Stigler. Optimal experimental design for polynomial regression. Journal of the American Statistical Association , 66(334):311–318, 1971

work page 1971
[27]

https:// dportal.kdca.go.kr/pot/index.do, 2021

Korea Disease Control and Prevention Agency (KDCA). https:// dportal.kdca.go.kr/pot/index.do, 2021. [accessed 18 June 2024]

work page 2021
[28]

R. J. Tibshirani. Adaptive piecewise polynomial estimation via trend fil- tering. The Annals of Statistics , 42(1):285, 2014

work page 2014
[29]

J. H. Tomal and H. Rahman. A bayesian piecewise linear model for the detection of breakpoints in housing prices. Metron, 79(3):361–381, 2021

work page 2021
[30]

Truong, L

C. Truong, L. Oudre, and N. Vayatis. Selective review of o ffline change point detection methods. Signal Processing, 167:107299, 2020

work page 2020
[31]

Tunc and B

H. Tunc and B. Genc ¸. A column generation based heuristic algorithm for piecewise linear regression. Expert Systems with Applications , 171: 114539, 2021

work page 2021
[32]

E. Vieth. Fitting piecewise linear regression functions to biological re- sponses. Journal of applied physiology, 67(1):390–396, 1989

work page 1989
[33]

A. K. Wagner, S. B. Soumerai, F. Zhang, and D. Ross-Degnan. Seg- mented regression analysis of interrupted time series studies in medica- tion use research. Journal of clinical pharmacy and therapeutics , 27(4): 299–309, 2002

work page 2002
[34]

Z. Wu, Y . Li, and L. Hu. A synchronous multiple change-point detecting method for manufacturing process. Computers & Industrial Engineering, 169:108114, 2022

work page 2022
[35]

L. Yang, S. Liu, S. Tsoka, and L. G. Papageorgiou. Mathematical pro- gramming for piecewise linear regression analysis. Expert systems with applications, 44:156–167, 2016. 13

work page 2016

[1] [1]

J. Bai. Estimation of a change point in multiple regression models.Review of Economics and Statistics , 79(4):551–563, 1997

work page 1997

[2] [2]

Bai and P

J. Bai and P. Perron. Estimating and testing linear models with multiple structural changes. Econometrica, pages 47–78, 1998

work page 1998

[3] [3]

Bai and P

J. Bai and P. Perron. Computation and analysis of multiple structural change models. Journal of applied econometrics , 18(1):1–22, 2003

work page 2003

[4] [4]

Boyd and L

S. Boyd and L. Vandenberghe. Introduction to applied linear algebra: vectors, matrices, and least squares . Cambridge university press, 2018

work page 2018

[5] [5]

L. Breiman. Random forests. Machine learning, 45:5–32, 2001

work page 2001

[6] [6]

Chiou, Y .-T

J.-M. Chiou, Y .-T. Chen, and T. Hsing. Identifying multiple changes for a functional data sequence with application to freeway traffic segmentation. The Annals of Applied Statistics , 13(3):1430–1463, 2019

work page 2019

[7] [7]

Dell’Accio and F

F. Dell’Accio and F. Nudo. Polynomial approximation of derivatives through a regression–interpolation method. Applied Mathematics Letters, 152:109010, 2024

work page 2024

[8] [8]

Dell’Accio, F

F. Dell’Accio, F. Di Tommaso, and F. Nudo. Generalizations of the con- strained mock-chebyshev least squares in two variables: Tensor product vs total degree polynomial interpolation. Applied Mathematics Letters , 125:107732, 2022

work page 2022

[9] [9]

Y . Ding. Data science for wind energy . Chapman and Hall/CRC, 2019

work page 2019

[10] [10]

Fearnhead and Z

P. Fearnhead and Z. Liu. On-line inference for multiple changepoint prob- lems. Journal of the Royal Statistical Society Series B: Statistical Method- ology, 69(4):589–605, 2007

work page 2007

[11] [11]

J. H. Friedman. Greedy function approximation: a gradient boosting ma- chine. Annals of statistics, pages 1189–1232, 2001

work page 2001

[12] [12]

Fryzlewicz

P. Fryzlewicz. Wild binary segmentation for multiple change-point detec- tion. The Annals of Statistics , 42(6):2243–2281, 2014

work page 2014

[13] [13]

Greene, O

M. Greene, O. Rolfson, G. Garellick, M. Gordon, and S. Nemes. Im- proved statistical analysis of pre-and post-treatment patient-reported out- come measures (proms): the applicability of piecewise linear regression splines. Quality of Life Research, 24:567–573, 2015

work page 2015

[14] [14]

Y . Guan, K. Pan, and K. Zhou. Polynomial time algorithms and extended formulations for unit commitment problems. IISE transactions , 50(8): 735–751, 2018

work page 2018

[15] [15]

Gunnerud and B

V . Gunnerud and B. Foss. Oil production optimization—a piecewise linear model, solved with two decomposition strategies. Computers & Chemical Engineering, 34(11):1803–1812, 2010

work page 2010

[16] [16]

Hwangbo, A

H. Hwangbo, A. L. Johnson, and Y . Ding. Spline model for wake e ffect analysis: Characteristics of a single wake and its impacts on wind turbine power generation. IISE Transactions, 50(2):112–125, 2018

work page 2018

[17] [17]

Jeong, Y

J. Jeong, Y . M. Jung, S. H. Kim, and S. Yun. Trend filtering by adap- tive piecewise polynomials. Communications in Nonlinear Science and Numerical Simulation, 116:106866, 2023

work page 2023

[18] [18]

Killick, P

R. Killick, P. Fearnhead, and I. A. Eckley. Optimal detection of change- points with a linear computational cost. Journal of the American Statisti- cal Association, 107(500):1590–1598, 2012

work page 2012

[19] [19]

S.-J. Kim, K. Koh, S. Boyd, and D. Gorinevsky. ℓ1 trend filtering. SIAM review, 51(2):339–360, 2009

work page 2009

[20] [20]

H. Lee, G. Jang, and G. Cho. Forecasting covid-19 cases by assessing control-intervention effects in republic of korea: a statistical modeling approach. Alexandria Engineering Journal, 61(11):9203–9217, 2022

work page 2022

[21] [21]

W.-Y . Loh. Classification and regression trees. Wiley interdisciplinary reviews: data mining and knowledge discovery , 1(1):14–23, 2011

work page 2011

[22] [22]

V . M. Muggeo. Estimating regression models with unknown break-points. Statistics in medicine, 22(19):3055–3071, 2003

work page 2003

[23] [23]

Muriel and F

A. Muriel and F. N. Munshi. Capacitated multicommodity network flow problems with piecewise linear concave costs. IIE Transactions, 36(7): 683–696, 2004

work page 2004

[24] [24]

Quarteroni, R

A. Quarteroni, R. Sacco, and F. Saleri. Numerical mathematics , vol- ume 37. Springer Science & Business Media, 2006

work page 2006

[25] [25]

A. J. Smola and B. Sch ¨olkopf. A tutorial on support vector regression. Statistics and computing, 14:199–222, 2004

work page 2004

[26] [26]

S. M. Stigler. Optimal experimental design for polynomial regression. Journal of the American Statistical Association , 66(334):311–318, 1971

work page 1971

[27] [27]

https:// dportal.kdca.go.kr/pot/index.do, 2021

Korea Disease Control and Prevention Agency (KDCA). https:// dportal.kdca.go.kr/pot/index.do, 2021. [accessed 18 June 2024]

work page 2021

[28] [28]

R. J. Tibshirani. Adaptive piecewise polynomial estimation via trend fil- tering. The Annals of Statistics , 42(1):285, 2014

work page 2014

[29] [29]

J. H. Tomal and H. Rahman. A bayesian piecewise linear model for the detection of breakpoints in housing prices. Metron, 79(3):361–381, 2021

work page 2021

[30] [30]

Truong, L

C. Truong, L. Oudre, and N. Vayatis. Selective review of o ffline change point detection methods. Signal Processing, 167:107299, 2020

work page 2020

[31] [31]

Tunc and B

H. Tunc and B. Genc ¸. A column generation based heuristic algorithm for piecewise linear regression. Expert Systems with Applications , 171: 114539, 2021

work page 2021

[32] [32]

E. Vieth. Fitting piecewise linear regression functions to biological re- sponses. Journal of applied physiology, 67(1):390–396, 1989

work page 1989

[33] [33]

A. K. Wagner, S. B. Soumerai, F. Zhang, and D. Ross-Degnan. Seg- mented regression analysis of interrupted time series studies in medica- tion use research. Journal of clinical pharmacy and therapeutics , 27(4): 299–309, 2002

work page 2002

[34] [34]

Z. Wu, Y . Li, and L. Hu. A synchronous multiple change-point detecting method for manufacturing process. Computers & Industrial Engineering, 169:108114, 2022

work page 2022

[35] [35]

L. Yang, S. Liu, S. Tsoka, and L. G. Papageorgiou. Mathematical pro- gramming for piecewise linear regression analysis. Expert systems with applications, 44:156–167, 2016. 13

work page 2016