An Efficient Stochastic Subgradient Method for the Global Placement Problem in Very Large-Scale Integration Circuits
Pith reviewed 2026-05-23 06:32 UTC · model grok-4.3
The pith
Transforming nonoverlapping constraints into ReLU penalties lets stochastic subgradient methods solve VLSI global placement with a convergence guarantee.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By converting nonoverlapping constraints into rectified linear penalty functions, the placement problem becomes analogous to ReLU network training; automatic differentiation then yields the subgradient needed for a stochastic subgradient solver whose convergence is proved for the first time on this class of problems, and the method produces lower wirelength with no overlaps on standard benchmarks.
What carries the argument
Rectified linear penalty functions for nonoverlapping constraints, solved via stochastic subgradient iteration with automatic differentiation.
Load-bearing premise
The nonoverlapping constraints can be turned into rectified linear penalties without introducing bias that changes solution quality or requires extra tuning.
What would settle it
On the ISPD2005 circuits the final placements still contain overlaps or the total wirelength fails to improve over the best published results.
read the original abstract
The placement problem in Very Large-Scale Integration (VLSI) circuits is a critical step in chip design. Its primary goal is to optimize the wirelength of circuit components within a confined area while adhering to nonoverlapping constraints. This paper introduces a novel approach that directly optimizes the original nonsmooth wirelength and proposes an innovative penalty model tailored for the global placement problem. Specifically, we transform the nonoverlapping constraints into rectified linear penalty functions, allowing for a more precise formulation of the problem. Notably, we recast the resultant optimization problem into a form analogous to training deep neural network with Rectified Linear Units (ReLU). Leveraging automatic differentiation techniques from deep learning, we efficiently compute the subgradient of the objective function. This facilitates the application of stochastic subgradient methods to solve the model. To enhance the algorithm's performance, several advanced techniques are further introduced, leading to significant improvements in both efficiency and solution quality. Numerical experiments were conducted on Gigascale Systems Research Center (GSRC) benchmark and International Symposium on Physical Design 2005 (ISPD2005) benchmark circuits. The results demonstrate that our proposed model and algorithm achieve significant reductions in wirelength while effectively eliminating overlaps. This highlights the potential of our approach as a transformative advancement for VLSI placement. Furthermore, we establish a rigorous convergence proof for the proposed stochastic subgradient method. To the best of our knowledge, it constitutes the first such result for the ReLU-type nonsmooth and nonconvex optimization problems.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes transforming the nonoverlapping constraints in VLSI global placement into ReLU penalty functions, recasting the resulting nonsmooth nonconvex problem in a form amenable to stochastic subgradient methods via automatic differentiation (analogous to ReLU network training). It introduces algorithmic enhancements, reports numerical results on GSRC and ISPD2005 benchmarks claiming significant wirelength reductions with effective overlap elimination, and supplies a convergence proof asserted to be the first for ReLU-type nonsmooth nonconvex problems.
Significance. A correct convergence analysis for this class of problems would be a technical contribution to nonsmooth nonconvex optimization. If the penalty formulation produces feasible placements with the reported quality improvements on standard benchmarks, the approach could provide a practical alternative for large-scale placement by importing tools from deep learning.
major comments (1)
- [Abstract and model formulation] Abstract and penalty model: the nonoverlapping constraints are replaced by ReLU penalties of the form max(0, overlap) scaled by a finite multiplier. Stationary points of the penalized objective need not be feasible for the original problem; the manuscript provides no a-priori bound on residual overlap, no adaptive multiplier schedule with feasibility guarantee, and no post-processing whose cost is shown to be negligible. This directly affects the central claim that overlaps are 'effectively eliminated' while achieving wirelength reductions.
minor comments (1)
- [Abstract] The abstract asserts 'significant reductions in wirelength' without numerical values or baseline comparisons; adding concrete metrics (e.g., percentage improvement over a reference placer) would improve clarity.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address the major comment on the penalty formulation below.
read point-by-point responses
-
Referee: [Abstract and model formulation] Abstract and penalty model: the nonoverlapping constraints are replaced by ReLU penalties of the form max(0, overlap) scaled by a finite multiplier. Stationary points of the penalized objective need not be feasible for the original problem; the manuscript provides no a-priori bound on residual overlap, no adaptive multiplier schedule with feasibility guarantee, and no post-processing whose cost is shown to be negligible. This directly affects the central claim that overlaps are 'effectively eliminated' while achieving wirelength reductions.
Authors: We agree that a finite penalty multiplier implies stationary points of the penalized problem need not be feasible for the original constrained formulation, and the manuscript provides neither an a priori bound on residual overlap nor an adaptive schedule guaranteeing feasibility. Our numerical results on the GSRC and ISPD2005 benchmarks show that a fixed, sufficiently large multiplier yields placements with negligible overlap in practice, and no post-processing is applied. To address the comment we will revise the manuscript to report quantitative residual-overlap metrics, detail the penalty-parameter selection procedure, and moderate the language in the abstract and conclusions to reflect empirical effectiveness on the tested instances rather than a general feasibility guarantee. revision: yes
Circularity Check
No significant circularity; derivation relies on independent modeling choice and external convergence analysis
full rationale
The paper selects a ReLU penalty reformulation of nonoverlap constraints as a modeling decision, recasts the problem by analogy to DNN training, applies automatic differentiation and stochastic subgradient methods, and supplies a convergence proof claimed as novel for this problem class. None of these steps reduce by construction to fitted parameters renamed as predictions, self-citations that bear the central load, or self-definitional equivalences; the experimental results on GSRC/ISPD2005 benchmarks and the mathematical convergence argument remain independent of the target claims.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
transform the non-overlapping constraints into rectified linear penalty functions... ϕ_{r,t}(x,y) ... ℓ_{xy}=ϕ... min HPWL + γ ℓ_b + γ ℓ_{xy}
-
IndisputableMonolith/Foundation/BranchSelection.leanbranch_selection unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
reformulate ... as a deep ReLU neural network with max{⌈log M⌉+1,3} ReLU layers
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Alpert, C.J., Mehta, D.P., Sapatnekar, S.S.: Handbook of Algo- rithms for Physical Design Automation. CRC Press, New York (2008 ). https://doi.org/10.1201/9781420013481 19
-
[2]
Springer, Heidelberg, Germany (2011)
Kahng, A.B., Lienig, J., Markov, I.L., Hu, J.: VLSI Physical Design: F rom Graph Partitioning to Timing Closure. Springer, Heidelberg, Germany (2011). https://doi.org/10.1007/978-90-481-9591-6
-
[3]
Sechen, C.: VLSI Placement and Global Routing Using Simulated Ann ealing vol
-
[4]
Springer, New York, NY (2012)
work page 2012
-
[6]
I n: 2007 Asia and South Pacific Design Automation Conference, pp
Viswanathan, N., Pan, M., Chu, C.: FastPlace 3.0: A fast multilevel quadratic placement algorithm with placement congestion control. I n: 2007 Asia and South Pacific Design Automation Conference, pp. 135–140 (2007). https://doi.org/10.1109/ASPDAC.2007.357975 . IEEE
-
[7]
In: Proceedings of the 1999 In ternational Sympo- sium on Physical Design, pp
Caldwell, A.E., Kahng, A.B., Markov, I.L.: Optimal partitioners and en d-case placers for standard-cell layout. In: Proceedings of the 1999 In ternational Sympo- sium on Physical Design, pp. 90–96 (1999). https://doi.org/10.1109/43.892854
-
[8]
Huang, Z., Li, X., Zhu, W.: Optimization models and algorithms for plac ement of very large scale integrated circuits. Operations Research Tran sactions 25(03), 15–36 (2021) https://doi.org/10.15960/j.cnki.issn.1007-6093.2021.03.002
work page doi:10.15960/j.cnki.issn.1007-6093.2021.03.002 2021
-
[9]
In: Proceedings of the 2005 International Sympo sium on Physical Design, pp
Chan, T., Cong, J., Sze, K.: Multilevel generalized force-directed method for cir- cuit placement. In: Proceedings of the 2005 International Sympo sium on Physical Design, pp. 185–192 (2005). https://doi.org/10.1109/IPDPS.2013.70
-
[10]
Naylor, W.C., Donelly, R., Sha, L.: Non-linear optimization system and method for wire length and delay optimization for an automatic electric circuit placer. Google Patents. US Patent 6,301,693 (2001). https://doi.org/US6301693
work page 2001
-
[11]
In: Proceedings of the 48th Design Automation Conf erence, pp
Hsu, M.-K., Chang, Y.-W., Balabanov, V.: TSV-aware analytical pla cement for 3d IC designs. In: Proceedings of the 48th Design Automation Conf erence, pp. 664–669 (2011)
work page 2011
-
[12]
Chen, T.-C., Jiang, Z.-W., Hsu, T.-C., Chen, H.-C., Chang, Y.-W.: NTU place3: An analytical placer for large-scale mixed-size designs with preplace d blocks and density constraints. IEEE Trans. Comput. Aided Des. Integr. Cir cuits Syst. 27(7), 1228–1240 (2008) https://doi.org/10.1109/TCAD.2008.923063
-
[13]
Zhu, W., Chen, J., Peng, Z., Fan, G.: Nonsmooth optimization meth od for VLSI global placement. IEEE Trans. Comput.-Aided Des. Integr. Circuit s Syst. 34(4), 642–655 (2015) https://doi.org/10.1109/TCAD.2015.2394484
-
[14]
In: International Symposium on Intelligence C omputation 20 and Applications, pp
Sun, J., Cheng, H., Wu, J., Zhu, Z., Chen, Y.: Floorplanning of VLSI by mixed- variable optimization. In: International Symposium on Intelligence C omputation 20 and Applications, pp. 137–151 (2023). Springer
work page 2023
-
[15]
Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chan an, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L., et al.: Pytorch: An imperative style , high- performance deep learning library. Adv. Neural Inf. Process. Sy st. 32 (2019) https://doi.org/10.48550/arXiv.1912.01703
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1912.01703 2019
-
[16]
Salama, H.F., Reeves, D.S., Viniotis, Y.: GSRC floorplan benchmarks (1997)
work page 1997
-
[17]
Chang, Y.-W., Jiang, Z.-W., Chen, T.-C.: Essential issues in analytic al place- ment algorithms. IPSJ Trans. Syst. LSI Des. Methodol. 2, 145–166 (2009) https://doi.org/10.2197/ipsjtsldm.2.145
-
[18]
In: Pro- ceedings of the 56th Annual Design Automation Conference 2019, pp
Lin, Y., Dhar, S., Li, W., Ren, H., Khailany, B., Pan, D.Z.: Dreamplace: Deep learning toolkit-enabled GPU acceleration for modern VLSI placemen t. In: Pro- ceedings of the 56th Annual Design Automation Conference 2019, pp. 1–6 (2019). https://doi.org/10.1109/TCAD.2020.3003843
-
[19]
Jin, S., Li, L., Sun, Y.: On the random batch method for second or der interacting particle systems. Multiscale Model. Simul. 20(2), 741–768 (2022) https://doi.org/10.1137/20M1383069
-
[20]
In: Pro- ceedings of the 51st Annual Design Automation Conference, pp
Lu, J., Chen, P., Chang, C.-C., Sha, L., Huang, D.J.-H., Teng, C.-C., Cheng, C.-K.: ePlace: Electrostatics based placement using Nesterov’s met hod. In: Pro- ceedings of the 51st Annual Design Automation Conference, pp. 1 –6 (2014). https://doi.org/10.1145/2593069.2593216
-
[21]
Zhou, T., Ren, J., Medo, M., Zhang, Y.-C.: Bipartite network proj ection and personal recommendation. Phys. Rev. E Stat. Nonlin. Soft Matte r Phys. 76(4), 046115 (2007) https://doi.org/10.1103/PhysRevE.76.046115
-
[22]
Weiss, P.: L’hypoth` ese du champ mol´ eculaire et la propri´ et´ e ferromagn´ etique. J. Phys. Theor. Appl. 6(1), 661–690 (1907) https://doi.org/10.1051/jphystap:019070060066100
-
[23]
Xiao, S., Wang, H., Wang, W., Huang, Z., Zhou, X., Xu, M.: Arti- ficial bee colony algorithm based on adaptive neighborhood search and gaussian perturbation. Appl. Soft Comput. 100, 106955 (2021) https://doi.org/10.1016/j.asoc.2020.106955
-
[24]
Nocedal, J., Wright, S.J.: Numerical Optimization. Springer, New Y ork, NY (1999). https://doi.org/10.1007/978-0-387-40065-5
-
[25]
Adam: A Method for Stochastic Optimization
Kingma, D.P., Ba, J.: Adam: a method for stochastic optimization. arXiv:1412.6980 [cs] (2014). Accessed 2018-04-24 21 0 100 200 300 400 500 600 700 800 W idth 0 100 200 300 400 500 600 700 800 Height RBSM: hpwl = 323264.41 overlap = 1058.74 /c55/c1c/c56 /c4d/c52/c79/c79 /c60/c32/c62/c6d/c48/c69 0 100 200 300 400 500 600 700 800 W idth 0 100 200 300 400 50...
work page internal anchor Pith review Pith/arXiv arXiv 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.