pith. sign in

arxiv: 2607.01015 · v1 · pith:Y242TGS7new · submitted 2026-07-01 · 💻 cs.CV

SuperFlex: Deformable Superquadrics for Point Cloud Decomposition

Pith reviewed 2026-07-02 13:58 UTC · model grok-4.3

classification 💻 cs.CV
keywords superquadricspoint cloud decompositiondeformable primitives3D shape representationpartial point cloudsreconstruction accuracy
0
0 comments X

The pith

Superquadrics gain bending, tapering and a new loss to represent curved shapes more accurately in point clouds.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

SuperFlex improves superquadric decompositions of 3D point clouds by introducing a novel loss for higher accuracy. It adds bending and tapering deformations to capture curved and asymmetric shapes that rigid primitives miss. The improved decompositions serve as supervision to train a model that handles partial, real-world scans. This yields substantially better reconstruction accuracy than previous optimization and learning methods while using a compact set of primitives.

Core claim

The framework adds a novel loss formulation and bending and tapering deformations to superquadrics, enabling high-fidelity representation of curved and asymmetric geometries, and leverages these decompositions to train a model robust to partial point clouds.

What carries the argument

Deformable superquadric primitives equipped with bending and tapering, optimized under a new loss function.

Load-bearing premise

The new loss formulation and bending/tapering deformations can be optimized or learned while preserving the geometric interpretability and compactness of the superquadric primitives.

What would settle it

Running the optimization or training on benchmark datasets and measuring no gain in reconstruction metrics such as Chamfer distance over baselines would disprove the accuracy improvements.

Figures

Figures reproduced from arXiv: 2607.01015 by Elisabetta Fedele, Francis Engelmann, Gabriel Tavernini, Leonidas Guibas, Marc Pollefeys, Tiago Novello.

Figure 1
Figure 1. Figure 1: 3D Object Decomposition with Deformable Superquadrics. [PITH_FULL_IMAGE:figures/full_fig_p001_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Illustration of superquadric deformations: tapering (red) and bending (blue). Both deformations are shown along a single axis (the z-axis). For tapering along z, the parameters are τx (shown) and τy (not shown). For bending, we select a bending angle αz around the z axis, and bend to the curvature defined by kz [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Illustration of the SuperFlex model. Given an input point cloud, SuperFlex decomposes the object into a set of superquadric primitives, each defined by its pose (t, R), shape (s, ϵ), and deformations (τ , β). The model is trained via self-supervised joint volumetric and surface losses. A subsequent (optional) object-specific optimization can further improve the superquadric decomposition quality using the … view at source ↗
Figure 4
Figure 4. Figure 4: Qualitative Results on ShapeNet. Top row: input point clouds. Below: the outputs of baselines and our methods. Different colors indicate different primitives. For SuperFlex, we compare variants with and without tapering and bending (T&B) deformation parameters. Test-time Opt. IoU↑ F [18]↑ L1↓ L2↓ # Primitives↓ Runtime↓ ✗ 0.72 0.37 1.54 0.043 5.64 0.0082 s ✓ 0.87 0.49 1.32 0.036 5.64 5 s [PITH_FULL_IMAGE:f… view at source ↗
Figure 5
Figure 5. Figure 5: Qualitative Test-time Optimization Results. [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Qualitative Results on ASE. Top row: partial input point clouds. Below: the outputs of our standard SuperFlex and its robust fine-tuned variant. 5.3 Partial Point Clouds Datasets. We evaluate reconstruction under realistic sensing conditions using both quantitative and qualitative experiments. For quantitative evaluation, we use the Aria Synthetic Environments (ASE) dataset [23], while qualitative re￾sults… view at source ↗
Figure 7
Figure 7. Figure 7: Robustness on real-world point clouds. We show results from two Scan￾Net++ scenes where objects are decomposed into sets of superquadrics by our robust SuperFlex variant. Different colors indicate different object instances. A rendering of the original scene is shown in the top-left corner. Qualitative Evaluation [PITH_FULL_IMAGE:figures/full_fig_p013_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Robustness in real-world point clouds. We show four example scene from ScanNet++ where each object is decomposed into a set of superquadrics [PITH_FULL_IMAGE:figures/full_fig_p023_8.png] view at source ↗
Figure 9
Figure 9. Figure 9: Robustness in real-world point clouds. We show four examples scene from Replica where each object is decomposed into a set of superquadrics [PITH_FULL_IMAGE:figures/full_fig_p024_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Qualitative Results on ShapeNet. Top row: input point clouds. Below: the outputs of baselines and our methods. Different colors indicate different primitives. For SuperFlex, we compare variants with and without tapering and bending (T&B) deformation parameters [PITH_FULL_IMAGE:figures/full_fig_p026_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Qualitative results on ABO. In each block, the first row shows the input point clouds, the second row shows the direct predictions of SuperFlex, and the third row shows the corresponding optimized results. Different colors denote different primitives [PITH_FULL_IMAGE:figures/full_fig_p027_11.png] view at source ↗
Figure 12
Figure 12. Figure 12: Additional Qualitative Results on ShapeNet. [PITH_FULL_IMAGE:figures/full_fig_p028_12.png] view at source ↗
Figure 13
Figure 13. Figure 13: SuperFlex from a single RGB Image. From an input RGB frame, we first compute instance point maps using SAM3 and MoGE. Then, we apply SuperFlex on top of each object’s point map [PITH_FULL_IMAGE:figures/full_fig_p028_13.png] view at source ↗
read the original abstract

Superquadrics have proven to provide a compact, geometrically meaningful representation for 3D objects. However, existing methods suffer from limited reconstruction accuracy, are restricted to rigid primitives, and lack robustness to partial point clouds. In this work, we present SuperFlex, an enhanced framework that expands the expressive power and applicability of superquadric decompositions. First, we introduce a novel loss formulation which significantly improves reconstruction accuracy. Second, we include bending and tapering deformations, enabling high-fidelity representation of curved and asymmetric geometries. Finally, we leverage these high-quality decompositions as supervision to train a model that is robust to partial real-world point clouds. Experiments demonstrate substantial improvements in reconstruction accuracy over both optimization- and learning-based baselines while maintaining a highly compact primitive representation.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces SuperFlex, a framework extending superquadric decomposition of point clouds via a novel loss formulation for improved accuracy, bending and tapering deformations to handle curved and asymmetric shapes, and supervision from these decompositions to train a model robust to partial real-world point clouds. It claims substantial reconstruction accuracy gains over optimization- and learning-based baselines while preserving a compact primitive representation.

Significance. If the quantitative results and ablations hold, the work meaningfully extends the utility of geometrically interpretable superquadrics to more complex real-world geometries and partial observations, potentially benefiting downstream tasks in 3D vision that value both compactness and fidelity over black-box alternatives.

major comments (2)
  1. [Experiments] Experiments section: the central claim of 'substantial improvements' over baselines requires explicit reporting of metrics (e.g., Chamfer distance, IoU), ablation tables isolating the novel loss versus deformations, and error analysis on partial clouds; without these the accuracy claim remains unverified from the abstract alone.
  2. [Method] Method, deformation parameterization: the bending and tapering extensions must be shown not to inflate the effective degrees of freedom beyond the claimed compactness (e.g., via a table of primitive parameter counts before/after deformation); otherwise the interpretability advantage over general meshes is at risk.
minor comments (2)
  1. [Abstract] Abstract, paragraph 2: the phrase 'highly compact primitive representation' would benefit from a concrete comparison (e.g., average number of primitives or bits per shape) to prior superquadric methods.
  2. [Related Work] Related work: ensure explicit citation of the specific superquadric fitting losses used as baselines so readers can assess novelty of the proposed formulation.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the thoughtful and constructive feedback. We address each major comment below and commit to revisions that strengthen the experimental reporting and method clarity while preserving the core contributions of the work.

read point-by-point responses
  1. Referee: [Experiments] Experiments section: the central claim of 'substantial improvements' over baselines requires explicit reporting of metrics (e.g., Chamfer distance, IoU), ablation tables isolating the novel loss versus deformations, and error analysis on partial clouds; without these the accuracy claim remains unverified from the abstract alone.

    Authors: The full manuscript reports quantitative results using Chamfer distance and related metrics against both optimization- and learning-based baselines. However, we agree that dedicated ablation tables and partial-cloud error analysis would make the contributions of the novel loss and deformations more transparent. We will add these elements to the experiments section in the revised version. revision: yes

  2. Referee: [Method] Method, deformation parameterization: the bending and tapering extensions must be shown not to inflate the effective degrees of freedom beyond the claimed compactness (e.g., via a table of primitive parameter counts before/after deformation); otherwise the interpretability advantage over general meshes is at risk.

    Authors: We will include a new table in the method section that enumerates the parameter count per primitive for the base superquadric formulation versus the version augmented with bending and tapering. This will explicitly demonstrate that the added deformation parameters remain small in number relative to the gain in expressiveness, thereby preserving the compactness and geometric interpretability of the representation. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The available text consists only of the abstract, which states high-level contributions (novel loss, bending/tapering deformations, supervision for partial clouds) without any equations, parameter definitions, loss formulations, or derivation steps. No claimed prediction, uniqueness theorem, or first-principles result is present that could reduce to fitted inputs or self-citations by construction. The experimental claim of improved accuracy is asserted but not supported by any inspectable chain, so no circularity of any enumerated kind can be exhibited. The paper is therefore self-contained against external benchmarks from the given material.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only input supplies no explicit free parameters, axioms, or invented entities; full text required for ledger.

pith-pipeline@v0.9.1-grok · 5672 in / 1031 out tokens · 27398 ms · 2026-07-02T13:58:44.377554+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

37 extracted references · 3 canonical work pages · 2 internal anchors

  1. [1]

    In: European Confer- ence on Computer Vision (ECCV) (2024) 13

    Avetisyan, A., Xie, C., Howard-Jenkins, H., Yang, T.Y., Aroudj, S., Patra, S., Zhang, F., Frost, D., Holland, L., Orme, C., et al.: SceneScript: Reconstructing Scenes with an Autoregressive Structured Language Model. In: European Confer- ence on Computer Vision (ECCV) (2024) 13

  2. [2]

    Barr,A.H.:SuperquadricsandAngle-PreservingTransformations.IEEEComputer Graphics and Applications (1981) 2, 4 16 Tavernini et al

  3. [3]

    ACM SIGGRAPH Computer Graphics (1984) 4

    Barr, A.H.: Global and local deformations of solid primitives. ACM SIGGRAPH Computer Graphics (1984) 4

  4. [4]

    Brazil, G., Kumar, A., Straub, J., Ravi, N., Johnson, J., Gkioxari, G.: Omni3D: A LargeBenchmarkandModelfor3DObjectDetectionintheWild.In:International Conference on Computer Vision and Pattern Recognition (CVPR) (2023) 8

  5. [5]

    Carion, N., Gustafson, L., Hu, Y.T., Debnath, S., Hu, R., Suris, D., Ryali, C., Al- wala, K.V., Khedr, H., Huang, A., Lei, J., Ma, T., Guo, B., Kalla, A., Marks, M., Greer, J., Wang, M., Sun, P., Rädle, R., Afouras, T., Mavroudi, E., Xu, K., Wu, T.H., Zhou, Y., Momeni, L., Hazra, R., Ding, S., Vaze, S., Porcher, F., Li, F., Li, S., Kamath, A., Cheng, H.K....

  6. [6]

    ShapeNet: An Information-Rich 3D Model Repository

    Chang, A.X., Funkhouser, T., Guibas, L., Hanrahan, P., Huang, Q., Li, Z., Savarese, S., Savva, M., Song, S., Su, H., et al.: ShapeNet: An Information-rich 3D Model Repository. arXiv preprint arXiv:1512.03012 (2015) 9, 7

  7. [7]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2026) 7

    Chen, X., Chu, F.J., Gleize, P., Liang, K.J., Sax, A., Tang, H., Wang, W., Guo, M., Hardin, T., Li, X., et al.: SAM 3D: 3Dfy anything in images. In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2026) 7

  8. [8]

    In: European Conference on Computer Vision (ECCV) (2016) 9

    Choy, C.B., Xu, D., Gwak, J., Chen, K., Savarese, S.: 3d-r2n2: A unified approach for single and multi-view 3d object reconstruction. In: European Conference on Computer Vision (ECCV) (2016) 9

  9. [9]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2022) 12, 7

    Collins, J., Goel, S., Deng, K., Luthra, A., Xu, L., Gundogdu, E., Zhang, X., Vi- cente, T.F.Y., Dideriksen, T., Arora, H., et al.: ABO: Dataset and Benchmarks for Real-world 3D Object Understanding. In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2022) 12, 7

  10. [10]

    In: International Conference on Com- puter Vision and Pattern Recognition (CVPR) (2020) 6, 7, 8

    Deng, B., Genova, K., Yazdani, S., Bouaziz, S., Hinton, G., Tagliasacchi, A.: CvxNet: Learnable Convex Decomposition. In: International Conference on Com- puter Vision and Pattern Recognition (CVPR) (2020) 6, 7, 8

  11. [11]

    Inter- national Conference on Learning Representations (ICLR) (2026) 4

    Fedele, E., Engelmann, F., Huang, I., Litany, O., Pollefeys, M., Guibas, L.: Space- Control: Introducing Test-Time Spatial Control to 3D Generative Modeling. Inter- national Conference on Learning Representations (ICLR) (2026) 4

  12. [12]

    In: International Conference on Computer Vision (ICCV) (2025) 1, 2, 3, 5, 6, 7, 9, 10, 11, 13, 14, 8

    Fedele, E., Sun, B., Guibas, L., Pollefeys, M., Engelmann, F.: SuperDec: 3D Scene Decomposition with Superquadric Primitives. In: International Conference on Computer Vision (ICCV) (2025) 1, 2, 3, 5, 6, 7, 9, 10, 11, 13, 14, 8

  13. [13]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2026) 3

    Ganeshan, A., Gadelha, M., Groueix, T., Chen, Z., Chaudhuri, S., Kim, V., Yifan, W., Ritchie, D.: Residual Primitive Fitting of 3D Shapes with SuperFrusta. In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2026) 3

  14. [14]

    In: International Confer- ence on Computer Vision (ICCV) (2019) 3

    Genova, K., Cole, F., Vlasic, D., Sarna, A., Freeman, W.T., Funkhouser, T.: Learn- ing shape templates with structured implicit functions. In: International Confer- ence on Computer Vision (ICCV) (2019) 3

  15. [15]

    Jaklic, A., Leonardis, A., Solina, F.: Segmentation and Recovery of Superquadrics, vol. 20. Springer Science & Business Media (2000) 3, 4, 1

  16. [16]

    In: ACM SIGGRAPH 2007 (2007) 12

    Katz, S., Tal, A., Basri, R.: Direct visibility of point sets. In: ACM SIGGRAPH 2007 (2007) 12

  17. [17]

    Kingma, D.P., Ba, J.: Adam: A method for stochastic optimization (2017) 10

  18. [18]

    ACM Trans

    Knapitsch, A., Park, J., Zhou, Q.Y., Koltun, V.: Tanks and temples: benchmarking large-scale scene reconstruction. ACM Trans. Graph. (2017) 9, 10, 11, 13, 14, 2, 3, 4 SuperFlex17

  19. [19]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2022) 2, 3, 9, 10, 13, 8

    Liu, W., Wu, Y., Ruan, S., Chirikjian, G.S.: Robust and Accurate Superquadric Recovery: A Probabilistic Approach. In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2022) 2, 3, 9, 10, 13, 8

  20. [20]

    In: International Conference on Computer Vi- sion and Pattern Recognition (CVPR)

    Liu, W., Wu, Y., Ruan, S., Chirikjian, G.S.: Marching-primitives: Shape abstrac- tion from signed distance function. In: International Conference on Computer Vi- sion and Pattern Recognition (CVPR). pp. 8771–8780 (2023) 2, 3, 9, 10, 11, 8

  21. [21]

    International Conference on Neural Information Processing Systems (NeurIPS) (2019) 5

    Liu, Z., Tang, H., Lin, Y., Han, S.: Point-voxel cnn for efficient 3d deep learning. International Conference on Neural Information Processing Systems (NeurIPS) (2019) 5

  22. [22]

    International Con- ference on Neural Information Processing Systems (NeurIPS) (2023) 4

    Monnier, T., Austin, J., Kanazawa, A., Efros, A., Aubry, M.: Differentiable blocks world: Qualitative 3d decomposition by rendering primitives. International Con- ference on Neural Information Processing Systems (NeurIPS) (2023) 4

  23. [23]

    In: International Conference on Computer Vision (ICCV) (2023) 12, 13

    Pan, X., Charron, N., Yang, Y., Peters, S., Whelan, T., Kong, C., Parkhi, O., New- combe, R., Ren, Y.C.: Aria digital twin: A new benchmark dataset for egocentric 3d machine perception. In: International Conference on Computer Vision (ICCV) (2023) 12, 13

  24. [24]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR)

    Paschalidou, D., Gool, L.V., Geiger, A.: Learning unsupervised hierarchical part decomposition of 3d objects from a single rgb image. In: International Conference on Computer Vision and Pattern Recognition (CVPR). pp. 1060–1070 (2020) 3, 4

  25. [25]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2021) 3

    Paschalidou, D., Katharopoulos, A., Geiger, A., Fidler, S.: Neural parts: Learning expressive 3d shape abstractions with invertible neural networks. In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2021) 3

  26. [26]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2019) 2, 3, 4, 9, 10, 13, 8

    Paschalidou, D., Ulusoy, A.O., Geiger, A.: Superquadrics revisited: Learning 3d shape parsing beyond cuboids. In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2019) 2, 3, 4, 9, 10, 13, 8

  27. [27]

    In: International Conference on Neural Infor- mation Processing Systems (NeurIPS) (2017) 9

    Qi, C.R., Yi, L., Su, H., Guibas, L.J.: Pointnet++: Deep hierarchical feature learn- ing on point sets in a metric space. In: International Conference on Neural Infor- mation Processing Systems (NeurIPS) (2017) 9

  28. [28]

    ACM SIG- GRAPH 2026 Conference Papers (2026) 4

    Sella, E., Phung, H., Amiel, N., Litany, O., Patashnik, O., Averbuch-Elor, H.: Prox-e: Fine-grained 3d shape editing via primitive-based abstractions. ACM SIG- GRAPH 2026 Conference Papers (2026) 4

  29. [29]

    IEEE transactions on pattern analysis and machine intelligence (1990) 4

    Solina, F., Bajcsy, R.: Recovery of parametric models from range images: The case for superquadrics with global deformations. IEEE transactions on pattern analysis and machine intelligence (1990) 4

  30. [30]

    In: International Conference on Com- puter Vision and Pattern Recognition (CVPR) (2017) 3

    Tulsiani, S., Su, H., Guibas, L.J., Efros, A.A., Malik, J.: Learning shape abstrac- tions by assembling volumetric primitives. In: International Conference on Com- puter Vision and Pattern Recognition (CVPR) (2017) 3

  31. [31]

    In: International Conference on Com- puter Vision and Pattern Recognition (CVPR) (1998) 5

    Van Dop, E.R., Regtien, P.P.: Fitting undeformed superquadrics to range data: improving model recovery and classification. In: International Conference on Com- puter Vision and Pattern Recognition (CVPR) (1998) 5

  32. [32]

    In: International Conference on Robotics and Automation (ICRA) (2017) 4

    Vezzani,G.,Pattacini,U.,Natale,L.:AGraspingApproachBasedonSuperquadric Models. In: International Conference on Robotics and Automation (ICRA) (2017) 4

  33. [33]

    In: International Conference on Neural Information Processing Systems (NeurIPS) (2025) 7

    Wang, R., Xu, S., Dong, Y., Deng, Y., Xiang, J., Lv, Z., Sun, G., Tong, X., Yang, J.: Moge-2: Accurate monocular geometry with metric scale and sharp details. In: International Conference on Neural Information Processing Systems (NeurIPS) (2025) 7

  34. [34]

    In: ACM SIGGRAPH Asia 2025 Conference Papers (2025) 3 18 Tavernini et al

    Wang, Y., Chen, W., Hu, Z., Zhang, R., Yin, Y., Wu, R., Luo, K., Qian, S., Ma, Y., Li, H., et al.: Light-SQ: Structure-aware Shape Abstraction with Superquadrics for Generated Meshes. In: ACM SIGGRAPH Asia 2025 Conference Papers (2025) 3 18 Tavernini et al

  35. [35]

    ACM Transactions On Graphics (TOG) (2021) 2, 3, 9, 10, 13, 8

    Yang, K., Chen, X.: Unsupervised learning for cuboid shape abstraction via joint segmentation from point clouds. ACM Transactions On Graphics (TOG) (2021) 2, 3, 9, 10, 13, 8

  36. [36]

    In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2023) 12

    Yeshwanth, C., Liu, Y.C., Nießner, M., Dai, A.: Scannet++: A high-fidelity dataset of 3d indoor scenes. In: International Conference on Computer Vision and Pattern Recognition (CVPR) (2023) 12

  37. [37]

    tightens

    Zhao, H., Brunke, L., Lagerquist, O., Zhou, S., Schoellig, A.P.: Sq-cbf: Signed distance functions for numerically stable superquadric-based safety filtering. arXiv preprint arXiv:2602.11049 (2026) 4 SuperFlex1 SuperFlex: Deformable Superquadrics for Point Cloud Decomposition Supplementary Material Abstract.We provide further explanations on the superquad...