pith. sign in

arxiv: 2606.03499 · v1 · pith:PGSUG5MDnew · submitted 2026-06-02 · 💻 cs.CV

Characterizing Detectability in 3DGS Poisoning: A Stage-wise Benchmark

Pith reviewed 2026-06-28 10:25 UTC · model grok-4.3

classification 💻 cs.CV
keywords 3D Gaussian Splattingpoisoning attacksdetectabilitystage-wise benchmarkforensic signalstraining dynamicsnovel view synthesis
0
0 comments X

The pith

Poisoning attacks on 3D Gaussian Splatting leave stage-dependent forensic signals that require evaluating detection at each pipeline stage separately.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that the 3DGS reconstruction pipeline consists of multiple stages that generate different intermediate data, so the traces left by poisoning attacks appear only at certain points rather than uniformly throughout. To study this, the authors created Poison-3DGS, a benchmark that supplies multi-view images, geometry, training dynamics, and final Gaussian parameters for many scenes and attack types. Systematic tests across these stages demonstrate that detection success changes markedly depending on which stage is examined and that later stages often supply signals, such as training behavior and parameter statistics, that are invisible earlier. A reader should care because this means single-stage detectors are likely to miss attacks that only become visible later in the process.

Core claim

The multi-stage 3DGS pipeline produces heterogeneous representations, so forensic signals for poisoning are stage-dependent; a benchmark exposing signals at image, geometry, training, and parameter stages shows that detectability varies across stages with no single stage dominating, that attacks produce distinct stage-specific signals, and that later-stage cues like training dynamics and Gaussian statistics supply strong evidence unavailable at earlier stages.

What carries the argument

The Poison-3DGS benchmark that systematically exposes stage-specific artifacts (multi-view images, geometry, training dynamics, Gaussian parameters) across scenes and attack types.

If this is right

  • Detection effectiveness depends on the stage at which signals are observed rather than on a universal detector.
  • Later stages such as training dynamics and Gaussian parameter statistics provide cues unavailable at earlier stages.
  • Different attack types exhibit distinct stage-specific forensic signals, so the best observation point varies with the attack.
  • No single stage can be assumed to dominate detection performance across all attacks.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • A combined multi-stage detector could be built by fusing signals that only appear at different points in the pipeline.
  • The same stage-wise lens could be applied to other reconstruction pipelines that produce sequential heterogeneous outputs.
  • Benchmark results could guide the design of attacks that deliberately hide signals until late stages.
  • Extending the benchmark to additional attack variants would test whether the observed stage dependence holds more broadly.

Load-bearing premise

The multi-stage nature of the 3DGS reconstruction pipeline produces heterogeneous intermediate representations from which forensic signals for detecting poisoning are inherently stage dependent.

What would settle it

An experiment in which a detector relying only on early-stage signals (images or geometry) achieves consistently high accuracy across every tested attack type and scene would show that stage dependence is not required.

Figures

Figures reproduced from arXiv: 2606.03499 by Kaixin Xu, Ngai-Man Cheung, Quoc-Anh Bui-Huynh, Thanh Duc Ngo, Wang Zhe, Xue Geng, Xulei Yang.

Figure 1
Figure 1. Figure 1: Our stage-wise view of 3DGS poisoning detection. (a) The multi-stage 3DGS recon￾struction pipeline exposes stage-specific artifacts, including multi-view images, geometry, training dynamics, and Gaussian parameters. (b) Attack injection stages and their corresponding forensic signals across the pipeline. An attack introduced at one stage may produce its most detectable foren￾sic signal at a different stage… view at source ↗
Figure 2
Figure 2. Figure 2: Qualitative examples from Poison-3DGS. StealthAttack shows image injection by inserting an illusory object into a target training image, while Poison-Splat shows image perturbation through subtle adversarial noise and its amplified residual. For 3D-GSW and GuardSplat, we render the same sample view from the clean model and the watermarked model, and visualize amplified differences. The figure illustrates t… view at source ↗
Figure 3
Figure 3. Figure 3: Stage-specific forensic signals for different attacks. (a) Poison-Splat is most visible in the final Gaussian representation: poisoned models contain more Gaussians and show denser Gaussian-center distributions than matched clean models, revealing over-densification rather than render-quality differences. (b) StealthAttack is most visible in training dynamics: its loss variability remains higher throughout… view at source ↗
read the original abstract

3D Gaussian Splatting (3DGS) has rapidly emerged as a leading representation for real-time novel view synthesis, but recent work shows it is vulnerable to diverse poisoning attacks, including illusory object injection, computation cost amplification, and post hoc model watermarking. Despite this expanding threat surface, existing studies focus mainly on attack success, while defense and detection remain underexplored. From a detection perspective, a key challenge and opportunity arise from the multi-stage nature of the 3DGS reconstruction pipeline, which produces heterogeneous intermediate representations. Forensic signals for detecting poisoning are inherently stage dependent: an attack introduced at one stage may produce signals that emerge only at later stages. This motivates a stage-wise view of detectability that goes beyond single-stage evaluation. We introduce Poison-3DGS, a benchmark for stage-wise characterization of poisoning detection in 3DGS. It exposes stage-specific artifacts, including multi-view images, geometry, training dynamics, and Gaussian parameters, across a diverse set of scenes and attacks. Using it, we conduct a systematic study of detectability across pipeline stages. Our analysis reveals several insights. First, detectability varies significantly across stages, and no single stage consistently dominates across attack types. Second, different attacks exhibit distinct stage-specific forensic signals, so detection effectiveness depends critically on where signals are observed. Third, later-stage signals such as training dynamics and Gaussian parameter statistics provide strong cues not observable at earlier stages. Overall, our work provides a principled benchmark and the first systematic characterization of stage-dependent detectability in 3DGS, offering a foundation for future research on robust and reliable 3DGS systems.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

0 major / 2 minor

Summary. The manuscript introduces Poison-3DGS, a benchmark for stage-wise characterization of detectability of poisoning attacks (illusory object injection, computation cost amplification, post-hoc watermarking) in 3D Gaussian Splatting pipelines. It exposes stage-specific artifacts across multi-view images, geometry, training dynamics, and Gaussian parameters, and reports three empirical observations from experiments on diverse scenes and attacks: detectability varies significantly across stages with no single stage dominating; attacks produce distinct stage-specific forensic signals; and later stages (training dynamics, Gaussian statistics) yield strong cues absent at earlier stages.

Significance. If the experimental results hold, the work supplies a reusable benchmark and the first systematic stage-wise analysis of poisoning detectability in 3DGS. The explicit focus on heterogeneous intermediate representations and the finding that later-stage signals are often stronger constitute a concrete foundation for subsequent detection research. The empirical nature of the study (new experiments rather than parameter fitting) is a strength.

minor comments (2)
  1. [Abstract] Abstract and §1: the claims about 'strong cues' and 'no single stage dominating' would be more persuasive if the abstract or introduction stated the number of scenes, attack variants, detection methods, and quantitative metrics (e.g., AUC, precision-recall) used to reach these conclusions.
  2. The manuscript should clarify whether the benchmark release includes code, trained models, and exact attack implementations so that the reported stage-wise differences can be reproduced.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive evaluation of our manuscript and the recommendation of minor revision. The referee's summary correctly reflects the core contributions of Poison-3DGS as a stage-wise benchmark for poisoning detectability in 3D Gaussian Splatting. No major comments were provided in the report.

Circularity Check

0 steps flagged

Empirical benchmark study with no circular derivations

full rationale

This is an empirical benchmark paper that introduces Poison-3DGS and reports experimental observations on stage-wise detectability. The central claims consist of three factual findings from new experiments (detectability varies by stage, attacks produce distinct signals, later stages yield unique cues). No equations, fitted parameters, predictions, or self-citation chains are present in the provided text. The stage-dependence follows directly from the documented multi-stage structure of 3DGS pipelines and the new benchmark data, with no reduction to prior fitted quantities or self-referential definitions.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The paper rests on the domain assumption of a multi-stage 3DGS pipeline with stage-dependent signals but introduces no free parameters, new physical entities, or ad-hoc axioms beyond standard computer vision pipeline knowledge.

axioms (1)
  • domain assumption The 3DGS reconstruction pipeline produces heterogeneous intermediate representations at different stages, making forensic signals stage dependent.
    This premise is invoked in the abstract to motivate the stage-wise benchmark and analysis.

pith-pipeline@v0.9.1-grok · 5854 in / 1324 out tokens · 37067 ms · 2026-06-28T10:25:49.114333+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

42 extracted references · 2 canonical work pages

  1. [1]

    Met3r: Measuring multi-view consistency in generated images

    Mohammad Asim, Christopher Wewer, Thomas Wimmer, Bernt Schiele, and Jan Eric Lenssen. Met3r: Measuring multi-view consistency in generated images. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 6034–6044, June 2025

  2. [2]

    Mip- nerf 360: Unbounded anti-aliased neural radiance fields

    Jonathan T Barron, Ben Mildenhall, Dor Verbin, Pratul P Srinivasan, and Peter Hedman. Mip- nerf 360: Unbounded anti-aliased neural radiance fields. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 5470–5479, 2022

  3. [3]

    A survey on 3d gaussian splatting.ACM Computing Surveys, 2024

    Guikun Chen and Wenguan Wang. A survey on 3d gaussian splatting.ACM Computing Surveys, 2024

  4. [4]

    Spectral defense against resource-targeting attack in 3d gaussian splatting.arXiv preprint arXiv:2603.12796, 2026

    Yang Chen, Yi Yu, Jiaming He, Yueqi Duan, Zheng Zhu, and Yap-Peng Tan. Spectral defense against resource-targeting attack in 3d gaussian splatting.arXiv preprint arXiv:2603.12796, 2026

  5. [5]

    Gaussianeditor: Swift and controllable 3d editing with gaussian splatting

    Yiwen Chen, Zilong Chen, Chi Zhang, Feng Wang, Xiaofeng Yang, Yikai Wang, Zhongang Cai, Lei Yang, Huaping Liu, and Guosheng Lin. Gaussianeditor: Swift and controllable 3d editing with gaussian splatting. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 21476–21485, 2024

  6. [6]

    Splatformer: Point transformer for robust 3d gaussian splatting, 2025

    Yutong Chen, Marko Mihajlovic, Xiyi Chen, Yiming Wang, Sergey Prokudin, and Siyu Tang. Splatformer: Point transformer for robust 3d gaussian splatting, 2025

  7. [7]

    Guardsplat: efficient and robust watermarking for 3d gaussian splatting

    Zixuan Chen, Guangcong Wang, Jiahao Zhu, Jianhuang Lai, and Xiaohua Xie. Guardsplat: efficient and robust watermarking for 3d gaussian splatting. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 16325–16335, 2025

  8. [8]

    Mvss-net: Multi-view multi-scale supervised networks for image manipulation detection.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(3):3539–3553, 2022

    Chengbo Dong, Xinru Chen, Ruohan Hu, Juan Cao, and Xirong Li. Mvss-net: Multi-view multi-scale supervised networks for image manipulation detection.IEEE Transactions on Pattern Analysis and Machine Intelligence, 45(3):3539–3553, 2022

  9. [9]

    A baseline for detecting misclassified and out-of-distribution examples in neural networks.arXiv preprint arXiv:1610.02136, 2016

    Dan Hendrycks and Kevin Gimpel. A baseline for detecting misclassified and out-of-distribution examples in neural networks.arXiv preprint arXiv:1610.02136, 2016

  10. [10]

    Gausstrap: Stealthy poisoning attacks on 3d gaussian splatting for targeted scene confusion, 2025

    Jiaxin Hong, Sixu Chen, Shuoyang Sun, Hongyao Yu, Hao Fang, Yuqi Tan, Bin Chen, Shuhan Qi, and Jiawei Li. Gausstrap: Stealthy poisoning attacks on 3d gaussian splatting for targeted scene confusion, 2025. URLhttps://arxiv.org/abs/2504.20829

  11. [11]

    Deep learning advancements in anomaly detection: A comprehensive survey.IEEE Internet of Things Journal, 2025

    Haoqi Huang, Ping Wang, Jianhua Pei, Jiacheng Wang, Shahen Alexanian, and Dusit Niyato. Deep learning advancements in anomaly detection: A comprehensive survey.IEEE Internet of Things Journal, 2025

  12. [12]

    On the importance of gradients for detecting distributional shifts in the wild

    Rui Huang, Andrew Geng, and Yixuan Li. On the importance of gradients for detecting distributional shifts in the wild. InProceedings of the 35th International Conference on Neural Information Processing Systems, NIPS ’21, Red Hook, NY , USA, 2021. Curran Associates Inc. ISBN 9781713845393

  13. [13]

    3d-gsw: 3d gaussian splatting for robust watermarking

    Youngdong Jang, Hyunje Park, Feng Yang, Heeju Ko, Euijin Choo, and Sangpil Kim. 3d-gsw: 3d gaussian splatting for robust watermarking. InProceedings of the Computer Vision and Pattern Recognition Conference, pages 5938–5948, 2025. 10

  14. [14]

    Unravelling digital forgeries: A systematic survey on image manipulation detection and localization.ACM Computing Surveys, 57(12):1–36, 2025

    VijayaKumar Kadha, Sambit Bakshi, and Santos Kumar Das. Unravelling digital forgeries: A systematic survey on image manipulation detection and localization.ACM Computing Surveys, 57(12):1–36, 2025

  15. [15]

    Stealthattack: Robust 3d gaussian splatting poisoning via density-guided illusions

    Bo-Hsu Ke, You-Zhe Xie, Yu-Lun Liu, and Wei-Chen Chiu. Stealthattack: Robust 3d gaussian splatting poisoning via density-guided illusions. InProceedings of the IEEE/CVF International Conference on Computer Vision, pages 27400–27411, 2025

  16. [16]

    3d gaussian splatting for real-time radiance field rendering.ACM Trans

    Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, George Drettakis, et al. 3d gaussian splatting for real-time radiance field rendering.ACM Trans. Graph., 42(4):139–1, 2023

  17. [17]

    Tanks and temples: Bench- marking large-scale scene reconstruction.ACM Transactions on Graphics (ToG), 36(4):1–13, 2017

    Arno Knapitsch, Jaesik Park, Qian-Yi Zhou, and Vladlen Koltun. Tanks and temples: Bench- marking large-scale scene reconstruction.ACM Transactions on Graphics (ToG), 36(4):1–13, 2017

  18. [18]

    A simple unified framework for detecting out-of-distribution samples and adversarial attacks.Advances in neural information processing systems, 31, 2018

    Kimin Lee, Kibok Lee, Honglak Lee, and Jinwoo Shin. A simple unified framework for detecting out-of-distribution samples and adversarial attacks.Advances in neural information processing systems, 31, 2018

  19. [19]

    Remedygs: Defend 3d gaussian splatting against computation cost attacks.arXiv preprint arXiv:2511.22147, 2025

    Yanping Li, Zhening Liu, Zijian Li, Zehong Lin, and Jun Zhang. Remedygs: Defend 3d gaussian splatting against computation cost attacks.arXiv preprint arXiv:2511.22147, 2025

  20. [20]

    Backdoor learning: A survey, 2022

    Yiming Li, Yong Jiang, Zhifeng Li, and Shu-Tao Xia. Backdoor learning: A survey, 2022. URL https://arxiv.org/abs/2007.08745

  21. [21]

    Oswald, and Danda Pani Paudel

    Yue Li, Qi Ma, Runyi Yang, Huapeng Li, Mengjiao Ma, Bin Ren, Nikola Popovic, Nicu Sebe, Ender Konukoglu, Theo Gevers, Luc Van Gool, Martin R. Oswald, and Danda Pani Paudel. Scenesplat: Gaussian splatting-based scene understanding with vision-language pretraining. In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), pages 4961...

  22. [22]

    Isolation forest

    Fei Tony Liu, Kai Ming Ting, and Zhi-Hua Zhou. Isolation forest. In2008 eighth ieee international conference on data mining, pages 413–422. IEEE, 2008

  23. [23]

    Multi-view feature extraction via tunable prompts is enough for image manipulation localization

    Xuntao Liu, Yuzhou Yang, Haoyue Wang, Qichao Ying, Zhenxing Qian, Xinpeng Zhang, and Sheng Li. Multi-view feature extraction via tunable prompts is enough for image manipulation localization. InProceedings of the 32nd ACM International Conference on Multimedia, pages 9999–10007, 2024

  24. [24]

    Poison-splat: Computation cost attack on 3d gaussian splatting.arXiv preprint arXiv:2410.08190, 2024

    Jiahao Lu, Yifan Zhang, Qiuhong Shen, Xinchao Wang, and Shuicheng Yan. Poison-splat: Computation cost attack on 3d gaussian splatting.arXiv preprint arXiv:2410.08190, 2024

  25. [25]

    A large-scale dataset of gaussian splats and their self-supervised pretraining

    Qi Ma, Yue Li, Bin Ren, Nicu Sebe, Ender Konukoglu, Theo Gevers, Luc Van Gool, and Danda Pani Paudel. A large-scale dataset of gaussian splats and their self-supervised pretraining. In2025 International Conference on 3D Vision (3DV), pages 145–155. IEEE, 2025

  26. [26]

    Al Hammadi, and Jizhe Zhou

    Xiaochen Ma, Bo Du, Zhuohang Jiang, Xia Du, Ahmed Y . Al Hammadi, and Jizhe Zhou. Iml-vit: Benchmarking image manipulation localization by vision transformer, 2024. URL https://arxiv.org/abs/2307.14863

  27. [27]

    Gaussian splatting slam

    Hidenobu Matsuki, Riku Murai, Paul HJ Kelly, and Andrew J Davison. Gaussian splatting slam. InProceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 18039–18048, 2024

  28. [28]

    Structure-from-motion revisited

    Johannes Lutz Schönberger and Jan-Michael Frahm. Structure-from-motion revisited. In Conference on Computer Vision and Pattern Recognition (CVPR), 2016

  29. [29]

    Can we get rid of handcrafted feature extractors? sparsevit: Nonsemantics-centered, parameter-efficient image manipulation localization through spare-coding transformer

    Lei Su, Xiaochen Ma, Xuekang Zhu, Chaoqun Niu, Zeyu Lei, and Ji-Zhe Zhou. Can we get rid of handcrafted feature extractors? sparsevit: Nonsemantics-centered, parameter-efficient image manipulation localization through spare-coding transformer. InProceedings of the AAAI conference on artificial intelligence, volume 39, pages 7024–7032, 2025. 11

  30. [30]

    Mv-dust3r+: Single-stage scene reconstruction from sparse views in 2 seconds

    Zhenggang Tang, Yuchen Fan, Dilin Wang, Hongyu Xu, Rakesh Ranjan, Alexander Schwing, and Zhicheng Yan. Mv-dust3r+: Single-stage scene reconstruction from sparse views in 2 seconds. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5283–5293, June 2025

  31. [31]

    Spectral signatures in backdoor attacks

    Brandon Tran, Jerry Li, and Aleksander Madry. Spectral signatures in backdoor attacks. Advances in neural information processing systems, 31, 2018

  32. [32]

    Softpatch+: Fully unsupervised anomaly classification and segmentation.Pattern Recognition, 161:111295, 2025

    Chengjie Wang, Xi Jiang, Bin-Bin Gao, Zhenye Gan, Yong Liu, Feng Zheng, and Lizhuang Ma. Softpatch+: Fully unsupervised anomaly classification and segmentation.Pattern Recognition, 161:111295, 2025. ISSN 0031-3203. doi: https://doi.org/10.1016/j.patcog.2024.111295. URL https://www.sciencedirect.com/science/article/pii/S003132032401046X

  33. [33]

    Vggt: Visual geometry grounded transformer

    Jianyuan Wang, Minghao Chen, Nikita Karaev, Andrea Vedaldi, Christian Rupprecht, and David Novotny. Vggt: Visual geometry grounded transformer. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5294–5306, June 2025

  34. [34]

    F2-nerf: Fast neural radiance field training with free camera trajectories

    Peng Wang, Yuan Liu, Zhaoxi Chen, Lingjie Liu, Ziwei Liu, Taku Komura, Christian Theobalt, and Wenping Wang. F2-nerf: Fast neural radiance field training with free camera trajectories. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 4150–4159, 2023

  35. [35]

    Threats to training: A survey of poisoning attacks and defenses on machine learning systems.ACM Comput

    Zhibo Wang, Jingjing Ma, Xue Wang, Jiahui Hu, Zhan Qin, and Kui Ren. Threats to training: A survey of poisoning attacks and defenses on machine learning systems.ACM Comput. Surv., 55(7), December 2022. ISSN 0360-0300. doi: 10.1145/3538707. URL https://doi.org/ 10.1145/3538707

  36. [36]

    Sonata: Self-supervised learning of reliable point representations

    Xiaoyang Wu, Daniel DeTone, Duncan Frost, Tianwei Shen, Chris Xie, Nan Yang, Jakob Engel, Richard Newcombe, Hengshuang Zhao, and Julian Straub. Sonata: Self-supervised learning of reliable point representations. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 22193–22204, June 2025

  37. [37]

    Openood: Benchmarking generalized out-of- distribution detection.Advances in Neural Information Processing Systems, 35:32598–32611, 2022

    Jingkang Yang, Pengyun Wang, Dejian Zou, Zitang Zhou, Kunyuan Ding, Wenxuan Peng, Haoqi Wang, Guangyao Chen, Bo Li, Yiyou Sun, et al. Openood: Benchmarking generalized out-of- distribution detection.Advances in Neural Information Processing Systems, 35:32598–32611, 2022

  38. [38]

    No pose, no problem: Surprisingly simple 3d gaussian splats from sparse unposed images,

    Botao Ye, Sifei Liu, Haofei Xu, Xueting Li, Marc Pollefeys, Ming-Hsuan Yang, and Songyou Peng. No pose, no problem: Surprisingly simple 3d gaussian splats from sparse unposed images,

  39. [39]

    URLhttps://arxiv.org/abs/2410.24207

  40. [40]

    Splatloc: 3d gaussian splatting-based visual localization for augmented reality

    Hongjia Zhai, Xiyu Zhang, Boming Zhao, Hai Li, Yijia He, Zhaopeng Cui, Hujun Bao, and Guofeng Zhang. Splatloc: 3d gaussian splatting-based visual localization for augmented reality. IEEE Transactions on Visualization and Computer Graphics, 2025

  41. [41]

    Concerto: Joint 2d-3d self-supervised learning emerges spatial representa- tions

    Yujia Zhang, Xiaoyang Wu, Yixing Lao, Chengyao Wang, Zhuotao Tian, Naiyan Wang, and Hengshuang Zhao. Concerto: Joint 2d-3d self-supervised learning emerges spatial representa- tions. InNeurIPS, 2025

  42. [42]

    R3d-ad: Reconstruction via diffusion for 3d anomaly detection

    Zheyuan Zhou, Le Wang, Naiyu Fang, Zili Wang, Lemiao Qiu, and Shuyou Zhang. R3d-ad: Reconstruction via diffusion for 3d anomaly detection. InEuropean conference on computer vision, pages 91–107. Springer, 2024. 12