Scalable Topological Data Analysis and Visualization for Evaluating Data-Driven Models in Scientific Applications

Brian C. Van Essen; Brian K. Spears; Dan Maljovec; David Hysom; Di Wang; Harsh Bhatia; Jae-Seung Yeom; Jayaraman J. Thiagarajan; Jim Gaffney; Luc Peterson

arxiv: 1907.08325 · v1 · pith:2LH5DMVInew · submitted 2019-07-19 · 💻 cs.LG · cs.HC· cs.NE· stat.ML

Scalable Topological Data Analysis and Visualization for Evaluating Data-Driven Models in Scientific Applications

Shusen Liu , Di Wang , Dan Maljovec , Rushil Anirudh , Jayaraman J. Thiagarajan , Sam Ade Jacobs , Brian C. Van Essen , David Hysom

show 8 more authors

Jae-Seung Yeom Jim Gaffney Luc Peterson Peter B. Robinson Harsh Bhatia Valerio Pascucci Brian K. Spears Peer-Timo Bremer

This is my paper

Pith reviewed 2026-05-24 19:25 UTC · model grok-4.3

classification 💻 cs.LG cs.HCcs.NEstat.ML

keywords topological data analysisscalable visualizationhigh-dimensional functionsmachine learning interpretabilitystreaming graphstopology-aware datacubesscientific data analysis

0 comments

The pith

A combination of streaming neighborhood graph construction, topology computation, and topology-aware datacubes enables the first scalable interactive exploration of high-dimensional functions in scientific data.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a method to handle large-scale high-dimensional data from machine learning models in science and engineering. It combines a streaming approach to build neighborhood graphs, computes the corresponding topology, and aggregates data using topology-aware datacubes. This setup supports interactive views of both topological structures and geometric properties at scales with millions of samples. Demonstrations on high-energy-density physics and computational biology datasets illustrate how the approach yields new insights into model behaviors. The work targets the gap between existing interpretability tools, which do not scale, and the needs of scientists working with black-box models on enormous datasets.

Core claim

The authors present the first scalable solution to explore and analyze high-dimensional functions often encountered in the scientific data analysis pipeline. By combining a new streaming neighborhood graph construction, the corresponding topology computation, and a novel data aggregation scheme, namely topology aware datacubes, we enable interactive exploration of both the topological and the geometric aspect of high-dimensional data. Following two use cases from high-energy-density (HED) physics and computational biology, we demonstrate how these capabilities have led to crucial new insights in both applications.

What carries the argument

Streaming neighborhood graph construction together with topology computation and topology-aware datacubes for aggregation.

If this is right

Supports analysis of datasets containing millions of samples instead of being limited to thousands.
Yields interactive exploration of topological and geometric aspects simultaneously.
Produces new insights into black-box model behaviors for high-energy-density physics applications.
Produces new insights into black-box model behaviors for computational biology applications.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same pipeline could be tested on other domains with high-dimensional model outputs, such as climate simulation or materials science.
The aggregation scheme might be adapted to support incremental updates when new model predictions arrive over time.
Comparison against non-topological aggregation methods on the same use-case datasets would quantify the added value of the topology preservation step.

Load-bearing premise

The streaming neighborhood graph construction and topology computation preserve the relevant topological features of the underlying high-dimensional functions without significant distortion or loss for the target scientific applications.

What would settle it

A side-by-side computation on the same high-energy-density physics dataset showing that the topological features recovered from the streaming graph differ substantially from those obtained by exhaustive neighborhood construction on the full data.

Figures

Figures reproduced from arXiv: 1907.08325 by Brian C. Van Essen, Brian K. Spears, Dan Maljovec, David Hysom, Di Wang, Harsh Bhatia, Jae-Seung Yeom, Jayaraman J. Thiagarajan, Jim Gaffney, Luc Peterson, Peer-Timo Bremer, Peter B. Robinson, Rushil Anirudh, Sam Ade Jacobs, Shusen Liu, Valerio Pascucci.

**Figure 2.** Figure 2: The proposed visualization interface consists of three views: [PITH_FULL_IMAGE:figures/full_fig_p004_2.png] view at source ↗

**Figure 3.** Figure 3: Topological spine. A 2D terrain metaphor of high-dimensional [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 5.** Figure 5: System diagram of the deep learning-based surrogate modeling [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗

**Figure 4.** Figure 4: Inertial confinement fusion (ICF). Lasers heat and compress the [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 6.** Figure 6: Joint exploration of both topological and geometric characteristics of the surrogate’s errors as functions in the input parameter space. In (a1), [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗

**Figure 7.** Figure 7: The autoencoder error (R(l)) and the latent space error (Flat(x)) is shown in (a),(b) respectively. The yield of the simulation is shown in (c). In (d), we illustrate the latent space error (Flat(x)) of the model that are trained for 80 epochs instead of 40. of the previously explored error components. This interesting discovery could have a significant impact on the application since the physicists are in… view at source ↗

**Figure 8.** Figure 8: X-ray images with different energy and error conditions. [PITH_FULL_IMAGE:figures/full_fig_p007_8.png] view at source ↗

**Figure 9.** Figure 9: The comparison of cyclic loss (a measure of surrogate self [PITH_FULL_IMAGE:figures/full_fig_p007_9.png] view at source ↗

**Figure 10.** Figure 10: Multiscale simulation of RAS-membrane biology for cancer [PITH_FULL_IMAGE:figures/full_fig_p008_10.png] view at source ↗

**Figure 11.** Figure 11: Visualization of different sampling patterns (middle and bottom) when compared to the overall distribution (top) highlights that the adaptive [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗

read the original abstract

With the rapid adoption of machine learning techniques for large-scale applications in science and engineering comes the convergence of two grand challenges in visualization. First, the utilization of black box models (e.g., deep neural networks) calls for advanced techniques in exploring and interpreting model behaviors. Second, the rapid growth in computing has produced enormous datasets that require techniques that can handle millions or more samples. Although some solutions to these interpretability challenges have been proposed, they typically do not scale beyond thousands of samples, nor do they provide the high-level intuition scientists are looking for. Here, we present the first scalable solution to explore and analyze high-dimensional functions often encountered in the scientific data analysis pipeline. By combining a new streaming neighborhood graph construction, the corresponding topology computation, and a novel data aggregation scheme, namely topology aware datacubes, we enable interactive exploration of both the topological and the geometric aspect of high-dimensional data. Following two use cases from high-energy-density (HED) physics and computational biology, we demonstrate how these capabilities have led to crucial new insights in both applications.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a practical streaming TDA pipeline for large scientific ML datasets via neighborhood graphs and topology-aware datacubes, but rests on qualitative use cases without quantitative checks on topological preservation.

read the letter

This paper introduces a streaming neighborhood graph method combined with topology computation and topology-aware datacubes to make topological data analysis scalable for large scientific datasets. The main takeaway is that it claims to be the first such solution for interactive exploration of high-dimensional functions from ML models, demonstrated on physics and biology cases, but without quantitative checks on whether the approximations preserve the topology accurately.

Referee Report

2 major / 1 minor

Summary. The paper claims to present the first scalable solution to explore and analyze high-dimensional functions in scientific data analysis pipelines. It combines a new streaming neighborhood graph construction, the corresponding topology computation, and a novel data aggregation scheme called topology aware datacubes to enable interactive exploration of both topological and geometric aspects of high-dimensional data from machine learning models. The approach is demonstrated on two use cases from high-energy-density physics and computational biology, where it purportedly leads to crucial new insights.

Significance. If the central claims hold, the work would be significant for enabling TDA-based interpretability on datasets with millions of samples, where prior methods are limited to thousands. The combination of streaming graph construction with topology-aware aggregation addresses a practical bottleneck in applying persistent homology to large scientific ML outputs.

major comments (2)

[Abstract] Abstract: the claim that the streaming neighborhood graph construction plus topology computation 'preserve the relevant topological features of the underlying high-dimensional functions without significant distortion' is load-bearing for the scalability and insight claims, yet the manuscript provides no quantitative validation such as bottleneck or Wasserstein distances between persistence diagrams computed via the streaming pipeline versus an exact batch baseline on data with known ground-truth topology.
[Use cases] Use cases section: the demonstrations are stated to produce 'crucial new insights,' but the description supplies no quantitative metrics, error bars, ablation studies, or comparisons against non-topological baselines or exact TDA methods, making it impossible to evaluate whether the insights are robust or attributable to the proposed pipeline.

minor comments (1)

[Abstract] The novel term 'topology aware datacubes' is introduced without a formal definition or pseudocode in the abstract; a concise mathematical characterization would improve clarity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments. We address each major point below and will revise the manuscript to incorporate additional quantitative validation.

read point-by-point responses

Referee: [Abstract] Abstract: the claim that the streaming neighborhood graph construction plus topology computation 'preserve the relevant topological features of the underlying high-dimensional functions without significant distortion' is load-bearing for the scalability and insight claims, yet the manuscript provides no quantitative validation such as bottleneck or Wasserstein distances between persistence diagrams computed via the streaming pipeline versus an exact batch baseline on data with known ground-truth topology.

Authors: We agree that the abstract claim regarding topological preservation would be strengthened by explicit quantitative validation. The manuscript prioritizes demonstrating scalability on large scientific datasets and the resulting domain insights, but does not report bottleneck or Wasserstein distances against exact baselines. In the revision we will add a dedicated evaluation subsection using synthetic data with known ground-truth topology, reporting these distances to quantify any distortion introduced by the streaming pipeline. revision: yes
Referee: [Use cases] Use cases section: the demonstrations are stated to produce 'crucial new insights,' but the description supplies no quantitative metrics, error bars, ablation studies, or comparisons against non-topological baselines or exact TDA methods, making it impossible to evaluate whether the insights are robust or attributable to the proposed pipeline.

Authors: We acknowledge that the use-case descriptions would benefit from quantitative support. The current text focuses on the qualitative discoveries enabled by interactive topological exploration; however, we agree that metrics, ablations, and baseline comparisons would make the attribution of insights more rigorous. The revised manuscript will expand the use-case sections with such quantitative elements, including comparisons to non-topological aggregation methods and, where feasible, to exact TDA on subsampled data. revision: yes

Circularity Check

0 steps flagged

No circularity: new algorithmic pipeline presented without self-referential reductions

full rationale

The paper introduces a streaming neighborhood graph construction, corresponding topology computation, and topology-aware datacubes as a scalable TDA pipeline for high-dimensional scientific data. No equations, fitted parameters, or predictions appear in the provided text that reduce by construction to the method's own inputs; the claims rest on the engineering novelty of these components and qualitative use-case demonstrations rather than any derivation chain. The central assumption of topological feature preservation is stated as an empirical requirement for the target applications but is not justified via self-definition, self-citation load-bearing, or renaming of known results. The work is therefore self-contained as a methods contribution.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

Abstract provides no explicit free parameters, axioms, or invented entities beyond naming the new datacube scheme; limited information available.

axioms (1)

domain assumption Topological data analysis reveals meaningful structure in high-dimensional scientific data from ML models
Implicit in the use of TDA for model evaluation in the described applications

invented entities (1)

topology aware datacubes no independent evidence
purpose: Novel data aggregation scheme that respects topology for interactive exploration
Introduced as a core component of the scalable solution

pith-pipeline@v0.9.0 · 5796 in / 1222 out tokens · 20993 ms · 2026-05-24T19:25:30.658080+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

39 extracted references · 39 canonical work pages · 3 internal anchors

[1]

https://github.com/ rushilanirudh/icf-jag-cycleGAN

Jag icf dataset for scientiﬁc machine learning. https://github.com/ rushilanirudh/icf-jag-cycleGAN . Accessed: 2019-07-15

work page 2019
[2]

T. W. Anderson. An introduction to multivariate statistical analysis, vol. 2. Wiley New York, 1958

work page 1958
[3]

A. O. Artero, M. C. F. de Oliveira, and H. Levkowitz. Uncovering clusters in crowded parallel coordinates visualizations. In —, pp. 81–88. IEEE, 9 2004

work page 2004
[4]

Baldi, S

P. Baldi, S. Brunak, and F. Bach. Bioinformatics: the machine learning approach. MIT press, 2001

work page 2001
[5]

J. C. Bennett, H. Abbasi, P.-T. Bremer, R. Grout, A. Gyulassy, T. Jin, S. Klasky, H. Kolla, M. Parashar, V . Pascucci, et al. Combining in- situ and in-transit processing to enable extreme-scale scientiﬁc analysis. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis , p. 49. IEEE Computer Society ...

work page 2012
[6]

Bremer, D

P.-T. Bremer, D. Maljovec, A. Saha, B. Wang, J. Gaffney, B. K. Spears, and V . Pascucci. Nddav: N-dimensional data analysis and visualization analysis for the national ignition campaign. Computing and Visualization in Science, 17(1):1–18, 2015

work page 2015
[7]

Bremer, V

P.-T. Bremer, V . Pascucci, and B. Hamann. Maximizing adaptivity in hierarchical topological models using cancellation trees. In T. Moeller, B. Hamann, and B. Russell, eds., Mathematical Foundations of Scientiﬁc Visualization, Computer Graphics, and Massive Data Exploration, p. to appear. Springer, 2006

work page 2006
[8]

K. T. Butler, D. W. Davies, H. Cartwright, O. Isayev, and A. Walsh. Ma- chine learning for molecular and materials science.Nature, 559(7715):547, 2018

work page 2018
[9]

Correa and P

C. Correa and P. Lindstrom. Towards robust topology of sparsely sam- pled data. IEEE Transactions on Visualization and Computer Graphics, 17(12):1852–1861, Dec. 2011. doi: 10.1109/TVCG.2011.245

work page doi:10.1109/tvcg.2011.245 2011
[10]

Correa, P

C. Correa, P. Lindstrom, and P.-T. Bremer. Topological spines: A structure- preserving visual representation of scalar ﬁelds. IEEE Transactions on Visualization and Computer Graphics, 17(12):1842–1851, Dec. 2011. doi: 10.1109/TVCG.2011.244

work page doi:10.1109/tvcg.2011.244 2011
[11]

T. N. Dang, L. Wilkinson, and A. Anand. Stacking graphic elements to avoid over-plotting. IEEE Transactions on Visualization and Computer Graphics, 16(6):1044–1052, 2010

work page 2010
[12]

Gaffney, P

J. Gaffney, P. Springer, and G. Collins. Thermodynamic modeling of uncertainties in nif icf implosions due to underlying microphysics models. In APS Meeting Abstracts, 2014

work page 2014
[13]

Gerber, P.-T

S. Gerber, P.-T. Bremer, V . Pascucci, and R. Whitaker. Visual exploration of high dimensional scalar functions. IEEE transactions on visualization and computer graphics, 16(6):1271, 2010

work page 2010
[14]

Gerber, P.-T

S. Gerber, P.-T. Bremer, V . Pascucci, and R. Whitaker. Visual exploration of high dimensional scalar functions. IEEE Transactions on Visualization and Computer Graphics, 16(6):1271–1280, 2010

work page 2010
[15]

Gyulassy, P.-T

A. Gyulassy, P.-T. Bremer, B. Hamann, and V . Pascucci. A practical approach to morse-smale complex computation: Scalability and generality. IEEE Transactions on Visualization and Computer Graphics, 14(6), 2008

work page 2008
[16]

Gyulassy and V

A. Gyulassy and V . Natarajan. Topology-based simpliﬁcation for feature extraction from 3d scalar ﬁelds. In Visualization, 2005. VIS 05. IEEE, pp. 535–542. IEEE, 2005

work page 2005
[17]

Krause, A

J. Krause, A. Perer, and K. Ng. Interacting with predictions: Visual inspection of black-box machine learning models. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pp. 5686–5697. ACM, 2016

work page 2016
[18]

A. G. Landge, V . Pascucci, A. Gyulassy, J. C. Bennett, H. Kolla, J. Chen, and P.-T. Bremer. In-situ feature extraction of large scale combustion sim- ulations using segmented merge trees. In High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for, pp. 1020–1031. IEEE, 2014

work page 2014
[19]

L. Lins, J. T. Klosowski, and C. Scheidegger. Nanocubes for real-time exploration of spatiotemporal datasets.IEEE Transactions on Visualization and Computer Graphics, 19(12):2456–2465, 2013

work page 2013
[20]

M. Liu, S. Liu, H. Su, K. Cao, and J. Zhu. Analyzing the noise robustness of deep neural networks. arXiv preprint arXiv:1810.03913, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018
[21]

S. Liu, K. Humbird, L. Peterson, J. Thiagarajan, B. Spears, and P.-T. Bremer. Topology-driven analysis and exploration of high-dimensional models. In Research Challenges and Opportunities at the interface of Machine Learning and Uncertainty Quantiﬁcation, 2018

work page 2018
[22]

S. Liu, Z. Li, T. Li, V . Srikumar, V . Pascucci, and P.-T. Bremer. Nlize: A perturbation-driven visual interrogation tool for analyzing and interpreting natural language inference models. IEEE transactions on visualization and computer graphics, 25(1):651–660, 2019

work page 2019
[23]

Z. Liu, B. Jiang, and J. Heer. immens: Real-time visual querying of big data. In Computer Graphics Forum, vol. 32, pp. 421–430. Wiley Online Library, 2013

work page 2013
[24]

S. M. Lundberg and S.-I. Lee. A uniﬁed approach to interpreting model predictions. In Advances in Neural Information Processing Systems, pp. 4768–4777, 2017

work page 2017
[25]

Mayorga and M

A. Mayorga and M. Gleicher. Splatterplots: Overcoming overdraw in scatter plots. IEEE transactions on visualization and computer graphics, 19(9):1526–1538, 2013

work page 2013
[26]

Y . Ming, H. Qu, and E. Bertini. Rulematrix: Visualizing and understanding classiﬁers with rules. IEEE transactions on visualization and computer graphics, 25(1):342–352, 2019

work page 2019
[27]

Mjolsness and D

E. Mjolsness and D. DeCoste. Machine learning for science: state of the art and future prospects. science, 293(5537):2051–2055, 2001

work page 2051
[28]

C. Olah, A. Mordvintsev, and L. Schubert. Feature visualization. Distill,

work page
[29]

doi: 10.23915/distill

https://distill.pub/2017/feature-visualization. doi: 10.23915/distill. 00007

work page doi:10.23915/distill 2017
[30]

Peterson, K

J. Peterson, K. Humbird, J. Field, S. Brandon, S. Langer, R. Nora, B. Spears, and P. Springer. Zonal ﬂow generation in inertial conﬁne- ment fusion implosions. Physics of Plasmas, 24(3):032702, 2017

work page 2017
[31]

M. T. Ribeiro, S. Singh, and C. Guestrin. Why should i trust you?: Explaining the predictions of any classiﬁer. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144. ACM, 2016

work page 2016
[32]

Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps

K. Simonyan, A. Vedaldi, and A. Zisserman. Deep inside convolutional networks: Visualising image classiﬁcation models and saliency maps. arXiv preprint arXiv:1312.6034, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013
[33]

Springer, C

P. Springer, C. Cerjan, R. Betti, J. Caggiano, M. Edwards, J. Frenje, V . Y . Glebov, S. Glenzer, S. Glenn, N. Izumi, et al. Integrated thermodynamic model for ignition target performance. In EPJ Web of Conferences, vol. 59, p. 04001. EDP Sciences, 2013

work page 2013
[34]

D. M. Thomas and V . Natarajan. Detecting symmetry in scalar ﬁelds using augmented extremum graphs. IEEE Transactions on Visualization and Computer Graphics, 19(12):2663–2672, Dec 2013. doi: 10.1109/TVCG. 2013.148

work page doi:10.1109/tvcg 2013
[35]

Tolstikhin, O

I. Tolstikhin, O. Bousquet, S. Gelly, and B. Schoelkopf. Wasserstein auto-encoders. arXiv preprint arXiv:1711.01558, 2017

work page arXiv 2017
[36]

Van Essen, H

B. Van Essen, H. Kim, R. Pearce, K. Boakye, and B. Chen. Lbann: Livermore big artiﬁcial neural network hpc toolkit. In Proceedings of the Workshop on Machine Learning in High-Performance Computing Environments, p. 5. ACM, 2015

work page 2015
[37]

J. Wang, L. Gou, H.-W. Shen, and H. Yang. Dqnviz: A visual analytics ap- proach to understand deep q-networks. IEEE transactions on visualization and computer graphics, 25(1):288–298, 2019

work page 2019
[38]

Understanding Neural Networks Through Deep Visualization

J. Yosinski, J. Clune, A. Nguyen, T. Fuchs, and H. Lipson. Under- standing neural networks through deep visualization. arXiv preprint arXiv:1506.06579, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015
[39]

M. D. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. In European conference on computer vision , pp. 818–833. Springer, 2014. 10

work page 2014

[1] [1]

https://github.com/ rushilanirudh/icf-jag-cycleGAN

Jag icf dataset for scientiﬁc machine learning. https://github.com/ rushilanirudh/icf-jag-cycleGAN . Accessed: 2019-07-15

work page 2019

[2] [2]

T. W. Anderson. An introduction to multivariate statistical analysis, vol. 2. Wiley New York, 1958

work page 1958

[3] [3]

A. O. Artero, M. C. F. de Oliveira, and H. Levkowitz. Uncovering clusters in crowded parallel coordinates visualizations. In —, pp. 81–88. IEEE, 9 2004

work page 2004

[4] [4]

Baldi, S

P. Baldi, S. Brunak, and F. Bach. Bioinformatics: the machine learning approach. MIT press, 2001

work page 2001

[5] [5]

J. C. Bennett, H. Abbasi, P.-T. Bremer, R. Grout, A. Gyulassy, T. Jin, S. Klasky, H. Kolla, M. Parashar, V . Pascucci, et al. Combining in- situ and in-transit processing to enable extreme-scale scientiﬁc analysis. In Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis , p. 49. IEEE Computer Society ...

work page 2012

[6] [6]

Bremer, D

P.-T. Bremer, D. Maljovec, A. Saha, B. Wang, J. Gaffney, B. K. Spears, and V . Pascucci. Nddav: N-dimensional data analysis and visualization analysis for the national ignition campaign. Computing and Visualization in Science, 17(1):1–18, 2015

work page 2015

[7] [7]

Bremer, V

P.-T. Bremer, V . Pascucci, and B. Hamann. Maximizing adaptivity in hierarchical topological models using cancellation trees. In T. Moeller, B. Hamann, and B. Russell, eds., Mathematical Foundations of Scientiﬁc Visualization, Computer Graphics, and Massive Data Exploration, p. to appear. Springer, 2006

work page 2006

[8] [8]

K. T. Butler, D. W. Davies, H. Cartwright, O. Isayev, and A. Walsh. Ma- chine learning for molecular and materials science.Nature, 559(7715):547, 2018

work page 2018

[9] [9]

Correa and P

C. Correa and P. Lindstrom. Towards robust topology of sparsely sam- pled data. IEEE Transactions on Visualization and Computer Graphics, 17(12):1852–1861, Dec. 2011. doi: 10.1109/TVCG.2011.245

work page doi:10.1109/tvcg.2011.245 2011

[10] [10]

Correa, P

C. Correa, P. Lindstrom, and P.-T. Bremer. Topological spines: A structure- preserving visual representation of scalar ﬁelds. IEEE Transactions on Visualization and Computer Graphics, 17(12):1842–1851, Dec. 2011. doi: 10.1109/TVCG.2011.244

work page doi:10.1109/tvcg.2011.244 2011

[11] [11]

T. N. Dang, L. Wilkinson, and A. Anand. Stacking graphic elements to avoid over-plotting. IEEE Transactions on Visualization and Computer Graphics, 16(6):1044–1052, 2010

work page 2010

[12] [12]

Gaffney, P

J. Gaffney, P. Springer, and G. Collins. Thermodynamic modeling of uncertainties in nif icf implosions due to underlying microphysics models. In APS Meeting Abstracts, 2014

work page 2014

[13] [13]

Gerber, P.-T

S. Gerber, P.-T. Bremer, V . Pascucci, and R. Whitaker. Visual exploration of high dimensional scalar functions. IEEE transactions on visualization and computer graphics, 16(6):1271, 2010

work page 2010

[14] [14]

Gerber, P.-T

S. Gerber, P.-T. Bremer, V . Pascucci, and R. Whitaker. Visual exploration of high dimensional scalar functions. IEEE Transactions on Visualization and Computer Graphics, 16(6):1271–1280, 2010

work page 2010

[15] [15]

Gyulassy, P.-T

A. Gyulassy, P.-T. Bremer, B. Hamann, and V . Pascucci. A practical approach to morse-smale complex computation: Scalability and generality. IEEE Transactions on Visualization and Computer Graphics, 14(6), 2008

work page 2008

[16] [16]

Gyulassy and V

A. Gyulassy and V . Natarajan. Topology-based simpliﬁcation for feature extraction from 3d scalar ﬁelds. In Visualization, 2005. VIS 05. IEEE, pp. 535–542. IEEE, 2005

work page 2005

[17] [17]

Krause, A

J. Krause, A. Perer, and K. Ng. Interacting with predictions: Visual inspection of black-box machine learning models. In Proceedings of the 2016 CHI Conference on Human Factors in Computing Systems, pp. 5686–5697. ACM, 2016

work page 2016

[18] [18]

A. G. Landge, V . Pascucci, A. Gyulassy, J. C. Bennett, H. Kolla, J. Chen, and P.-T. Bremer. In-situ feature extraction of large scale combustion sim- ulations using segmented merge trees. In High Performance Computing, Networking, Storage and Analysis, SC14: International Conference for, pp. 1020–1031. IEEE, 2014

work page 2014

[19] [19]

L. Lins, J. T. Klosowski, and C. Scheidegger. Nanocubes for real-time exploration of spatiotemporal datasets.IEEE Transactions on Visualization and Computer Graphics, 19(12):2456–2465, 2013

work page 2013

[20] [20]

M. Liu, S. Liu, H. Su, K. Cao, and J. Zhu. Analyzing the noise robustness of deep neural networks. arXiv preprint arXiv:1810.03913, 2018

work page internal anchor Pith review Pith/arXiv arXiv 2018

[21] [21]

S. Liu, K. Humbird, L. Peterson, J. Thiagarajan, B. Spears, and P.-T. Bremer. Topology-driven analysis and exploration of high-dimensional models. In Research Challenges and Opportunities at the interface of Machine Learning and Uncertainty Quantiﬁcation, 2018

work page 2018

[22] [22]

S. Liu, Z. Li, T. Li, V . Srikumar, V . Pascucci, and P.-T. Bremer. Nlize: A perturbation-driven visual interrogation tool for analyzing and interpreting natural language inference models. IEEE transactions on visualization and computer graphics, 25(1):651–660, 2019

work page 2019

[23] [23]

Z. Liu, B. Jiang, and J. Heer. immens: Real-time visual querying of big data. In Computer Graphics Forum, vol. 32, pp. 421–430. Wiley Online Library, 2013

work page 2013

[24] [24]

S. M. Lundberg and S.-I. Lee. A uniﬁed approach to interpreting model predictions. In Advances in Neural Information Processing Systems, pp. 4768–4777, 2017

work page 2017

[25] [25]

Mayorga and M

A. Mayorga and M. Gleicher. Splatterplots: Overcoming overdraw in scatter plots. IEEE transactions on visualization and computer graphics, 19(9):1526–1538, 2013

work page 2013

[26] [26]

Y . Ming, H. Qu, and E. Bertini. Rulematrix: Visualizing and understanding classiﬁers with rules. IEEE transactions on visualization and computer graphics, 25(1):342–352, 2019

work page 2019

[27] [27]

Mjolsness and D

E. Mjolsness and D. DeCoste. Machine learning for science: state of the art and future prospects. science, 293(5537):2051–2055, 2001

work page 2051

[28] [28]

C. Olah, A. Mordvintsev, and L. Schubert. Feature visualization. Distill,

work page

[29] [29]

doi: 10.23915/distill

https://distill.pub/2017/feature-visualization. doi: 10.23915/distill. 00007

work page doi:10.23915/distill 2017

[30] [30]

Peterson, K

J. Peterson, K. Humbird, J. Field, S. Brandon, S. Langer, R. Nora, B. Spears, and P. Springer. Zonal ﬂow generation in inertial conﬁne- ment fusion implosions. Physics of Plasmas, 24(3):032702, 2017

work page 2017

[31] [31]

M. T. Ribeiro, S. Singh, and C. Guestrin. Why should i trust you?: Explaining the predictions of any classiﬁer. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 1135–1144. ACM, 2016

work page 2016

[32] [32]

Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps

K. Simonyan, A. Vedaldi, and A. Zisserman. Deep inside convolutional networks: Visualising image classiﬁcation models and saliency maps. arXiv preprint arXiv:1312.6034, 2013

work page internal anchor Pith review Pith/arXiv arXiv 2013

[33] [33]

Springer, C

P. Springer, C. Cerjan, R. Betti, J. Caggiano, M. Edwards, J. Frenje, V . Y . Glebov, S. Glenzer, S. Glenn, N. Izumi, et al. Integrated thermodynamic model for ignition target performance. In EPJ Web of Conferences, vol. 59, p. 04001. EDP Sciences, 2013

work page 2013

[34] [34]

D. M. Thomas and V . Natarajan. Detecting symmetry in scalar ﬁelds using augmented extremum graphs. IEEE Transactions on Visualization and Computer Graphics, 19(12):2663–2672, Dec 2013. doi: 10.1109/TVCG. 2013.148

work page doi:10.1109/tvcg 2013

[35] [35]

Tolstikhin, O

I. Tolstikhin, O. Bousquet, S. Gelly, and B. Schoelkopf. Wasserstein auto-encoders. arXiv preprint arXiv:1711.01558, 2017

work page arXiv 2017

[36] [36]

Van Essen, H

B. Van Essen, H. Kim, R. Pearce, K. Boakye, and B. Chen. Lbann: Livermore big artiﬁcial neural network hpc toolkit. In Proceedings of the Workshop on Machine Learning in High-Performance Computing Environments, p. 5. ACM, 2015

work page 2015

[37] [37]

J. Wang, L. Gou, H.-W. Shen, and H. Yang. Dqnviz: A visual analytics ap- proach to understand deep q-networks. IEEE transactions on visualization and computer graphics, 25(1):288–298, 2019

work page 2019

[38] [38]

Understanding Neural Networks Through Deep Visualization

J. Yosinski, J. Clune, A. Nguyen, T. Fuchs, and H. Lipson. Under- standing neural networks through deep visualization. arXiv preprint arXiv:1506.06579, 2015

work page internal anchor Pith review Pith/arXiv arXiv 2015

[39] [39]

M. D. Zeiler and R. Fergus. Visualizing and understanding convolutional networks. In European conference on computer vision , pp. 818–833. Springer, 2014. 10

work page 2014