pith. sign in

arxiv: 1907.11762 · v1 · pith:A4ZHTSY3new · submitted 2019-07-26 · 💻 cs.HC · cs.GR· cs.IT· math.IT· stat.AP

Multivariate Pointwise Information-Driven Data Sampling and Visualization

Pith reviewed 2026-05-24 15:06 UTC · model grok-4.3

classification 💻 cs.HC cs.GRcs.ITmath.ITstat.AP
keywords data samplingmultivariate datainformation theoryscientific visualizationdata reductionspatiotemporal datastatistical association
0
0 comments X

The pith

A sampling algorithm using pointwise information measures reduces large multivariate datasets while preserving associations among variables.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a sub-sampling method for large scientific simulation data involving multiple variables. It applies pointwise information theoretic measures to identify which data points carry the strongest statistical associations across variables. By keeping those points, the reduced dataset can still support accurate queries and visualizations that involve relationships between two or more variables. This addresses the problem of data growing too large for full storage and analysis on modern supercomputers. A sympathetic reader would care because it offers a way to keep essential multi-variable features without needing the entire dataset.

Core claim

The proposed multivariate association driven sampling algorithm leverages pointwise information theoretic measures to quantify the statistical association of data points considering multiple variables and generates a sub-sampled data that preserves the statistical association among multi-variables, allowing multivariate feature query and analysis to be done effectively on the reduced data.

What carries the argument

Multivariate association driven sampling algorithm that applies pointwise information theoretic measures to select data points based on their contribution to variable associations.

If this is right

  • The reduced data supports effective multivariate feature queries.
  • Visualization and analysis of multiple variables can be performed on the sampled data.
  • The important features involving multiple variables are preserved in the reduced data.
  • The method works on several scientific datasets for summarization.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If the method works, it could be applied to time-varying data streams where full storage is impossible.
  • Combining this sampling with spatial or temporal compression might yield even smaller representations.
  • Domain scientists could test it by measuring how well specific physical relationships are recovered after sampling.

Load-bearing premise

Pointwise information measures on multiple variables will pick out the points most critical for answering domain-specific questions about variable relationships.

What would settle it

Apply the sampling to a dataset, then compare the accuracy of multivariate queries or feature detection on the full data versus the sampled data; large discrepancies would falsify the claim.

Figures

Figures reproduced from arXiv: 1907.11762 by Ayan Biswas, James Ahrens, Soumya Dutta.

Figure 1
Figure 1. Figure 1: Visualization of Pressure and Velocity field of Hurricane Isabel data set. The hurricane eye at the center of Pressure field and the high velocity region around the hurricane eye can be observed [PITH_FULL_IMAGE:figures/full_fig_p006_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: PMI computed from Pressure and Velocity field of Hurricane Isabel data set is visualized. Figure 2a shows the 2D plot of PMI values for all value pairs of Pressure and Velocity, Figure 2b provides the PMI field for analyzing the PMI values in the spatial domain. It can be seen that around the hurricane eye, the eyewall is highlighted as high PMI-valued region which indicates a joint feature in the data set… view at source ↗
Figure 3
Figure 3. Figure 3: Sampling result on Isabel data set when Pressure and Velocity variables are used. Figure 3a shows results of random sampling and Figure 3b shows results of the proposed pointwise information driven sampling results for sampling fraction 0.03. By observing the PMI field presented in Figure 2b, it can be seen that the proposed sampling method samples densely from the regions where statistical association bet… view at source ↗
Figure 4
Figure 4. Figure 4: Sampling result for Isabel data set when three variables (QGraup, QCloud, and Precipitation) are used to perform sampling. In this case, the generalized specific correlation measure presented in Equation is used to compute multivariate associativity for the data points considering all three variables. Figure 4a–c show the rendering of QGraup, QCloud, and Precipitation fields respectively. Figure 4d present… view at source ↗
Figure 5
Figure 5. Figure 5: Results of the proposed sampling technique when the number of histogram bins is varied while computing the information theoretic measure PMI. It is observed that the overall result remains similar without impacting the outcome of the sampling algorithm significantly [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Visualization of multivariate query-driven analysis performed on the sampled data using Hurricane Isabel data set. The multivariate query −100 < Pressure < −4900 AND Velocity > 10 is applied on the sampled data sets. Figure 6a shows all the points selected by the proposed sampling algorithm by using Pressure and Velocity variable. Figure 6b shows the data points selected by the query when applied to raw da… view at source ↗
Figure 7
Figure 7. Figure 7: Visualization of multivariate query-driven analysis performed on the sampled data using Turbulent Combustion data set. The multivariate query 0.3 < mixfrac < 0.7 AND 0.0006 < Y_OH 0.1 is applied on the sampled data sets. Figure 7a shows all the points selected by the proposed sampling algorithm by using mixfrac and Y_OH variable. Figure 7b shows the data points selected by the query when applied to raw dat… view at source ↗
Figure 8
Figure 8. Figure 8: Visualization of multivariate query driven analysis performed on the sampled data using Asteroid impact data set. The multivariate query 0.13 < tev < 0.5 AND 0.45 < v02 1.0 is applied on the sampled data sets. Figure 8a shows all the points selected by the proposed sampling algorithm by using tev and v02 variable. Figure 8b shows the data points selected by the query when applied to raw data. Figure 8c sho… view at source ↗
Figure 9
Figure 9. Figure 9: shows the reconstructed data visualization of Velocity field for Hurricane Isabel data set. The visualization is focused on the feature in the data set, i.e., the hurricane eye region. The eye of the hurricane is an important feature where strong wind exists indicated by the dark reddish regions in the image. At the core of the eye, the velocity is low as seen from the blue color and around it the eyewall … view at source ↗
Figure 10
Figure 10. Figure 10: Reconstruction-based visualization of mixfrac field of Turbulent Combustion data set. Linear interpolation is used to reconstruct the data from the sub-sampled data sets. Figure 10a shows the result from the original raw data. Figure 10b provides the reconstruction result from the sub-sampled data generated by the proposed method, and Figure 10c presents the result of reconstruction from random sampled da… view at source ↗
Figure 11
Figure 11. Figure 11: Reconstruction-based visualization of Y_OH field of Turbulent Combustion data set. Linear interpolation is used to reconstruct the data from the sub-sampled data sets. Figure 11a shows the result from the original raw data. Figure 11b provides the reconstruction result from the sub-sampled data generated by the proposed method, and Figure 11c presents the result of reconstruction from random sampled data.… view at source ↗
Figure 12
Figure 12. Figure 12: Reconstruction-based visualization of tev field of Asteroid impact data set. Linear interpolation is used to reconstruct the data from the sub-sampled data sets. Figure 12a shows the result from the original raw data. Figure 12b provides the reconstruction result from the sub-sampled data generated by the proposed method, and Figure 12c presents the result of reconstruction from random sampled data. The s… view at source ↗
Figure 13
Figure 13. Figure 13: Regions of interest (ROI) of different data sets used for analysis. Figure 13a shows the ROI in Isabel data set, where the hurricane eye feature is selected. Figure 13b shows the ROI for Combustion data set, where the turbulent flame region is highlighted. Finally, in Figure 13c the ROI for asteroid data set is shown. The ROI selected in this example indicates the region where the asteroid has impacted th… view at source ↗
read the original abstract

With increasing computing capabilities of modern supercomputers, the size of the data generated from the scientific simulations is growing rapidly. As a result, application scientists need effective data summarization techniques that can reduce large-scale multivariate spatiotemporal data sets while preserving the important data properties so that the reduced data can answer domain-specific queries involving multiple variables with sufficient accuracy. While analyzing complex scientific events, domain experts often analyze and visualize two or more variables together to obtain a better understanding of the characteristics of the data features. Therefore, data summarization techniques are required to analyze multi-variable relationships in detail and then perform data reduction such that the important features involving multiple variables are preserved in the reduced data. To achieve this, in this work, we propose a data sub-sampling algorithm for performing statistical data summarization that leverages pointwise information theoretic measures to quantify the statistical association of data points considering multiple variables and generates a sub-sampled data that preserves the statistical association among multi-variables. Using such reduced sampled data, we show that multivariate feature query and analysis can be done effectively. The efficacy of the proposed multivariate association driven sampling algorithm is presented by applying it on several scientific data sets.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 0 minor

Summary. The manuscript proposes a data sub-sampling algorithm for large multivariate spatiotemporal scientific datasets that applies pointwise information-theoretic measures to quantify per-point statistical associations across multiple variables; high-association points are retained to produce a reduced dataset that preserves cross-variable relationships, enabling effective multivariate feature queries and visualization.

Significance. If empirically validated on the claimed scientific datasets, the approach would supply a concrete, information-theoretic heuristic for multivariate data reduction that directly targets association preservation rather than univariate statistics or random sampling; this addresses a practical need in supercomputing visualization workflows where domain experts routinely examine joint variable behavior.

major comments (2)
  1. [Abstract] Abstract: the central claim that the sub-sampled data 'preserves the statistical association among multi-variables' and permits 'multivariate feature query and analysis ... effectively' is stated without any quantitative support (no before/after mutual-information values, correlation matrices, or query-error metrics); this absence prevents evaluation of whether the heuristic actually works.
  2. [Abstract] The weakest assumption—that retaining points with high pointwise multivariate association is sufficient to protect downstream domain-specific multivariate queries—is presented as self-evident but receives no explicit test against baselines (random sampling, univariate information sampling, or existing feature-preservation methods) on the scientific datasets; without such controls the efficacy claim remains unanchored.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on the abstract. We agree that strengthening the abstract with quantitative evidence and explicit baseline references will improve clarity and will revise accordingly.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that the sub-sampled data 'preserves the statistical association among multi-variables' and permits 'multivariate feature query and analysis ... effectively' is stated without any quantitative support (no before/after mutual-information values, correlation matrices, or query-error metrics); this absence prevents evaluation of whether the heuristic actually works.

    Authors: We agree that the abstract would benefit from explicit quantitative support. In the revised manuscript we will update the abstract to report specific before/after mutual-information values, correlation-matrix differences, and query-error metrics drawn from the existing evaluations on the scientific datasets. revision: yes

  2. Referee: [Abstract] The weakest assumption—that retaining points with high pointwise multivariate association is sufficient to protect downstream domain-specific multivariate queries—is presented as self-evident but receives no explicit test against baselines (random sampling, univariate information sampling, or existing feature-preservation methods) on the scientific datasets; without such controls the efficacy claim remains unanchored.

    Authors: We acknowledge that the abstract does not explicitly reference baseline comparisons. While the body of the manuscript presents results on multiple scientific datasets, we will revise the abstract to include a concise statement summarizing the comparisons against random sampling, univariate information-driven sampling, and other feature-preservation methods, citing the key performance differences already obtained. revision: yes

Circularity Check

0 steps flagged

No significant circularity; method is a heuristic using standard information-theoretic quantities

full rationale

The paper proposes a sampling algorithm that applies existing pointwise multivariate information measures to retain high-association points. No equations, fitted parameters, or self-citations are presented in the abstract or description that reduce the central claim to a definition or prior result by the same authors. The approach is described as leveraging standard information-theoretic quantities without internal redefinition or prediction-by-construction. This is the common case of an empirical heuristic whose validity is tested externally on datasets rather than derived tautologically.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no free parameters, axioms, or invented entities are specified in the provided text.

pith-pipeline@v0.9.0 · 5742 in / 1048 out tokens · 20438 ms · 2026-05-24T15:06:55.152638+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

71 extracted references · 71 canonical work pages · 1 internal anchor

  1. [1]

    Scientific discovery at the exascale

    Ahern, S.; Shoshani, A.; Ma, K.L.; Choudhary, A.; Critchlow, T.; Klasky, S.; Pascucci, V .; Ahrens, J.; Bethel, E.; Childs, H. Scientific discovery at the exascale. Report from the DOE ASCR 2011 Workshop on Exascale Data Management. Houston, TX. ; 2011

  2. [2]

    Data Exploration at the Exascale

    Childs, H. Data Exploration at the Exascale. Supercomput. Front. Innov. 2015, 2, 5–13

  3. [3]

    An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis

    Ahrens, J.; Jourdain, S.; OLeary, P .; Patchett, J.; Rogers, D.H.; Petersen, M. An Image-Based Approach to Extreme Scale in Situ Visualization and Analysis. In Proceedings of the SC14: International Conference for High Performance Computing, Networking, Storage and Analysis, New Orleans, LA, USA, 16–21 November 2014 ; pp. 424–434, doi:10.1109/SC.2014.40

  4. [5]

    Explorable images for visualizing volume data

    Tikhonova, A.; Correa, C.D.; Ma, K. Explorable images for visualizing volume data. In Proceedings of the 2010 IEEE Pacific Visualization Symposium (PacificVis), Taipei, Taiwan, 2–5 March 2010; pp. 177–184, doi:10.1109/PACIFICVIS.2010.5429595

  5. [6]

    In Situ Distribution Guided Analysis and Visualization of Transonic Jet Engine Simulations

    Dutta, S.; Chen, C.M.; Heinlein, G.; Shen, H.W.; Chen, J.P . In Situ Distribution Guided Analysis and Visualization of Transonic Jet Engine Simulations. IEEE Trans. Vis. Comput. Graph. 2017, 23, 811–820

  6. [7]

    In Situ Eddy Analysis in a High-Resolution Ocean Climate Model

    Woodring, J.; Petersen, M.; Schmei βer, A.; Patchett, J.; Ahrens, J.; Hagen, H. In Situ Eddy Analysis in a High-Resolution Ocean Climate Model. IEEE Trans. Vis. Comput. Graph. 2016, 22, 857–866, doi:10.1109/TVCG.2015.2467411

  7. [8]

    Visualizing Multivariate Volume Data from Turbulent Combustion Simulations

    Akiba, H.; Ma, K.; Chen, J.H.; Hawkes, E.R. Visualizing Multivariate Volume Data from Turbulent Combustion Simulations. Comput. Sci. Eng. 2007, 9, 76–83, doi:10.1109/MCSE.2007.42

  8. [9]

    An Application of Multivariate Statistical Analysis for Query-Driven Visualization

    Gosink , L.J.; Garth, C.; Anderson, J.C.; Bethel, E.W.; Joy, K.I. An Application of Multivariate Statistical Analysis for Query-Driven Visualization. IEEE Trans. Vis. Comput. Graph. 2011, 17, 264–275, doi:10.1109/TVCG.2010.80

  9. [10]

    CoDDA: A Flexible Copula-based Distribution Driven Analysis Framework for Large-Scale Multivariate Data

    Hazarika, S.; Dutta, S.; Shen, H.; Chen, J. CoDDA: A Flexible Copula-based Distribution Driven Analysis Framework for Large-Scale Multivariate Data. IEEE Trans. Vis. Comput. Graph. 2019, 25, 1214–1224, doi:10.1109/TVCG.2018.2864801

  10. [11]

    Association Analysis for Visual Exploration of Multivariate Scientific Data Sets

    Liu, X.; Shen, H.W. Association Analysis for Visual Exploration of Multivariate Scientific Data Sets. IEEE Trans. Vis. Comput. Graph. 2016, 22, 955–964, doi:10.1109/TVCG.2015.2467431

  11. [12]

    An Information-Aware Framework for Exploring Multivariate Data Sets

    Biswas, A.; Dutta, S.; Shen, H.; Woodring, J. An Information-Aware Framework for Exploring Multivariate Data Sets. IEEE Trans. Vis. Comput. Graph. 2013, 19, 2683–2692, doi:10.1109/TVCG.2013.133

  12. [13]

    Query-driven visualization of large data sets

    Stockinger, K.; Shalf, J.; Wu, K.; Bethel, E.W. Query-driven visualization of large data sets. In Proceedings of the VIS 05. IEEE Visualization, 2005, Minneapolis, MN, USA, 23–28 October 2005; pp. 167–174, doi:10.1109/VISUAL.2005.1532792

  13. [15]

    Yan and W

    Wang, K.; Wei, T.; Shareef, N.; Shen, H. Statistical visualization and analysis of large data using a value-based spatial distribution. In Proceedings of the 2017 IEEE Pacific Visualization Symposium (PacificVis), Seoul, Korea, 18–21 April 2017; pp. 161–170, doi:10.1109/PACIFICVIS.2017.8031590

  14. [16]

    Homogeneity guided probabilistic data summaries for analysis and visualization of large-scale data sets

    Dutta, S.; Woodring, J.; Shen, H.W.; Chen, J.P .; Ahrens, J. Homogeneity guided probabilistic data summaries for analysis and visualization of large-scale data sets. In Proceedings of the 2017 IEEE Pacific Visualization Symposium (PacificVis), Seoul, Korea, 18–21 April 2017, pp. 111–120, doi:10.1109/PACIFICVIS.2017.8031585

  15. [17]

    Interactive desktop analysis of high resolution simulations: application to turbulent plume dynamics and current sheet formation

    Clyne, J.; Mininni, P .; Norton, A.; Rast, M. Interactive desktop analysis of high resolution simulations: application to turbulent plume dynamics and current sheet formation. New J. Phys. 2007, 9, 301

  16. [18]

    Spatiotemporal Wavelet Compression for Visualization of Scientific Simulation Data

    Li, S.; Sane, S.; Orf, L.; Mininni, P .; Clyne, J.; Childs, H. Spatiotemporal Wavelet Compression for Visualization of Scientific Simulation Data. In Proceedings of the 2017 IEEE International Conference on Cluster Computing (CLUSTER), Honolulu, HI, USA, 5–8 September 2017; pp. 216–227, doi:10.1109/CLUSTER.2017.15

  17. [20]

    Compressing the Incompressible with ISABELA: In-situ Reduction of Spatio-temporal Data

    Lakshminarasimhan, S.; Shah, N.; Ethier, S.; Klasky, S.; Latham, R.; Ross, R.; Samatova, N.F. Compressing the Incompressible with ISABELA: In-situ Reduction of Spatio-temporal Data. In Euro-Par 2011 Parallel Processing; Jeannot, E.; Namyst, R.; Roman, J., Eds.; Springer: Berlin/Heidelberg, Germany, 2011; pp. 366–379

  18. [21]

    In Situ Data-driven Adaptive Sampling for Large-scale Simulation Data Summarization

    Biswas, A.; Dutta, S.; Pulido, J.; Ahrens, J. In Situ Data-driven Adaptive Sampling for Large-scale Simulation Data Summarization. In Proceedings of the Workshop on In Situ Infrastructures for Enabling Extreme-Scale Analysis and Visualization, Dallas, TX, USA, 12 November 2018; ACM: New York, NY, USA, 2018; pp. 13–18, doi:10.1145/3281464.3281467

  19. [22]

    Javed and N

    Wei, T.; Dutta, S.; Shen, H. Information Guided Data Sampling and Recovery Using Bitmap Indexing. In Proceedings of the 2018 IEEE Pacific Visualization Symposium (PacificVis), Kobe, Japan, 10–13 April 2018; pp. 56–65, doi:10.1109/PacificVis.2018.00016

  20. [23]

    In-situ Sampling of a Large-scale Particle Simulation for Interactive Visualization and Analysis

    Woodring, J.; Ahrens, J.; Figg, J.; Wendelberger, J.; Habib, S.; Heitmann, K. In-situ Sampling of a Large-scale Particle Simulation for Interactive Visualization and Analysis. In Proceedings of the 13th Eurographics/IEEE—VGTC Conference on Visualization. Eurographics Association, Bergen, Norway, 1–3 June 2011; pp. 1151–1160, doi:10.1111/j.1467-8659.2011.01964.x

  21. [24]

    Taming Massive Distributed Datasets: Data Sampling Using Bitmap Indices

    Su, Y.; Agrawal, G.; Woodring, J.; Myers, K.; Wendelberger, J.; Ahrens, J. Taming Massive Distributed Datasets: Data Sampling Using Bitmap Indices. In Proceedings of the 22Nd International Symposium on High-performance Parallel and Distributed Computing, New York, NY, USA, 17–21 June 2013; ACM: New York, NY, USA, 2013; pp. 13–24, doi:10.1145/2462902.2462906

  22. [25]

    Word association norms, mutual information, and lexicography

    Church, K.W.; Hanks, P . Word association norms, mutual information, and lexicography. In Proceedings of the 27th Annual Meeting on Association for Computational Linguistics, Vancouver, BC, Canada, 26–29 June 1989; Association for Computational Linguistics: Stroudsburg, PA, USA, 1989; pp. 76–83, doi:10.3115/981623.981633

  23. [26]

    Two multivariate generalizations of pointwise mutual information

    Van de Cruys, T. Two multivariate generalizations of pointwise mutual information. In Proceedings of the Workshop on Distributional Semantics and Compositionality, Portland, OR, USA, 24 June 2011; Association for Computational Linguistics: Stroudsburg, PA, USA, 2011; pp. 16–20

  24. [27]

    Elements of Information Theory, 2nd ed.; Wiley Series in Telecommunications and Signal Processing; Wiley-Interscience, New York, NY, USA: 2006

    Cover, T.M.; Thomas, J.A. Elements of Information Theory, 2nd ed.; Wiley Series in Telecommunications and Signal Processing; Wiley-Interscience, New York, NY, USA: 2006

  25. [28]

    A Mathematical Theory of Communication

    Shannon, C.E. A Mathematical Theory of Communication. SIGMOBILE Mob. Comput. Commun. Rev. 2001, 5, 3–55, doi:10.1145/584091.584093

  26. [29]

    Fifty years of Shannon theory

    Verdú, S. Fifty years of Shannon theory. Inf. Theory IEEE Trans. 1998, 44, 2057–2078, doi:10.1109/18.720531

  27. [30]

    Importance-Driven Focus of Attention

    Viola, I.; Feixas, M.; Sbert, M.; Gröller, M.E. Importance-Driven Focus of Attention. IEEE Trans. Vis. Comput. Graph. 2006, 12, 933–940, doi:10.1109/TVCG.2006.152

  28. [31]

    Automated multi-modality image registration based on information theory

    Collignon, A.; Maes, F.; Delaere, D.; Vandermeulen, D.; Suetens, P .; Marchal, G. Automated multi-modality image registration based on information theory. Inf. Process. Med. Imaging , 1995, 3, 263–274

  29. [32]

    Medical image registration.Phys

    Hill, D.L.G.; Batchelor, P .G.; Holden, M.; Hawkes, D.J. Medical image registration.Phys. Med. Biol. 2001, 46, R1. Entropy 2019, xx, 5 26 of 27

  30. [33]

    Multi-modal volume registration by maximization of mutual information

    III, W.M.W.; Viola, P .; Atsumi, H.; Nakajima, S.; Kikinis, R. Multi-modal volume registration by maximization of mutual information. Med. Image Anal. 1996, 1, 35–51, doi:10.1016/S1361-8415(01)80004-9

  31. [34]

    Multimodality image registration by maximization of mutual information

    Maes, F.; Collignon, A.; Vandermeulen, D.; Marchal, G.; Suetens, P . Multimodality image registration by maximization of mutual information. IEEE Trans. Med. Imaging 1997, 16, 187–198, doi:10.1109/42.563664

  32. [35]

    Mutual-information-based registration of medical images: A survey

    Pluim, J.P .W.; Maintz, J.B.A.; Viergever, M.A. Mutual-information-based registration of medical images: A survey. IEEE Trans. Med. Imaging 2003, 22, 986–1004

  33. [36]

    An Information Theory Framework for the Analysis of Scene Complexity

    Feixas, M.; Acebo, E.D.; Bekaert, P .; Sbert, M. An Information Theory Framework for the Analysis of Scene Complexity. In Computer Graphics Forum; Blackwell Publishers, Ltd.: Oxford, UK; Boston, MA, USA, 1999

  34. [37]

    Shape complexity based on mutual information

    Rigau, J.; Feixas, M.; Sbert, M. Shape complexity based on mutual information. In Proceedings of the Shape Modeling and Applications, 2005 International Conference, Cambridge, MA, USA, 13–17 June 2005; pp. 355–360, doi:10.1109/SMI.2005.42

  35. [38]

    A unified information-theoretic framework for viewpoint selection and mesh saliency

    Feixas, M.; Sbert, M.; González, F. A unified information-theoretic framework for viewpoint selection and mesh saliency. ACM Trans. Appl. Percept. 2009, doi:10.1145/1462055.1462056

  36. [39]

    Isosurface Similarity Maps

    Bruckner, S.; Möller, T. Isosurface Similarity Maps. Comput. Graph. Forum 2010, 29, 773–782

  37. [40]

    Evaluating Isosurfaces with Level-set-based Information Maps

    Wei, T.H.; Lee, T.Y.; Shen, H.W. Evaluating Isosurfaces with Level-set-based Information Maps. In Proceedings of the 15th Eurographics Conference on Visualization, Leipzig, Germany, 17–21 June 2013; Eurographics Association: Aire-la-Ville, Switzerland, 2013; pp. 1–10, doi:10.1111/cgf.12087

  38. [41]

    An Information-Theoretic Observation Channel for Volume Visualization

    Bramon, R.; Ruiz, M.; Bardera, A.; Boada, I.; Feixas, M.; Sbert, M. An Information-Theoretic Observation Channel for Volume Visualization. Comput. Graph. Forum 2013, 32, 411–420

  39. [42]

    Coherent view-dependent streamline selection for importance-driven flow visualization

    Ma, J.; Wang, C.; Shene, C.K. Coherent view-dependent streamline selection for importance-driven flow visualization. Proc. SPIE 2013, 8654, doi:10.1117/12.2001887

  40. [43]

    A Unified Approach to Streamline Selection and Viewpoint Selection for 3D Flow Visualization

    Tao, J.; Ma, J.; Wang, C.; Shene, C.K. A Unified Approach to Streamline Selection and Viewpoint Selection for 3D Flow Visualization. IEEE Trans. Vis. Comput. Graph. 2013, 19, 393–406

  41. [44]

    Multimodal Data Fusion Based on Mutual Information

    Bramon, R.; Boada, I.; Bardera, A.; Rodriguez, J.; Feixas, M.; Puig, J.; Sbert, M. Multimodal Data Fusion Based on Mutual Information. IEEE Trans. Vis. Comput. Graph. 2012, 18, 1574 –1587, doi:10.1109/TVCG.2011.280

  42. [45]

    Information Theory-Based Automatic Multimodal Transfer Function Design

    Bramon, R.; Ruiz, M.; Bardera, A.; Boada, I.; Feixas, M.; Sbert, M. Information Theory-Based Automatic Multimodal Transfer Function Design. IEEE J. Biomed. Health Inform. 2013, 17, 870–880, doi:10.1109/JBHI.2013.2263227

  43. [46]

    Information-based Transfer Functions for Multimodal Visualization

    Haidacher, M.; Bruckner, S.; Kanitsar, A.; Gröller, M.E. Information-based Transfer Functions for Multimodal Visualization. In Proceedings of the First Eurographics conference on Visual Computing for Biomedicine, Delft, The Netherlands, 6–7 October 2008;VCBM; Botha, C.P .; Kindlmann, G.; Niessen, W.; Preim, B., Eds.; Eurographics Association: 2008; pp. 101–108

  44. [47]

    Pointwise Information Guided Visual Analysis of Time-varying Multi-fields

    Dutta, S.; Liu, X.; Biswas, A.; Shen, H.W.; Chen, J.P . Pointwise Information Guided Visual Analysis of Time-varying Multi-fields. In Proceedings of the SIGGRAPH Asia 2017 Symposium on Visualization, Bangkok, Thailand, 27–30 November 2017; ACM: New York, NY, USA, 2017; pp. 17:1–17:8, doi:10.1145/3139295.3139298

  45. [48]

    Information Theory Tools for Visualization; A K Peters: Natick, MA, USA; CRC Press: Boca Raton, FL, USA, 2016

    Chen, M.; Feixas, M.; Viola, I.; Bardera, A.; Shen, H.W.; Sbert, M. Information Theory Tools for Visualization; A K Peters: Natick, MA, USA; CRC Press: Boca Raton, FL, USA, 2016

  46. [49]

    An Information-theoretic Framework for Visualization

    Chen, M.; Jänicke, H. An Information-theoretic Framework for Visualization. IEEE Trans. Vis. Comput. Graph. 2010, 16, 1206–1215, doi:10.1109/TVCG.2010.132

  47. [50]

    Informational Aesthetics Measures

    Rigau, J.; Feixas, M.; Sbert, M. Informational Aesthetics Measures. IEEE Comput. Graph. Appl. 2008, 28, 24–34

  48. [51]

    Information Theory Tools for Computer Graphics; Synthesis Lectures on Computer Graphics and Animation; Morgan and Claypool Publishers: Fort Collins, CO, USA, 2009

    Sbert, M.; Feixas, M.; Rigau, J.; Chover, M.; Viola, I. Information Theory Tools for Computer Graphics; Synthesis Lectures on Computer Graphics and Animation; Morgan and Claypool Publishers: Fort Collins, CO, USA, 2009

  49. [52]

    Information Theory in Scientific Visualization

    Wang, C.; Shen, H.W. Information Theory in Scientific Visualization. Entropy 2011, 13, 254–273, doi:10.3390/e13010254

  50. [53]

    Visualization-Aware Sampling for Very Large Databases

    Park, Y.; Cafarella, M.J.; Mozafari, B. Visualization-Aware Sampling for Very Large Databases. arXiv 2015, arXiv:1510.03921

  51. [54]

    Centrality clustering-based sampling for big data visualization

    Nguyen, T.T.; Song, I. Centrality clustering-based sampling for big data visualization. In Proceedings of the 2016 International Joint Conference on Neural Networks (IJCNN), Vancouver, BC, Canada, 24–29 July 2016; pp. 1911–1917, doi:10.1109/IJCNN.2016.7727433. Entropy 2019, xx, 5 27 of 27

  52. [55]

    Weighted Finite Population Sampling to Maximize Entropy.Biometrika 1994, 81, 457–469

    Chen, X.H.; Dempster, A.P .; Liu, J.S. Weighted Finite Population Sampling to Maximize Entropy.Biometrika 1994, 81, 457–469

  53. [56]

    An Exact Algorithm for Maximum Entropy Sampling

    Ko, C.W.; Lee, J.; Queyranne, M. An Exact Algorithm for Maximum Entropy Sampling. Oper. Res. 1995, 43, 684–691, doi:10.1287/opre.43.4.684

  54. [57]

    Maximum entropy sampling

    Shewry, M.C.; Wynn, H.P . Maximum entropy sampling. J. Appl. Stat. 1987, 14, 165–170, doi:10.1080/02664768700000020

  55. [58]

    Multifield-Graphs: An Approach to Visualizing Correlations in Multifield Scalar Data

    Sauber, N.; Theisel, H.; Seidel, H.P . Multifield-Graphs: An Approach to Visualizing Correlations in Multifield Scalar Data. IEEE Trans. Vis. Comput. Graph. 2006, 12, 917–924, doi:10.1109/TVCG.2006.165

  56. [59]

    Multifield visualization using local statistical complexity

    Jänicke, H.; Wiebel, A.; Scheuermann, G.; Kollmann, W. Multifield visualization using local statistical complexity. IEEE Trans. Vis. Comput. Graph. 2007, 13, 1384–1391, doi:10.1109/TVCG.2007.70615

  57. [60]

    30 Years of Multidimensional Multivariate Visualization

    Wong, P .C.; Bergeron, R.D. 30 Years of Multidimensional Multivariate Visualization. InScientific Visualization, Overviews, Methodologies, and Techniques; IEEE Computer Society: Washington, DC, USA, 1997; pp. 3–33

  58. [61]

    Visualization of Multi-Variate Scientific Data

    Fuchs, R.; Hauser, H. Visualization of Multi-Variate Scientific Data. In Computer Graphics Forum; Blackwell Publishing Ltd.: Oxford, UK, 2009; Volume 28, pp. 1670–1690, doi:10.1111/j.1467-8659.2009.01429.x

  59. [62]

    Sampling: Design and Analysis; Advanced (Cengage Learning), Cengage Learning: 2009

    Lohr, S. Sampling: Design and Analysis; Advanced (Cengage Learning), Cengage Learning: 2009

  60. [63]

    Bayesian Computation with R; Use R; Springer: New York, NY, USA, 2009

    Albert, J. Bayesian Computation with R; Use R; Springer: New York, NY, USA, 2009

  61. [64]

    Information theoretical analysis of multivariate correlation

    Watanabe, S. Information theoretical analysis of multivariate correlation. IBM J. Res. Dev. 1960, 4, 66–82, doi:10.1147/rd.41.0066

  62. [65]

    On Sequential Monte Carlo Sampling Methods for Bayesian Filtering

    Doucet, A.; Godsill, S.; Andrieu, C. On Sequential Monte Carlo Sampling Methods for Bayesian Filtering. Stat. Comput. 2000, 10, 197–208

  63. [66]

    Chen and G

    Lawrence, E.; Wiel, S.V .; Bent, R. Model Bank State Estimation for Power Grids Using Importance Sampling. Technometrics 2013, 55, 426–435, doi:10.1080/00401706.2013.822424

  64. [67]

    Variable Interactions in Query-Driven Visualization

    Gosink, L.; Anderson, J.; Bethel, W.; Joy, K. Variable Interactions in Query-Driven Visualization. IEEE Trans. Vis. Comput. Graph. 2007, 13, 1400–1407, doi:10.1109/TVCG.2007.70519

  65. [68]

    Deep Water Impact Ensemble Data Set

    Patchett, J.; Gisler, G. Deep Water Impact Ensemble Data Set. Los Alamos National Laboratory, LA-UR-17-21595

  66. [69]

    Available online: https://oceans11.lanl.gov/deepwaterimpact/)

  67. [70]

    Three-dimensional simulations of oblique asteroid impacts into water

    Gisler, G.R.; Heberling, T.; Plesko, C.S.; Weaver, R.P . Three-dimensional simulations of oblique asteroid impacts into water. J. Space Saf. Eng. 2018, 5, 106–114

  68. [71]

    Distance between Sets

    Levandowsky, M.; Winter, D. Distance between Sets. Nature 1971, 234, doi:10.1038/234034a0

  69. [72]

    IEEE Transactions on Image Processing 26(5), 2274–2285 (2017)

    Bovik, A.C.; Sheikh, H.R.; Simoncelli, E.P . Image quality assessment: From error visibility to structural similarity. IEEE Trans. Image Process. 2004, 13, 600–612, doi:10.1109/TIP .2003.819861

  70. [73]

    Measuring and Testing Dependence by Correlation of Distances

    Székely, G.J.; Rizzo, M.L.; Bakirov, N.K. Measuring and Testing Dependence by Correlation of Distances. Ann. Stat. 2007, 35, 2769–2794

  71. [74]

    A compact multivariate histogram representation for query-driven visualization

    Lu, K.; Shen, H. A compact multivariate histogram representation for query-driven visualization. In Proceedings of the 2015 IEEE 5th Symposium on Large Data Analysis and Visualization (LDAV), Chicago, IL, USA, 25–26 October 2015; pp. 49–56, doi:10.1109/LDAV .2015.7348071. c⃝ 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open a...