pith. sign in

arxiv: 2605.07424 · v1 · submitted 2026-05-08 · 💻 cs.LG

A Flexible Adaptive Stable Clustering Algorithm for Archive-Scale Online Mass Spectrometry

Pith reviewed 2026-05-11 02:07 UTC · model grok-4.3

classification 💻 cs.LG
keywords clustering algorithmmass spectrometryatmospheric aerosolsscalable clusteringdeterministic clusteringlinear runtimeenvironmental data
0
0 comments X

The pith

FASC clustering achieves over 99.5 percent purity and linear runtime scaling on 25 million mass spectra through deterministic convergence rules.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces Flexible Adaptive Stable Clustering as a method to handle the large data volumes from online mass spectrometry without the usual trade-offs between speed, metric choice, and result stability. It claims that a Density-Augmented Similarity Selection rule together with geometric constraints, placed inside a dynamical systems setup, produces convergence that stays the same no matter the order of the input data. On standard benchmark sets the method reaches high accuracy scores, and on real atmospheric aerosol data it processes 25 million spectra in strictly linear time while identifying chemical aging sequences and rare tracers. This separation of the similarity measure from the core optimization logic is presented as the key to avoiding the stochastic drift seen in prior approaches.

Core claim

FASC is a dynamical systems framework that architecturally decouples the similarity kernel from rigorous optimization logic. It employs a Density-Augmented Similarity Selection rule and geometric constraints to guarantee deterministic, order-independent convergence. After validation on canonical machine-learning ground truths with greater than 99.5 percent cluster purity and 0.99 Adjusted Rand Index, the framework was applied to 25 million mass spectra of atmospheric aerosols, where it exhibited strictly linear empirical runtime scaling while autonomously mapping aging pathways of secondary inorganic aerosols and isolating ultra-rare industrial tracers.

What carries the argument

Density-Augmented Similarity Selection rule combined with geometric constraints inside a dynamical systems framework that decouples the similarity kernel from optimization logic.

If this is right

  • Enables extraction of chemical insights from multi-terabyte data streams generated by online mass spectrometry.
  • Reveals atmospheric aging pathways of secondary inorganic aerosols without manual intervention.
  • Detects industrial tracers present at abundances below 0.2 percent in large environmental datasets.
  • Supplies a scalable infrastructure for mining environmental big data with guaranteed stability.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The kernel-logic separation could allow the same convergence guarantees to be used with similarity measures other than those tested here.
  • Strict linear scaling opens the possibility of applying the method to continuous, real-time streams rather than only archived batches.
  • If the order-independence holds beyond the tested distributions, the approach may transfer to other high-volume scientific domains that require stable grouping of streaming observations.

Load-bearing premise

The Density-Augmented Similarity Selection rule and geometric constraints will produce deterministic and order-independent convergence for arbitrary mass-spectrometry feature distributions without dataset-specific tuning.

What would settle it

Running the algorithm on the same mass-spectra collection in multiple different orders and obtaining inconsistent cluster assignments, or observing cluster purity below 99.5 percent on the canonical benchmark sets.

read the original abstract

Modern online mass spectrometry generates multi-terabyte data streams critical for understanding Earth's environmental systems. However, extracting actionable chemical insights from these repositories is impeded by a computational bottleneck: existing clustering methods force a compromise among scalability, metric flexibility, and algorithmic stability. Here, we introduce Flexible Adaptive Stable Clustering (FASC), a dynamical systems framework that resolves these constraints by architecturally decoupling the similarity kernel from rigorous optimization logic. Unlike legacy heuristics that suffer from stochastic drift and algorithmic blending, FASC employs a Density-Augmented Similarity Selection rule and geometric constraints to guarantee deterministic, order-independent convergence. After validating FASC on canonical machine-learning ground truths (achieving >99.5% cluster purity and 0.99 Adjusted Rand Index), we deployed the framework on 25 million mass spectra of atmospheric aerosols. Demonstrating strictly linear empirical runtime scaling (O(N)), FASC autonomously mapped atmospheric aging pathways of secondary inorganic aerosols while isolating ultra-rare industrial tracers (<0.2% abundance), providing a scalable infrastructure for mining environmental big data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper introduces Flexible Adaptive Stable Clustering (FASC), a dynamical systems framework for clustering large-scale online mass spectrometry data. It decouples the similarity kernel from optimization logic via a Density-Augmented Similarity Selection rule and geometric constraints, claiming to guarantee deterministic, order-independent convergence without dataset-specific tuning. Validation on canonical ML ground-truth datasets yields >99.5% cluster purity and 0.99 Adjusted Rand Index; deployment on 25 million atmospheric aerosol spectra demonstrates strictly linear O(N) empirical runtime, autonomous mapping of secondary inorganic aerosol aging pathways, and isolation of ultra-rare industrial tracers (<0.2% abundance).

Significance. If the convergence guarantees and empirical results hold, FASC would provide a valuable scalable infrastructure for mining multi-terabyte environmental MS repositories, addressing the scalability-stability-flexibility trade-off that limits existing methods. The linear runtime on 25M spectra and autonomous pathway mapping without post-hoc adjustments could enable new analyses of atmospheric aging and rare tracers in big environmental data.

major comments (2)
  1. [Abstract and §3] Abstract and §3 (Algorithm Description): The claim that the Density-Augmented Similarity Selection rule plus geometric constraints 'guarantee deterministic, order-independent convergence' for arbitrary MS feature distributions is load-bearing for all reported metrics, yet no formal proof, invariance analysis, or pseudocode is supplied to establish these properties or identify failure modes (e.g., under high-dimensional sparsity or multimodal aerosol spectra).
  2. [§4 and §5] §4 (Validation) and §5 (Application): The reported >99.5% purity, 0.99 ARI, and strictly linear O(N) scaling on 25M spectra are presented without baseline comparisons to standard scalable clustering methods, error bars, or ablation studies on the Density-Augmented rule; this leaves open whether the results depend on hidden post-hoc adjustments or hold only for the tested distributions.
minor comments (2)
  1. [Abstract] Abstract: The phrase 'strictly linear empirical runtime scaling (O(N))' should be accompanied by measured constants or memory scaling to substantiate archive-scale applicability.
  2. [§2] Notation: The term 'dynamical systems framework' is used without a precise mapping to the clustering update rules or fixed-point analysis.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on the convergence guarantees and validation sections. We address each major comment below and will incorporate revisions to strengthen the manuscript.

read point-by-point responses
  1. Referee: [Abstract and §3] Abstract and §3 (Algorithm Description): The claim that the Density-Augmented Similarity Selection rule plus geometric constraints 'guarantee deterministic, order-independent convergence' for arbitrary MS feature distributions is load-bearing for all reported metrics, yet no formal proof, invariance analysis, or pseudocode is supplied to establish these properties or identify failure modes (e.g., under high-dimensional sparsity or multimodal aerosol spectra).

    Authors: The deterministic and order-independent properties follow directly from the Density-Augmented Similarity Selection rule, which assigns fixed scores independent of input order, together with geometric constraints that enforce separation without iterative reordering. We agree a formal treatment is needed for rigor. In the revised manuscript we will add to §3 a proof sketch establishing order-invariance (the selection sequence is permutation-invariant by construction of the density-augmented metric) and an analysis of failure modes, including cases of extreme sparsity where bandwidth adaptation may be required. Full pseudocode will also be included. These additions clarify the scope without altering the reported results. revision: yes

  2. Referee: [§4 and §5] §4 (Validation) and §5 (Application): The reported >99.5% purity, 0.99 ARI, and strictly linear O(N) scaling on 25M spectra are presented without baseline comparisons to standard scalable clustering methods, error bars, or ablation studies on the Density-Augmented rule; this leaves open whether the results depend on hidden post-hoc adjustments or hold only for the tested distributions.

    Authors: We concur that explicit baselines and controls would improve transparency. The revised §4 will include comparisons to Mini-Batch K-Means, DBSCAN, and HDBSCAN on the canonical datasets, reporting ARI and purity with standard deviations across 10 random initializations. An ablation removing the Density-Augmented rule will be added to quantify its contribution. For the 25M-spectra deployment in §5, runtime scaling plots will include error bars derived from repeated subsampling experiments. These revisions confirm the absence of post-hoc tuning and demonstrate robustness across the tested distributions. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical claims rest on independent validation rather than self-referential definitions or fitted inputs

full rationale

The abstract and available description present FASC as an algorithm whose core rule (Density-Augmented Similarity Selection plus geometric constraints) is asserted to produce deterministic convergence, with performance (>99.5% purity, 0.99 ARI, linear scaling on 25M spectra) reported as measured outcomes on canonical ground-truth datasets and real aerosol spectra. No equations, parameter-fitting steps, or self-citations are shown that would make these results reduce by construction to the inputs or to prior author work. The derivation chain is therefore self-contained as an empirical demonstration; the reported metrics do not tautologically restate the algorithm's assumptions.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no explicit free parameters, background axioms, or newly postulated entities are stated in the provided text.

pith-pipeline@v0.9.0 · 5505 in / 1144 out tokens · 48156 ms · 2026-05-11T02:07:24.868338+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

49 extracted references · 49 canonical work pages

  1. [1]

    org/10.1021/acs.est.7b02184

    Hollender, J., Schymanski, E.L., Singer, H.P., Ferguson, P.L.: Nontarget Screening with High Resolution Mass Spectrometry in the Environment: Ready to Go? Environmental Science & Technology51(20), 11505–11512 (2017) https://doi. org/10.1021/acs.est.7b02184 . Accessed 2025-10-11

  2. [2]

    Analytical and Bioanalytical Chemistry397(3), 943–951 (2010) https://doi.org/10.1007/ s00216-010-3608-9

    Krauss, M., Singer, H., Hollender, J.: LC–high resolution MS in environmen- tal analysis: from target screening to the identification of unknowns. Analytical and Bioanalytical Chemistry397(3), 943–951 (2010) https://doi.org/10.1007/ s00216-010-3608-9 . Accessed 2025-10-23

  3. [3]

    Nature498(7453), 255–260 (2013) https://doi.org/10.1038/498255a

    Marx, V.: The big challenges of big data. Nature498(7453), 255–260 (2013) https://doi.org/10.1038/498255a . Accessed 2025-10-11

  4. [4]

    Analytical and Bioanalytical Chemistry407(21), 6237–6255 (2015) https://doi.org/10.1007/s00216-015-8681-7

    Schymanski, E.L., Singer, H.P., Slobodnik, J., Ipolyi, I.M., Oswald, P., Krauss, M., Schulze, T., Haglund, P., Letzel, T., Grosse, S., Thomaidis, N.S., Bletsou, A., Zwiener, C., Ib´ a˜ nez, M., Portol´ es, T., De Boer, R., Reid, M.J., Onghena, M., Kunkel, U., Schulz, W., Guillon, A., Noyon, N., Leroy, G., Bados, P., Bogialli, S., Stipaniˇ cev, D., Rostkow...

  5. [5]

    Annals of Data Science2(2), 165–193 (2015) https://doi.org/10.1007/s40745-015-0040-1

    Xu, D., Tian, Y.: A Comprehensive Survey of Clustering Algorithms. Annals of Data Science2(2), 165–193 (2015) https://doi.org/10.1007/s40745-015-0040-1 . Accessed 2025-10-11

  6. [6]

    The FAIR guiding principles for scientific data management and stewardship,

    Wilkinson, M.D., Dumontier, M., Aalbersberg, I.J., Appleton, G., Axton, M., 34 Baak, A., Blomberg, N., Boiten, J.-W., Da Silva Santos, L.B., Bourne, P.E., Bouw- man, J., Brookes, A.J., Clark, T., Crosas, M., Dillo, I., Dumon, O., Edmunds, S., Evelo, C.T., Finkers, R., Gonzalez-Beltran, A., Gray, A.J.G., Groth, P., Goble, C., Grethe, J.S., Heringa, J., ’T ...

  7. [7]

    Lloyd, Least squares quantization in PCM, IEEE Trans

    Lloyd, S.: Least squares quantization in PCM. IEEE Transactions on Informa- tion Theory28(2), 129–137 (1982) https://doi.org/10.1109/TIT.1982.1056489 . Accessed 2025-10-23

  8. [8]

    ACM Computing Surveys31(3), 264–323 (1999) https://doi.org/10.1145/331499.331504

    Jain, A.K., Murty, M.N., Flynn, P.J.: Data clustering: a review. ACM Computing Surveys31(3), 264–323 (1999) https://doi.org/10.1145/331499.331504 . Accessed 2025-10-11

  9. [9]

    In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining

    Ester, M., Kriegel, H.-P., Sander, J., Xu, X.: A density-based algorithm for discov- ering clusters in large spatial databases with noise. In: Proceedings of the Second International Conference on Knowledge Discovery and Data Mining. KDD’96, pp. 226–231. AAAI Press, Portland, Oregon (1996)

  10. [10]

    ACM Trans- actions on Knowledge Discovery from Data10(1), 1–51 (2015) https://doi.org/ 10.1145/2733381

    Campello, R.J.G.B., Moulavi, D., Zimek, A., Sander, J.: Hierarchical Density Estimates for Data Clustering, Visualization, and Outlier Detection. ACM Trans- actions on Knowledge Discovery from Data10(1), 1–51 (2015) https://doi.org/ 10.1145/2733381 . Accessed 2025-08-30 35

  11. [11]

    WIREs Data Mining and Knowledge Discovery2(1), 86–97 (2012) https://doi

    Murtagh, F., Contreras, P.: Algorithms for hierarchical clustering: an overview. WIREs Data Mining and Knowledge Discovery2(1), 86–97 (2012) https://doi. org/10.1002/widm.53 . Accessed 2025-10-23

  12. [12]

    Neural Networks4(4), 493– 504 (1991) https://doi.org/10.1016/0893-6080(91)90045-7

    Carpenter, G.A., Grossberg, S., Rosen, D.B.: ART 2-A: An adaptive resonance algorithm for rapid category learning and recognition. Neural Networks4(4), 493– 504 (1991) https://doi.org/10.1016/0893-6080(91)90045-7 . Accessed 2025-10-23

  13. [13]

    Cytometry Part A89(12), 1084–1096 (2016) https://doi.org/10.1002/cyto.a.23030

    Weber, L.M., Robinson, M.D.: Comparison of clustering methods for high- dimensional single-cell flow and mass cytometry data. Cytometry Part A89(12), 1084–1096 (2016) https://doi.org/10.1002/cyto.a.23030 . Accessed 2025-10-23

  14. [14]

    IEEE Transactions on Neural Networks 9(3), 544–559 (1998) https://doi.org/10.1109/72.668896

    Frank, T., Kraiss, K.-F., Kuhlen, T.: Comparative analysis of fuzzy ART and ART-2A network clustering performance. IEEE Transactions on Neural Networks 9(3), 544–559 (1998) https://doi.org/10.1109/72.668896 . Accessed 2025-08-30

  15. [15]

    Rapid Communi- cations in Mass Spectrometry39(S1), 9153 (2025) https://doi.org/10.1002/rcm

    Bittremieux, W., Laukens, K., Noble, W.S., Dorrestein, P.C.: Large-scale tandem mass spectrum clustering using fast nearest neighbor searching. Rapid Communi- cations in Mass Spectrometry39(S1), 9153 (2025) https://doi.org/10.1002/rcm. 9153 . Accessed 2025-08-30

  16. [16]

    Journal of Proteome Research, 8–00448 (2018) https://doi.org/10.1021/acs.jproteome.8b00448

    Wang, L., Li, S., Tang, H.: msCRUSH: Fast Tandem Mass Spectral Clustering Using Locality Sensitive Hashing. Journal of Proteome Research, 8–00448 (2018) https://doi.org/10.1021/acs.jproteome.8b00448 . Accessed 2025-10-28

  17. [17]

    Journal of Pro- teome Research22(6), 1639–1648 (2023) https://doi.org/10.1021/acs.jproteome

    Xu, W., Kang, J., Bittremieux, W., Moshiri, N., Rosing, T.: HyperSpec: Ultrafast Mass Spectra Clustering in Hyperdimensional Space. Journal of Pro- teome Research22(6), 1639–1648 (2023) https://doi.org/10.1021/acs.jproteome. 2c00612 . Accessed 2025-08-30 36

  18. [18]

    Annales de l’institut Henri Poincar´ e10(4), 215–310 (1948)

    Fr´ echet, M.: Les ´ el´ ements al´ eatoires de nature quelconque dans un espace dis- tanci´ e. Annales de l’institut Henri Poincar´ e10(4), 215–310 (1948). Publisher: INSTITUT HENRI POINCAR ´E ET GAUTHIER-VILLARS

  19. [19]

    Science344(6191), 1492–1496 (2014) https://doi.org/10.1126/science.1242072

    Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science344(6191), 1492–1496 (2014) https://doi.org/10.1126/science.1242072 . Accessed 2025-10-23

  20. [20]

    IEEE Transactions on Pattern Analysis and Machine IntelligenceP AMI-6(1), 81–87 (1984) https: //doi.org/10.1109/TPAMI.1984.4767478

    Selim, S.Z., Ismail, M.A.: K-Means-Type Algorithms: A Generalized Conver- gence Theorem and Characterization of Local Optimality. IEEE Transactions on Pattern Analysis and Machine IntelligenceP AMI-6(1), 81–87 (1984) https: //doi.org/10.1109/TPAMI.1984.4767478 . Accessed 2025-10-23

  21. [21]

    In: Proceedings of the 2004 SIAM International Conference on Data Mining, pp

    Banerjee, A., Merugu, S., Dhillon, I., Ghosh, J.: Clustering with Bregman Divergences. In: Proceedings of the 2004 SIAM International Conference on Data Mining, pp. 234–245. Society for Industrial and Applied Mathematics, Philadelphia, PA, USA (2004). https://doi.org/10.1137/1.9781611972740.22 . https://epubs.siam.org/doi/10.1137/1.9781611972740.22Accesse...

  22. [22]

    Atmospheric Chemistry and Physics24(12), 7001–7012 (2024) https://doi.org/ 10.5194/acp-24-7001-2024

    Shi, S., Zhai, J., Yang, X., Ruan, Y., Huang, Y., Chen, X., Zhang, A., Ye, J., Zheng, G., Cai, B., Zeng, Y., Wang, Y., Xing, C., Zhang, Y., Fu, T.-M., Zhu, L., Shen, H., Wang, C.: Technical note: Determining chemical composition of atmospheric single particles by a standard-free mass calibration algorithm. Atmospheric Chemistry and Physics24(12), 7001–701...

  23. [23]

    Dall’Osto, M., Harrison, R.M., Coe, H., Williams, P.I., Allan, J.D.: Real time chemical characterization of local and regional nitrate aerosols. Atmos. Chem. Phys. (2009) 37

  24. [24]

    Journal of Geophysical Research: Atmospheres 113(D12), 2007–009393 (2008) https://doi.org/10.1029/2007JD009393

    Moffet, R.C., Qin, X., Rebotier, T., Furutani, H., Prather, K.A.: Chemically segregated optical and microphysical properties of ambient aerosols measured in a single-particle mass spectrometer. Journal of Geophysical Research: Atmospheres 113(D12), 2007–009393 (2008) https://doi.org/10.1029/2007JD009393 . Number: D12. Accessed 2024-06-13

  25. [25]

    Atmospheric Environment41(40), 9397–9409 (2007) https://doi.org/10.1016/j.atmosenv.2007

    Reinard, M.S., Adou, K., Martini, J.M., Johnston, M.V.: Source characterization and identification by real-time single particle mass spectrometry. Atmospheric Environment41(40), 9397–9409 (2007) https://doi.org/10.1016/j.atmosenv.2007. 09.001 . Number: 40. Accessed 2023-12-14

  26. [26]

    Analytical Chemistry88(22), 10893–10899 (2016) https: //doi.org/10.1021/acs.analchem.6b02139

    Dexter, A., Race, A.M., Styles, I.B., Bunch, J.: Testing for Multivariate Nor- mality in Mass Spectrometry Imaging Data: A Robust Statistical Approach for Clustering Evaluation and the Generation of Synthetic Mass Spectrome- try Imaging Data Sets. Analytical Chemistry88(22), 10893–10899 (2016) https: //doi.org/10.1021/acs.analchem.6b02139 . Accessed 2025-10-23

  27. [27]

    Bioinformatics21(15), 3201–3212 (2005) https://doi.org/ 10.1093/bioinformatics/bti517

    Handl, J., Knowles, J., Kell, D.B.: Computational cluster validation in post- genomic data analysis. Bioinformatics21(15), 3201–3212 (2005) https://doi.org/ 10.1093/bioinformatics/bti517 . Accessed 2025-10-23

  28. [28]

    Annual Review of Biomedical Data Science3(1), 61– 87 (2020) https://doi.org/10.1146/annurev-biodatasci-011420-031537

    Alexandrov, T.: Spatial Metabolomics and Imaging Mass Spectrometry in the Age of Artificial Intelligence. Annual Review of Biomedical Data Science3(1), 61– 87 (2020) https://doi.org/10.1146/annurev-biodatasci-011420-031537 . Accessed 2025-10-23

  29. [29]

    Di- mensionality reduction for visualizing single-cell data us- ing UMAP,

    Becht, E., McInnes, L., Healy, J., Dutertre, C.-A., Kwok, I.W.H., Ng, L.G., Ginhoux, F., Newell, E.W.: Dimensionality reduction for visualizing single-cell data using UMAP. Nature Biotechnology37(1), 38–44 (2019) https://doi.org/ 10.1038/nbt.4314 . Accessed 2025-10-23 38

  30. [30]

    Nature Methods16(3), 243–245 (2019) https://doi.org/10.1038/ s41592-018-0308-4

    Linderman, G.C., Rachh, M., Hoskins, J.G., Steinerberger, S., Kluger, Y.: Fast interpolation-based t-SNE for improved visualization of single-cell RNA- seq data. Nature Methods16(3), 243–245 (2019) https://doi.org/10.1038/ s41592-018-0308-4 . Accessed 2025-10-23

  31. [31]

    Journal of Educational Psychology24, 498–520 (1933)

    Hotelling, H.: Analysis of a complex of statistical variables into principal components. Journal of Educational Psychology24, 498–520 (1933)

  32. [32]

    Nature Communications10(1), 5416 (2019) https://doi.org/10.1038/ s41467-019-13056-x

    Kobak, D., Berens, P.: The art of using t-SNE for single-cell transcrip- tomics. Nature Communications10(1), 5416 (2019) https://doi.org/10.1038/ s41467-019-13056-x . Accessed 2025-10-23

  33. [33]

    Diversity of synaptic protein complexes as a function of the abundance of their constituent proteins: A modeling approach

    Diaz-Papkovich, A., Anderson-Trocm´ e, L., Ben-Eghan, C., Gravel, S.: UMAP reveals cryptic population structure and phenotype heterogeneity in large genomic cohorts. PLOS Genetics15(11), 1008432 (2019) https://doi.org/10.1371/journal. pgen.1008432 . Accessed 2025-10-23

  34. [34]

    Science315(5814), 972–976 (2007) https://doi.org/10.1126/science.1136800

    Frey, B.J., Dueck, D.: Clustering by Passing Messages Between Data Points. Science315(5814), 972–976 (2007) https://doi.org/10.1126/science.1136800 . Accessed 2025-10-23

  35. [35]

    In: Proceedings of the 15th International Conference on Neural Information Processing Systems: Natural and Synthetic

    Ng, A.Y., Jordan, M.I., Weiss, Y.: On spectral clustering: analysis and an algo- rithm. In: Proceedings of the 15th International Conference on Neural Information Processing Systems: Natural and Synthetic. NIPS’01, pp. 849–856. MIT Press, Cambridge, MA, USA (2001). event-place: Vancouver, British Columbia, Canada

  36. [36]

    Campello, R.J.G.B., Moulavi, D., Sander, J.: Density-Based Clustering Based on Hierarchical Density Estimates. In: Pei, J., Tseng, V.S., Cao, L., Motoda, H., Xu, G., Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mat- tern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, 39 B., Sudan, M., Terzopoulos, D., Tygar, D., Vard...

  37. [37]

    In: Proceedings of the 24rd International Conference on Very Large Data Bases

    Weber, R., Schek, H.-J., Blott, S.: A Quantitative Analysis and Performance Study for Similarity-Search Methods in High-Dimensional Spaces. In: Proceedings of the 24rd International Conference on Very Large Data Bases. VLDB ’98, pp. 194–205. Morgan Kaufmann Publishers Inc., San Francisco, CA, USA (1998)

  38. [38]

    Journal of Geophysical Research: Atmo- spheres112(D22), 2006–008269 (2007) https://doi.org/10.1029/2006JD008269

    Holecek, J.C., Spencer, M.T., Prather, K.A.: Analysis of rainwater samples: Comparison of single particle residues with ambient particle chemistry from the northeast Pacific and Indian oceans. Journal of Geophysical Research: Atmo- spheres112(D22), 2006–008269 (2007) https://doi.org/10.1029/2006JD008269 . Number: D22. Accessed 2024-06-13

  39. [39]

    SIGMOD Rec

    Zhang, T., Ramakrishnan, R., Livny, M.: BIRCH: an efficient data cluster- ing method for very large databases. SIGMOD Rec.25(2), 103–114 (1996) https://doi.org/10.1145/235968.233324 . Place: New York, NY, USA Publisher: Association for Computing Machinery

  40. [40]

    Atmospheric Environment40(39), 7614–7631 (2006) https://doi.org/10.1016/j.atmosenv.2006.06.053

    Dallosto, M., Harrison, R.: Chemical characterisation of single airborne particles in Athens (Greece) by ATOFMS. Atmospheric Environment40(39), 7614–7631 (2006) https://doi.org/10.1016/j.atmosenv.2006.06.053 . Number: 39. Accessed 2024-11-14

  41. [41]

    Environmen- tal Science & Technology45(20), 8648–8655 (2011) https://doi.org/10.1021/ es2011836

    Hatch, L.E., Creamean, J.M., Ault, A.P., Surratt, J.D., Chan, M.N., Seinfeld, J.H., Edgerton, E.S., Su, Y., Prather, K.A.: Measurements of Isoprene-Derived 40 Organosulfates in Ambient Aerosols by Aerosol Time-of-Flight Mass Spectrom- etry—Part 2: Temporal Variability and Formation Mechanisms. Environmen- tal Science & Technology45(20), 8648–8655 (2011) h...

  42. [42]

    Bzdek, B.R., Ridge, D.P., Johnston, M.V.: Amine exchange into ammonium bisulfate and ammonium nitrate nuclei. Atmos. Chem. Phys. (2010)

  43. [43]

    Atmospheric Environment43(40), 6408–6414 (2009) https://doi.org/10.1016/j.atmosenv.2009

    Healy, R.M., O’Connor, I.P., Hellebust, S., Allanic, A., Sodeau, J.R., Wenger, J.C.: Characterisation of single particles from in-port ship emissions. Atmospheric Environment43(40), 6408–6414 (2009) https://doi.org/10.1016/j.atmosenv.2009. 07.039 . Number: 40. Accessed 2024-03-28

  44. [44]

    Atmospheric Chemistry and Physics12(4), 1681–1700 (2012) https://doi.org/10.5194/acp-12-1681-2012

    Healy, R.M., Sciare, J., Poulain, L., Kamili, K., Merkel, M., M¨ uller, T., Wieden- sohler, A., Eckhardt, S., Stohl, A., Sarda-Est` eve, R., McGillicuddy, E., O’Connor, I.P., Sodeau, J.R., Wenger, J.C.: Sources and mixing state of size-resolved ele- mental carbon particles in a European megacity: Paris. Atmospheric Chemistry and Physics12(4), 1681–1700 (2...

  45. [45]

    Atmospheric Environment (2011)

    Bi, X.: Mixing state of biomass burning particles by single particle aerosol mass spectrometer in the urban area of PRD, China. Atmospheric Environment (2011)

  46. [46]

    International Journal of Mass Spectrometry258(1-3), 142–150 (2006) https://doi.org/10.1016/j.ijms

    Qin, X., Prather, K.A.: Impact of biomass emissions on particle chemistry dur- ing the California Regional Particulate Air Quality Study. International Journal of Mass Spectrometry258(1-3), 142–150 (2006) https://doi.org/10.1016/j.ijms. 2006.09.004 . Number: 1-3. Accessed 2024-06-13

  47. [47]

    Environmental Science & Technology33(18), 3068–3076 (1999) https://doi.org/10.1021/es980544p

    Silva, P.J., Liu, D.-Y., Noble, C.A., Prather, K.A.: Size and Chemical Character- ization of Individual Particles Resulting from Biomass Burning of Local Southern 41 California Species. Environmental Science & Technology33(18), 3068–3076 (1999) https://doi.org/10.1021/es980544p . Accessed 2025-12-04

  48. [48]

    Sci- ence279(5354), 1184–1187 (1998) https://doi.org/10.1126/science.279.5354.1184

    Gard, E.E., Kleeman, M.J., Gross, D.S., Hughes, L.S., Allen, J.O., Morrical, B.D., Fergenson, D.P., Dienes, T., G¨ alli, M.E., Johnson, R.J., Cass, G.R., Prather, K.A.: Direct Observation of Heterogeneous Chemistry in the Atmosphere. Sci- ence279(5354), 1184–1187 (1998) https://doi.org/10.1126/science.279.5354.1184 . eprint: https://www.science.org/doi/pd...

  49. [49]

    Journal of Geophysical Research: Atmospheres115(D15), 2009–013606 (2010) https://doi.org/10.1029/2009JD013606

    Pratt, K.A., Twohy, C.H., Murphy, S.M., Moffet, R.C., Heymsfield, A.J., Gaston, C.J., DeMott, P.J., Field, P.R., Henn, T.R., Rogers, D.C., Gilles, M.K., Seinfeld, J.H., Prather, K.A.: Observation of playa salts as nuclei in orographic wave clouds. Journal of Geophysical Research: Atmospheres115(D15), 2009–013606 (2010) https://doi.org/10.1029/2009JD013606...