Fraud Detection in Cryptocurrency Markets with Spatio-Temporal Graph Neural Networks
Pith reviewed 2026-05-08 03:55 UTC · model grok-4.3
The pith
Spatio-temporal graph neural networks detect coordinated cryptocurrency fraud by modeling connections between assets from hourly market data.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors demonstrate that three graph construction methods based on aggregated hourly cryptocurrency market data, when processed by a unified spatio-temporal GNN architecture combining attention-based spatial aggregation with temporal Transformer encoding, yield significant improvements over standard machine learning baselines for detecting pump-and-dump schemes on a dataset spanning more than three years. The work establishes that learned market connectivity supplies the relational structure needed to identify coordinated manipulation events.
What carries the argument
A spatio-temporal Graph Neural Network that applies attention-based spatial aggregation over graphs built from hourly market data and uses temporal Transformer encoding to process time series of those graph representations.
If this is right
- Detection performance improves when models explicitly represent transfers and repetitions across tokens rather than analyzing each asset in isolation.
- Attention mechanisms within the spatial layer learn which asset connections carry the strongest manipulation signals.
- Temporal Transformer components allow the system to track how coordinated activity unfolds over hours and days.
- The overall framework applies to any setting where fraud involves repeated interactions among a group of related financial instruments.
Where Pith is reading between the lines
- Similar graph constructions could be tested on other coordinated financial crimes such as insider trading rings or wash trading across exchanges.
- Replacing the fixed hourly aggregation with event-driven graph updates might enable earlier detection before a scheme fully develops.
- The three graph construction methods likely differ in how they encode volume correlations versus price movements, suggesting future work could combine them adaptively.
Load-bearing premise
Market manipulation schemes are defined by coordination, repetition, and transfers among related assets that graphs from aggregated hourly data can capture.
What would settle it
Running the same pump-and-dump detection task on the three-year dataset and finding that the graph models show no statistically significant accuracy gain over non-graph baselines, or that ablating the graph edges removes the reported performance lift.
Figures
read the original abstract
Technological advancements in cryptocurrency markets have increased accessibility for investors, but concurrently exposed them to the risks of market manipulations. Existing fraud detection mechanisms typically rely on machine learning methods that treat each financial asset (i.e., token) and its related transactions independently. However, market manipulation strategies are rarely isolated events, but are rather characterized by coordination, repetition, and frequent transfers among related assets. This suggests that relational structure constitutes an integral component of the signal and can be effectively represented through graphical means. In this paper, we propose three graph construction methods that rely on aggregated hourly market data. The proposed graphs are processed by a unified spatio-temporal Graph Neural Network (GNN) architecture that combines attention-based spatial aggregation with temporal Transformer encoding. We evaluate our methodology on a real-world dataset comprised of pump-and-dump schemes in cryptocurrency markets, spanning a period of over three years. Our comparative results showcase that our graph-based models achieve significant improvements over standard machine learning baselines in detecting anomalous events. Our work highlights that learned market connectivity provides substantial gains for detecting coordinated market manipulation schemes.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes three graph construction methods from aggregated hourly cryptocurrency market data to capture coordinated market manipulations such as pump-and-dump schemes. These graphs are processed by a unified spatio-temporal GNN architecture combining attention-based spatial aggregation with temporal Transformer encoding. The approach is evaluated on a real-world 3-year dataset of anomalous events, with comparative results claiming significant improvements over standard machine learning baselines.
Significance. If the reported gains hold under rigorous controls, the work offers a timely advance in financial fraud detection by explicitly modeling relational coordination among assets rather than treating them independently. The attention-plus-Transformer design is a natural fit for spatio-temporal market signals and could inform monitoring tools in decentralized markets.
major comments (2)
- [§4] §4 (Experiments): the central claim of 'significant improvements' is presented without accompanying quantitative metrics (e.g., precision, recall, F1, AUC), exact baseline specifications, standard deviations across runs, or statistical significance tests, preventing assessment of effect size and reproducibility.
- [§3.1–3.2] §3.1–3.2 (Graph construction): the three methods depend on free parameters (connectivity thresholds and aggregation windows) listed in the axiom ledger; no sensitivity analysis or ablation on these choices is reported, which directly affects the robustness of the learned market connectivity.
minor comments (2)
- [Abstract] Abstract: the performance claim would be more informative if it included at least one concrete metric (e.g., 'F1-score improvement of X%') rather than the qualitative term 'significant'.
- [§3] Notation: the distinction between the three graph-construction variants could be clarified with a small comparison table early in §3.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address each major comment point by point below, indicating planned revisions where appropriate. Our responses focus on clarifying and strengthening the experimental reporting and robustness analysis without altering the core contributions.
read point-by-point responses
-
Referee: §4 (Experiments): the central claim of 'significant improvements' is presented without accompanying quantitative metrics (e.g., precision, recall, F1, AUC), exact baseline specifications, standard deviations across runs, or statistical significance tests, preventing assessment of effect size and reproducibility.
Authors: We acknowledge that the current presentation of results in §4 lacks sufficient detail for full reproducibility and effect-size assessment. While the manuscript includes comparative performance figures, we did not report the full suite of metrics (precision, recall, F1, AUC), exact baseline implementations and hyperparameters, standard deviations over multiple runs, or formal statistical tests. In the revised version we will expand §4 with a comprehensive results table containing all requested metrics (mean ± std over 5 runs), precise baseline descriptions, and p-values from paired t-tests against the best baseline to quantify significance. revision: yes
-
Referee: §3.1–3.2 (Graph construction): the three methods depend on free parameters (connectivity thresholds and aggregation windows) listed in the axiom ledger; no sensitivity analysis or ablation on these choices is reported, which directly affects the robustness of the learned market connectivity.
Authors: We agree that the absence of sensitivity analysis on the connectivity thresholds and aggregation windows limits claims about robustness. These parameters were chosen via preliminary validation on a held-out subset of the dataset using domain knowledge of typical pump-and-dump time scales. In the revision we will add a dedicated subsection under §3 that reports performance variation when each parameter is swept over a grid of plausible values (e.g., thresholds from 0.1 to 0.9, windows from 1 h to 24 h), together with an ablation that isolates the contribution of each graph-construction choice to final detection metrics. revision: yes
Circularity Check
No significant circularity; empirical claims are self-contained
full rationale
The paper proposes three graph-construction methods from hourly market data and a unified spatio-temporal GNN (attention-based spatial aggregation plus temporal Transformer) that are evaluated on a three-year pump-and-dump dataset. Performance gains are shown via direct comparison against standard ML baselines. No derivation reduces by construction to its own inputs, no fitted parameter is relabeled as a prediction, and no load-bearing premise rests on a self-citation chain. The argument is externally falsifiable through the reported comparative metrics and remains independent of the authors' prior work.
Axiom & Free-Parameter Ledger
free parameters (1)
- graph construction thresholds and aggregation choices
Reference graph
Works this paper leans on
-
[1]
Netherlands Authority for the Financial Markets (AFM), “Market manipulation,” Regulatory brochure (Market Abuse Regulation context), 2017, brochure processed ESMA Q&As up to 6 July 2017
work page 2017
-
[2]
Market manipulation: A survey,
T. J. Putni n ¸ˇs, “Market manipulation: A survey,” Journal of Economic Surveys, vol. 26, no. 5, pp. 952–967, 2012
work page 2012
-
[3]
An examination of the cryptocurrency pump-and-dump ecosystem,
J. T. Hamrick, F. Rouhi, A. Mukherjee, A. Feder, N. Gandal, T. Moore, and M. Vasek, “An examination of the cryptocurrency pump-and-dump ecosystem,” Information Processing and Management, vol. 58, no. 4, p. 102610, 2021
work page 2021
-
[4]
Some traders are talking up cryptocur- rencies, then dumping them, costing others millions,
S. Shifflett and P. Vigna, “Some traders are talking up cryptocur- rencies, then dumping them, costing others millions,” The Wall Street Journal, Aug. 2018, available at: https://www.wsj.com/graphics/ cryptocurrency-schemes-generate-big-coin/
work page 2018
-
[5]
B. Merewhuader, C. Polite, and T. Viane, “Class action complaint, merewhuaderet al.v. safemoon llcet al., no. 2:22-cv-01108 (c.d. cal. feb. 17, 2022),” Filed in U.S. District Court for the Central District of California, 2022
work page 2022
-
[6]
The anatomy of a cryptocurrency Pump-and-Dump scheme,
J. Xu and B. Livshits, “The anatomy of a cryptocurrency Pump-and-Dump scheme,” in 28th USENIX Security Symposium (USENIX Security 19). Santa Clara, CA: USENIX Association, 2019, pp. 1609–1625
work page 2019
-
[7]
Graph based anomaly detection and description: a survey,
L. Akoglu, H. Tong, and D. Koutra, “Graph based anomaly detection and description: a survey,” Data Mining and Knowledge Discovery, vol. 29, no. 3, pp. 626–688, 2015
work page 2015
-
[8]
A comprehensive survey on graph anomaly detection with deep learning,
X. Ma, J. Wu, S. Xue, J. Yang, C. Zhou, Q. Z. Sheng, H. Xiong, and L. Akoglu, “A comprehensive survey on graph anomaly detection with deep learning,” IEEE transactions on knowledge and data engineering, vol. 35, no. 12, pp. 12 012–12 038, 2021. 7
work page 2021
-
[9]
Enhancing graph neural network-based fraud detectors against camouflaged fraudsters,
Y . Dou, Z. Liu, L. Sun, Y . Deng, H. Peng, and P. S. Yu, “Enhancing graph neural network-based fraud detectors against camouflaged fraudsters,” in Proceedings of the 29th ACM International Conference on Information & Knowledge Management (CIKM ’20). Virtual Event, Ireland: Association for Computing Machinery, Oct. 2020
work page 2020
-
[10]
GAMLNet: a graph based framework for the detection of money laundering,
J. Schmidt, D. Pasadakis, M. Sathe, and O. Schenk, “GAMLNet: a graph based framework for the detection of money laundering,” in 2024 11th IEEE Swiss Conference on Data Science (SDS), 2024, pp. 241–245
work page 2024
-
[11]
Tokenscout: Early detection of ethereum scam tokens via temporal graph learning,
C. Wu, J. Chen, Z. Zhao, K. He, G. Xu, Y . Wu, H. Wang, H. Li, Y . Liu, and Y . Xiang, “Tokenscout: Early detection of ethereum scam tokens via temporal graph learning,” in Proceedings of the 2024 on ACM SIGSAC Conference on Computer and Communications Security, CCS 2024, Salt Lake City, UT, USA, October 14-18, 2024, B. Luo, X. Liao, J. Xu, E. Kirda, and ...
work page 2024
-
[12]
Profit or deceit? mitigating pump and dump in defi via graph and contrastive learning,
C. Wu, J. Chen, J. Li, J. Xu, J. Jia, Y . Hu, Y . Feng, Y . Liu, and Y . Xiang, “Profit or deceit? mitigating pump and dump in defi via graph and contrastive learning,” IEEE Transactions on Information Forensics and Security, vol. 20, pp. 8994–9008, 2025
work page 2025
-
[13]
Z. Lin, Q. Luo, D. Wu, J. Shen, L. Li, X. Nong, and Z. Qin, “Detecting illicit transactions in bitcoin: a wavelet-temporal graph transformer approach for anti-money laundering,” Scientific Reports, vol. 16, no. 1, p. 1548, 2026
work page 2026
-
[14]
Anomaly detection in global financial markets with graph neural networks and nonextensive entropy,
K. da Costa, “Anomaly detection in global financial markets with graph neural networks and nonextensive entropy,” 2023, arXiv:2308.02914v2
-
[15]
M. La Morgia, A. Mei, F. Sassi, and J. Stefa, “The pump and dump scheme in the bitcoin era: Real time detection of cryptocurrency market manipulations,” in 2020 29th International Conference on Computer Communications and Networks (ICCCN). IEEE, 2020, pp. 1–9
work page 2020
-
[16]
L. Breiman, “Random forests,” Machine Learning, vol. 45, no. 1, pp. 5–32, 2001
work page 2001
-
[17]
Xgboost: A scalable tree boosting system,
T. Chen and C. Guestrin, “Xgboost: A scalable tree boosting system,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ser. KDD ’16. New York, NY , USA: Association for Computing Machinery, 2016, p. 785–794
work page 2016
-
[18]
G. Corso, H. Stark, S. Jegelka, T. Jaakkola, and R. Barzilay, “Graph neural networks,” Nature Reviews Methods Primers, vol. 4, no. 1, p. 17, 2024
work page 2024
-
[19]
Spatio-temporal graph convolutional net- works: A deep learning framework for traffic forecasting,
B. Yu, H. Yin, and Z. Zhu, “Spatio-temporal graph convolutional net- works: A deep learning framework for traffic forecasting,” in Proceedings of the 27th International Joint Conference on Artificial Intelligence, 2018, pp. 3634–3640
work page 2018
-
[20]
Spatio-temporal graph neural networks for predictive learning in urban computing: A survey,
G. Jin, Y . Liang, Y . Fang, Z. Shao, J. Huang, J. Zhang, and Y . Zheng, “Spatio-temporal graph neural networks for predictive learning in urban computing: A survey,” IEEE transactions on knowledge and data engineering, vol. 36, no. 10, pp. 5388–5408, 2023
work page 2023
-
[21]
Connecting the dots: Multivariate time series forecasting with graph neural networks,
Z. Wu, S. Pan, G. Long, J. Jiang, X. Chang, and C. Zhang, “Connecting the dots: Multivariate time series forecasting with graph neural networks,” in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2020, pp. 753–763
work page 2020
-
[22]
Semi-supervised classification with graph convolutional networks,
T. N. Kipf and M. Welling, “Semi-supervised classification with graph convolutional networks,” in 5th International Conference on Learning Representations, ICLR 2017, Toulon, France, April 24-26, 2017, Conference Track Proceedings, 2017
work page 2017
-
[23]
P. Velickovic, G. Cucurull, A. Casanova, A. Romero, P. Li `o, and Y . Bengio, “Graph attention networks,” in 6th International Conference on Learning Representations, ICLR 2018, Vancouver,BC, Canada, April 30 - May 3, 2018, Conference Track Proceedings, 2018
work page 2018
-
[24]
How powerful are graph neural networks?
K. Xu, W. Hu, J. Leskovec, and S. Jegelka, “How powerful are graph neural networks?” in 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019, 2019
work page 2019
-
[25]
Binance spot api documentation: Rest api,
Binance, “Binance spot api documentation: Rest api,” https://developers. binance.com/docs/binance-spot-api-docs/rest-api, accessed: 2026-02-01
work page 2026
-
[26]
CoinGecko, “Coingecko api documentation,” https://docs.coingecko.com/ reference/introduction, accessed: 2026-02-01
work page 2026
-
[27]
Sparse quadratic approxi- mation for graph learning,
D. Pasadakis, M. Bollh ¨ofer, and O. Schenk, “Sparse quadratic approxi- mation for graph learning,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 9, pp. 11 256–11 269, 2023
work page 2023
-
[28]
Discrete graph structure learning for forecasting multiple time series,
C. Shang, J. Chen, and J. Bi, “Discrete graph structure learning for forecasting multiple time series,” inInternational Conference on Learning Representations (ICLR), 2021
work page 2021
-
[29]
Correlation, hierarchies, and networks in financial markets,
M. Tumminello, T. Aste, T. Di Matteo, and R. N. Mantegna, “Correlation, hierarchies, and networks in financial markets,” Journal of Economic Behavior & Organization, vol. 75, no. 1, pp. 40–58, 2010
work page 2010
-
[30]
Exploring graph neural networks for stock market predictions with rolling window analysis,
D. Matsunaga, T. Suzumura, and T. Takahashi, “Exploring graph neural networks for stock market predictions with rolling window analysis,” CoRR, vol. abs/1909.10660, 2019
-
[31]
Graph wavenet for deep spatial-temporal graph modeling,
Z. Wu, S. Pan, G. Long, J. Jiang, and C. Zhang, “Graph wavenet for deep spatial-temporal graph modeling,” in Proceedings of the 28th International Joint Conference on Artificial Intelligence, ser. IJCAI’19. AAAI Press, 2019, p. 1907–1913
work page 2019
-
[32]
Masked label prediction: Unified message passing model for semi-supervised clas- sification,
Y . Shi, Z. Huang, S. Feng, H. Zhong, W. Wang, and Y . Sun, “Masked label prediction: Unified message passing model for semi-supervised clas- sification,” inProceedings of the Thirtieth International Joint Conference on Artificial Intelligence, IJCAI-21, Z.-H. Zhou, Ed. International Joint Conferences on Artificial Intelligence Organization, 8 2021, pp. 1...
work page 2021
-
[33]
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, Ł. Kaiser, and I. Polosukhin, “Attention is all you need,” Advances in Neural Information Processing Systems, vol. 30, 2017
work page 2017
-
[34]
Pump and dumps in the bitcoin era: Real time detection of cryptocurrency market manipulations,
M. La Morgia, A. Mei, F. Sassi, and J. Stefa, “Pump and dumps in the bitcoin era: Real time detection of cryptocurrency market manipulations,” in 2020 29th International Conference on Computer Communications and Networks (ICCCN), 2020, pp. 1–9
work page 2020
-
[35]
L´opez de Prado, Advances in Financial Machine Learning
M. L´opez de Prado, Advances in Financial Machine Learning. Hoboken, NJ: John Wiley & Sons, 2018
work page 2018
-
[36]
T. Saito and M. Rehmsmeier, “The precision-recall plot is more informative than the roc plot when evaluating binary classifiers on imbalanced datasets,” PloS one, vol. 10, no. 3, p. e0118432, 2015. 8 VI. APPENDIX A. Features Feature Type Definition OHLCV raw Open/high/low/close and volume. quote asset volume raw V olume in quote currency. num trades raw T...
work page 2015
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.