pith:N6EZ6AJS
Topical Shifts in the Dark Web: A Longitudinal Analysis of Content from the Cybercrime Ecosystem
Dark web cybercrime discussions concentrate 75% of their volume in a small set of persistent core topics that last a median of 75 months.
arxiv:2605.15345 v1 · 2026-05-14 · cs.CR
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{N6EZ6AJSJIPNRRUHTAH4BSXANG}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
approximately 75% of total discussion volume is concentrated in a small set of persistent core topics, while short-lived themes account for approximately 3% of activity. The median topic lifespan is 75 months, indicating gradual thematic evolution rather than abrupt replacement.
The longitudinal topic-modeling framework that combines domain-specific embeddings, density-based clustering and temporal aggregation correctly identifies thematic clusters and measures their prevalence and lifespan at the website level without major distortion from snapshot collection biases or hyperparameter choices.
Longitudinal topic modeling on a large dark web dataset finds 75% of discussion volume in persistent core topics with a median lifespan of 75 months and only 3% in short-lived themes.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:00:53.616821Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
6f899f01324a1ed8c687980fc0cae069b71a49d814411a825d1cbc9d825eaf37
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/N6EZ6AJSJIPNRRUHTAH4BSXANG \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 6f899f01324a1ed8c687980fc0cae069b71a49d814411a825d1cbc9d825eaf37
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "2ec8eeba59bf08dfee022acefd617dd72f373e6816ea11d7989261cb13633fe6",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CR",
"submitted_at": "2026-05-14T19:14:53Z",
"title_canon_sha256": "c5c1fd099ef8c604eb8d938da6dc661dc44d9d044e2578fcf87caee96a9c8aae"
},
"schema_version": "1.0",
"source": {
"id": "2605.15345",
"kind": "arxiv",
"version": 1
}
}