
Integrity report for SomaliWeb v1: A Quality-Filtered Somali Web Corpus with a Matched Tokenizer and a Public Language-Identification Benchmark

A machine-verified record of the checks Pith has run against this paper: detector runs, findings, signed bundle events, and canonical identifiers.

arXiv:2605.18232 · pith:2026:GUVSMPMUAXN4YZYQJQLPUMY66Q

Critical: 0
Advisory: 0
Detectors run: 5
Last checked: 2026-05-26


Detector runs

ai_meta_artifact completed v1.0.0 · findings 0 · 2026-05-26 23:38:46.306168+00:00
claim_evidence completed v1.0.0 · findings 0 · 2026-05-25 13:43:49.607708+00:00
doi_title_agreement completed v1.0.0 · findings 0 · 2026-05-25 13:32:45.327218+00:00
doi_compliance completed v1.0.0 · findings 0 · 2026-05-25 11:27:37.892539+00:00
cited_work_retraction completed v1.0.0 · findings 0 · 2026-05-23 00:22:45.383654+00:00
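
The run lines above follow a regular layout (detector name, status, version, finding count, timestamp), so they can be tallied mechanically. The following is a minimal sketch that parses the plain-text lines exactly as they appear in this report; the regular expression, the `parse_run` helper, and the field names are illustrative assumptions, not part of any Pith API or of the integrity.json schema.

```python
import re
from datetime import datetime

# Assumed pattern for the plain-text run lines shown in this report.
# It is NOT an official Pith format definition.
RUN_LINE = re.compile(
    r"^(?P<detector>\S+) (?P<status>\S+) v(?P<version>[\d.]+) · "
    r"findings (?P<findings>\d+) · (?P<timestamp>.+)$"
)

# The five detector runs, copied verbatim from the report.
RUNS_TEXT = """\
ai_meta_artifact completed v1.0.0 · findings 0 · 2026-05-26 23:38:46.306168+00:00
claim_evidence completed v1.0.0 · findings 0 · 2026-05-25 13:43:49.607708+00:00
doi_title_agreement completed v1.0.0 · findings 0 · 2026-05-25 13:32:45.327218+00:00
doi_compliance completed v1.0.0 · findings 0 · 2026-05-25 11:27:37.892539+00:00
cited_work_retraction completed v1.0.0 · findings 0 · 2026-05-23 00:22:45.383654+00:00
"""


def parse_run(line):
    """Parse one run line into a dict (hypothetical helper)."""
    match = RUN_LINE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized run line: {line!r}")
    record = match.groupdict()
    record["findings"] = int(record["findings"])
    # fromisoformat accepts the space separator and the +00:00 offset.
    record["timestamp"] = datetime.fromisoformat(record["timestamp"])
    return record


runs = [parse_run(line) for line in RUNS_TEXT.splitlines() if line.strip()]

total_findings = sum(run["findings"] for run in runs)
all_completed = all(run["status"] == "completed" for run in runs)
last_checked = max(run["timestamp"] for run in runs).date().isoformat()

print(f"{len(runs)} detectors run, {total_findings} findings, "
      f"last checked {last_checked}")
```

Under these assumptions the tally agrees with the summary counters at the top of the report: five detectors, zero findings, and a most recent run on 2026-05-26.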

Findings

No public integrity findings for this paper.

Signed record

The machine-readable record for this paper lives at /pith/GUVSMPMUAXN4YZYQJQLPUMY66Q/integrity.json. Pith Number bundles also include signed pith.integrity.v1 events where a Pith Number exists.