pith:ZLZTKUJA
A Systematic Evaluation of Imbalance Handling Methods in Biomedical Binary Classification
Imbalance handling boosts complex models on unstructured biomedical data but harms simple ones.
arxiv:2605.14147 v1 · 2026-05-13 · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{ZLZTKUJAYXPUKWIPQV26DXQGTK}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
clear benefits were observed for more complex models and unstructured data: (a) ROS and RW consistently enhanced the performance of powerful models; (b) direct F1-score optimization demonstrated utility primarily for unstructured text and image data; and (c) RUS and SMOTE consistently degraded performance and are therefore not recommended.
That the three chosen public datasets and the selected model architectures sufficiently represent the broader space of biomedical binary classification problems so that the observed patterns generalize.
Random oversampling and re-weighting boost complex models on unstructured biomedical data, but undersampling and SMOTE degrade results and simple models on tabular data see no benefit.
References
Receipt and verification
| First computed | 2026-05-17T23:39:11.617459Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
caf3355120c5df45590f8575e1de069a84adc372ca6e45c6f0b9408aa61771b3
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/ZLZTKUJAYXPUKWIPQV26DXQGTK \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: caf3355120c5df45590f8575e1de069a84adc372ca6e45c6f0b9408aa61771b3
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "eaeacbea86cb77724acb8d2b0c3ae14d722b91944782a9fa53bd7189b76c53f6",
"cross_cats_sorted": [],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-05-13T21:57:38Z",
"title_canon_sha256": "a44340b3470611b140eb2bbba369beab5a7aae9958673643771f9884001ed26d"
},
"schema_version": "1.0",
"source": {
"id": "2605.14147",
"kind": "arxiv",
"version": 1
}
}