pith:SOH5VECQ
Mind the Gap: Impact of Synthetic Conversational Data on Multi-Talker ASR and Speaker Diarization
Synthetic conversational data approaches real-data baselines and mixing both yields substantial gains for multi-talker ASR and speaker diarization.
arxiv:2605.15442 v1 · 2026-05-14 · eess.AS
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{SOH5VECQLZ76DNVTH6JJPW3OZ6}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
synthetic-only training approaches real-data baselines, and combining simulated data with real recordings yields substantial gains over real-only training across both tasks.
The specific simulation choices and acoustic augmentations in FastMSS produce mixtures whose statistical properties are close enough to real conversational recordings that performance trends observed on synthetic data will transfer to real-world use.
Task-dependent simulation strategies for synthetic conversational data allow synthetic-only training to approach real-data baselines for multi-talker ASR and diarization, with mixing yielding further gains.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-20T00:00:58.798834Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
938fda90505e7fe1b6b33f9297db6ecfb4f833dcfcd2f4ee65238276b5fdcec2
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/SOH5VECQLZ76DNVTH6JJPW3OZ6 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 938fda90505e7fe1b6b33f9297db6ecfb4f833dcfcd2f4ee65238276b5fdcec2
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "c7db74dcf0436e3d25462141da0e6e42e0e66a74898fd36161926e839a4da411",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "eess.AS",
"submitted_at": "2026-05-14T21:53:10Z",
"title_canon_sha256": "e32003fb5efc1f3058cacc7ca202331703c5124bc925962b434adeddbd4d0377"
},
"schema_version": "1.0",
"source": {
"id": "2605.15442",
"kind": "arxiv",
"version": 1
}
}