Ishigaki-IDS-Bench supplies 166 verified bilingual examples plus audits to measure LLMs on producing standard-compliant IDS XML from BIM requirements, with best models at 65.6% macro F1 but only 27.7% passing content audits.
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.CL (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
Ishigaki-IDS-Bench: A Benchmark for Generating Information Delivery Specification from BIM Information Requirements