The authors create and release a large real-world dirty postal address dataset with ground truth to benchmark data cleaning methods and highlight limitations of existing approaches.
1 Pith paper cites this work. Polarity classification is still being indexed.
fields: cs.DB (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers:
Clean Me If You Can: A Large Collection of Real-World Addresses for Data Cleaning Benchmarking