A new catalog classifying 35 data error types into missing, incorrect, and redundant categories for tabular data, with definitions and examples to improve data quality management.
Journal of Management Information Systems12(4), 5–33 (1996)
3 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 3verdicts
UNVERDICTED 3roles
background 1polarities
background 1representative citing papers
A multi-layer ELT data quality framework using LLM semantic tests detected all 16 injected anomalies versus 7 in a manual baseline, with full cross-store agreement in 106 seconds.
The evaluation finds proprietary tools like Informatica and Ataccama offer more comprehensive features and emerging LLM assistance than open-source options like Great Expectations and Deequ, but LLM integration across all tools is limited to rule creation with no direct data validation support.
citing papers explorer
-
A Catalog of Data Errors
A new catalog classifying 35 data error types into missing, incorrect, and redundant categories for tabular data, with definitions and examples to improve data quality management.
-
A Multi-Layer Testing Framework for Automated Data Quality Assurance in Cloud-Native ELT Pipelines
A multi-layer ELT data quality framework using LLM semantic tests detected all 16 injected anomalies versus 7 in a manual baseline, with full cross-store agreement in 106 seconds.
-
Evaluating Data Quality Tools: Measurement Capabilities and LLM Integration
The evaluation finds proprietary tools like Informatica and Ataccama offer more comprehensive features and emerging LLM assistance than open-source options like Great Expectations and Deequ, but LLM integration across all tools is limited to rule creation with no direct data validation support.