pith. sign in

arxiv: 2307.04468 · v1 · pith:R7ROMFXLnew · submitted 2023-07-10 · 💻 cs.LG

Badgers: generating data quality deficits with Python

classification 💻 cs.LG
keywords databadgersqualitydeficitsfraunhofer-iesegeneratinggithubhttps
0
0 comments X
read the original abstract

Generating context specific data quality deficits is necessary to experimentally assess data quality of data-driven (artificial intelligence (AI) or machine learning (ML)) applications. In this paper we present badgers, an extensible open-source Python library to generate data quality deficits (outliers, imbalanced data, drift, etc.) for different modalities (tabular data, time-series, text, etc.). The documentation is accessible at https://fraunhofer-iese.github.io/badgers/ and the source code at https://github.com/Fraunhofer-IESE/badgers

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.