NeurBench: A Benchmark Suite for Learned Database Components with Drift Modeling

Zhanhao Zhao , Haotian Gao , Naili Xing , Lingze Zeng , Meihui Zhang , Gang Chen , Manuel Rigger , Beng Chin Ooi

Authors on Pith no claims yet

classification 💻 cs.DB

keywords driftcomponentsdatabaselearneddataunderworkloadneurbench

read the original abstract

Learned database components, which deeply integrate machine learning into their design, have been extensively studied in recent years. Given the dynamism of databases, where data and workloads continuously drift, it is crucial for learned database components to remain effective and efficient in the face of data and workload drift. Robustness, therefore, is a key factor in assessing their practical applicability. Although recent works examine learned database components under specific drift, they fail to enable systematic performance evaluations across a broad range of drift or under customized drift as needed. This paper presents NeurBench, a new benchmark suite that supports evaluating learned database components under measurable and controllable data and workload drift. We quantify diverse types of drift by introducing a key concept called the drift factor. Building on this formulation, we propose a drift-aware data and workload generation framework that effectively simulates real-world drift while preserving inherent correlations. Experimental results demonstrate the effectiveness of NeurBench in generating realistic data and workload drift, while providing insights into the performance of representative learned database components under different drift scenarios.

This paper has not been read by Pith yet.

NeurBench: A Benchmark Suite for Learned Database Components with Drift Modeling

discussion (0)