pith. sign in

arxiv: 2606.07843 · v1 · pith:OTI5H3UDnew · submitted 2026-06-05 · 💻 cs.DB · cs.IR· cs.LG

RACT: Retrieval Augmented Column-Table Learning and Prediction for Multi-Table Schema Matching

classification 💻 cs.DB cs.IRcs.LG
keywords matchingschemacolumnsmulti-tabletablescolumndifferentexperiments
0
0 comments X
read the original abstract

Schema matching, a critical task for integrating data from diverse sources, seeks to identify correspondences between columns across different schemas. In multi-table holistic schema matching, columns with similar semantic meaning may reside in tables with different contexts due to heterogeneous schema designs, where similarity-based techniques are inadequate. The focus of this paper is exploiting referential context into schema matching by introducing RACT learning and prediction, a self-supervised framework enabling the probabilistic retrieval of candidate tables for source columns to constrain relevant column candidates. Experiments demonstrate that this approach outperforms similarity-based baselines on matching multi-table schemas. In subsequent matching experiments, constraining the column search space via top-t tables improves both average matching precision and completeness by up to +70%.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.