pith. machine review for the scientific record. sign in

arxiv: 1804.01503 · v2 · submitted 2018-04-04 · 💻 cs.AI · cs.CL

Recognition: unknown

Abstractive Tabular Dataset Summarization via Knowledge Base Semantic Embeddings

Authors on Pith no claims yet
classification 💻 cs.AI cs.CL
keywords basedatadatasetknowledgetypesabstractivedescriptiveembedding
0
0 comments X
read the original abstract

This paper describes an abstractive summarization method for tabular data which employs a knowledge base semantic embedding to generate the summary. Assuming the dataset contains descriptive text in headers, columns and/or some augmenting metadata, the system employs the embedding to recommend a subject/type for each text segment. Recommendations are aggregated into a small collection of super types considered to be descriptive of the dataset by exploiting the hierarchy of types in a pre-specified ontology. Using February 2015 Wikipedia as the knowledge base, and a corresponding DBpedia ontology as types, we present experimental results on open data taken from several sources--OpenML, CKAN and data.world--to illustrate the effectiveness of the approach.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.