Analysing Data-To-Text Generation Benchmarks

Claire Gardent; Laura Perez-Beltrachini

arxiv: 1705.03802 · v1 · pith:EG66GJLUnew · submitted 2017-05-10 · 💻 cs.CL

Analysing Data-To-Text Generation Benchmarks

Laura Perez-Beltrachini , Claire Gardent This is my paper

classification 💻 cs.CL

keywords data-to-textdata-setssurfacerealisersdevelopmentevaluationlinguisticallyanalysing

0 comments

read the original abstract

Recently, several data-sets associating data to text have been created to train data-to-text surface realisers. It is unclear however to what extent the surface realisation task exercised by these data-sets is linguistically challenging. Do these data-sets provide enough variety to encourage the development of generic, high-quality data-to-text surface realisers ? In this paper, we argue that these data-sets have important drawbacks. We back up our claim using statistics, metrics and manual evaluation. We conclude by eliciting a set of criteria for the creation of a data-to-text benchmark which could help better support the development, evaluation and comparison of linguistically sophisticated data-to-text surface realisers.

This paper has not been read by Pith yet.

Analysing Data-To-Text Generation Benchmarks

discussion (0)