pith. sign in

arxiv: 1703.03923 · v1 · pith:IIOSYNPOnew · submitted 2017-03-11 · 💻 cs.IR · cs.CL

A German Corpus for Text Similarity Detection Tasks

classification 💻 cs.IR cs.CL
keywords similaritycorpusdetectiontextassessdocumentsevaluategerman
0
0 comments X
read the original abstract

Text similarity detection aims at measuring the degree of similarity between a pair of texts. Corpora available for text similarity detection are designed to evaluate the algorithms to assess the paraphrase level among documents. In this paper we present a textual German corpus for similarity detection. The purpose of this corpus is to automatically assess the similarity between a pair of texts and to evaluate different similarity measures, both for whole documents or for individual sentences. Therefore we have calculated several simple measures on our corpus based on a library of similarity functions.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.