pith. sign in

arxiv: 1612.04113 · v1 · pith:XNYX6DAZnew · submitted 2016-12-13 · 💻 cs.CL

Vicinity-Driven Paragraph and Sentence Alignment for Comparable Corpora

classification 💻 cs.CL
keywords alignmentsentencealgorithmscomparablecorporaparagraphvicinity-drivenaddress
0
0 comments X
read the original abstract

Parallel corpora have driven great progress in the field of Text Simplification. However, most sentence alignment algorithms either offer a limited range of alignment types supported, or simply ignore valuable clues present in comparable documents. We address this problem by introducing a new set of flexible vicinity-driven paragraph and sentence alignment algorithms that 1-N, N-1, N-N and long distance null alignments without the need for hard-to-replicate supervised models.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.