
arxiv: 1808.07214 · v2 · pith:DG3YRKBWnew · submitted 2018-08-22 · 💻 cs.CL

A Characterwise Windowed Approach to Hebrew Morphological Segmentation

classification 💻 cs.CL
keywords approach, hebrew, accuracy, analysis, morphological, segmentation, task, achieves

This paper presents a novel approach to the segmentation of orthographic word forms in contemporary Hebrew, focusing purely on splitting without carrying out morphological analysis or disambiguation. Casting the analysis task as character-wise binary classification and using adjacent-character and word-based lexicon-lookup features, this approach achieves over 98% accuracy on the benchmark SPMRL shared task data for Hebrew and 97% accuracy on a new out-of-domain Wikipedia dataset, improvements of ~4% and 5%, respectively, over previous state-of-the-art performance.
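
To make the framing concrete, the following is a minimal sketch of how segmentation can be cast as character-wise binary classification: each character receives label 1 if a segment boundary follows it and 0 otherwise, and its features come from a window of adjacent characters plus lexicon lookups. This is an illustration under stated assumptions, not the paper's implementation. The function names, the window size, the toy lexicon, and the transliterated example word `bbyt` (split as `b` + `byt`) are all hypothetical, and the exact feature set used in the paper is not given in the abstract.

```python
from typing import Dict, List, Set

PAD = "_"  # assumed padding symbol for window positions outside the word


def char_window_features(word: str, i: int, lexicon: Set[str],
                         window: int = 2) -> Dict[str, object]:
    """Features for position i: nearby characters plus lexicon lookups."""
    feats: Dict[str, object] = {}
    # Adjacent-character features: the character at each offset in the window.
    for off in range(-window, window + 1):
        j = i + off
        feats[f"char[{off:+d}]"] = word[j] if 0 <= j < len(word) else PAD
    # Lexicon-lookup features (hypothetical choice): is the substring up to
    # and including i, or the remainder after i, a known lexicon entry?
    feats["prefix_in_lex"] = word[: i + 1] in lexicon
    feats["suffix_in_lex"] = word[i + 1:] in lexicon
    return feats


def boundary_labels(segments: List[str]) -> List[int]:
    """Gold labels: 1 on the last character of every non-final segment."""
    labels: List[int] = []
    for k, seg in enumerate(segments):  # segments are assumed non-empty
        labels.extend([0] * (len(seg) - 1))
        labels.append(1 if k < len(segments) - 1 else 0)
    return labels


def apply_labels(word: str, labels: List[int]) -> List[str]:
    """Turn per-character boundary predictions back into segments."""
    out: List[str] = []
    start = 0
    for i, y in enumerate(labels):
        if y == 1 and i < len(word) - 1:  # ignore a boundary after the last char
            out.append(word[start: i + 1])
            start = i + 1
    out.append(word[start:])
    return out


toy_lexicon = {"b", "byt"}           # hypothetical lexicon entries
gold = boundary_labels(["b", "byt"])  # labels for the word "bbyt"
first = char_window_features("bbyt", 0, toy_lexicon)
```

Feature dictionaries of this shape could be passed to any standard binary classifier, for example after vectorizing them with scikit-learn's `DictVectorizer`. Which classifier and features work best is an empirical question that this sketch does not address.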
