arxiv: 1804.05095 · v1 · pith:2HCWDIRWnew · submitted 2018-04-13 · 💻 cs.CL

Automatic Language Identification System for Hindi and Magahi

classification 💻 cs.CL
keywords: identification, language, accuracy, automatic, hindi, languages, magahi, system

Language identification has become a prerequisite for all kinds of automated text processing systems. In this paper, we present a rule-based language identification tool for two closely related Indo-Aryan languages, Hindi and Magahi. The system currently achieves an accuracy of approximately 86.34%, which we hope to improve in future work. Automatic language identification will also contribute significantly to the accuracy of web crawler output.
