Language discrimination and clustering via a neural network approach
classification
❄️ cond-mat.dis-nn
cs.CLcs.NEphysics.soc-ph
keywords
languagesdendrogramneuralaccordinganalyzeapproachclassifyclustering
read the original abstract
We classify twenty-one Indo-European languages starting from written text. We use neural networks in order to define a distance among different languages, construct a dendrogram and analyze the ultrametric structure that emerges. Four or five subgroups of languages are identified, according to the "cut" of the dendrogram, drawn with an entropic criterion. The results and the method are discussed.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.