pith. sign in

arxiv: 1602.01323 · v1 · pith:EP6HRBBJnew · submitted 2016-02-03 · 💻 cs.LG

Biclustering Readings and Manuscripts via Non-negative Matrix Factorization, with Application to the Text of Jude

classification 💻 cs.LG
keywords manuscriptsreadingscontaminationfactorizationfamiliesjudematrixnon-negative
0
0 comments X
read the original abstract

The text-critical practice of grouping witnesses into families or texttypes often faces two obstacles: Contamination in the manuscript tradition, and co-dependence in identifying characteristic readings and manuscripts. We introduce non-negative matrix factorization (NMF) as a simple, unsupervised, and efficient way to cluster large numbers of manuscripts and readings simultaneously while summarizing contamination using an easy-to-interpret mixture model. We apply this method to an extensive collation of the New Testament epistle of Jude and show that the resulting clusters correspond to human-identified textual families from existing research.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.