
arxiv: 1901.07327 · v2 · pith:QHCFPMBJnew · submitted 2019-01-22 · 🧬 q-bio.GN

Identifying centromeric satellites with dna-brnn

classification 🧬 q-bio.GN
keywords: dna-brnn, centromeric, classes, human, identifying, satellite, sequences, accelerate

Summary: Human alpha satellite and satellite 2/3 account for several percent of the human genome. However, identifying these sequences with traditional algorithms is computationally intensive. Here we develop dna-brnn, a recurrent neural network that learns the sequences of these two classes of centromeric repeats. It achieves high similarity to RepeatMasker and is times faster. Dna-brnn explores a novel application of deep learning and may accelerate the study of the evolution of the two repeat classes.

Availability and implementation: https://github.com/lh3/dna-nn

Contact: hli@jimmy.harvard.edu
